Integrity of Engineering Components


Foreword 


Engineering components are subjected to varieties of stress as well as environmental 
conditions while in service. This introduces, with passage of time, a process of 
degeneration of the components and therefore may not permit their further use. Such 
components may be part of a power plant, off-shore structure, refinery, aircraft or 
automobile, rolling mill or even of a large machine with weldings, and so on. The 
designer, by anticipating some of the service conditions, may be able to prescribe the 
component for a stipulated period of service. In real-life, however, a number of 
imponderables offset the calculations thereby shortening the expected design-life. 
Furthermore, a design, by convention, uses a factor-of-safety term that indeed may be 
making a component less competitive techno-economically and be providing a “life 
expectation” that is too conservative. 

Retiring a component either ‘too early’ or ‘too late’ carries a penalty which may
be quite high. For example, retiring a costly power plant component too early
would be uneconomical, while not replacing it at the appropriate time might
result in sudden failures and outages which would eventually not only be
uneconomical but also lead to casualties and litigation in some cases. Therefore, an
evaluation of integrity of components is considered very useful and pragmatic 
in the present highly competitive industrial scenario with emphasis on quality and 
safety. The case for integrity evaluation is further strengthened when consideration 
is also given to an extension of ‘useful life’ leading to cost effectiveness. The latter 
is only possible when the health and integrity of critical components are accurately 
established. 

The above issues assume great importance in view of the fact that a large proportion 
of power, petroleum and chemical plants, the world over, have been in operation for 
very long. Critical components in many of them have indeed outlived their design lives 
and decisions regarding their continuance are overdue. This is particularly true in 
India, where prohibitive costs of replacement, unwillingness on the part of owners to 
venture into substitution or retrofitting, and a lethargy of attitude, have all contributed 
to delays. An enhanced level of information and knowledge on the integrity evaluation 
techniques, if properly disseminated, will bring about a change in the prevailing 
climate. 

Significantly, the theme of integrity evaluation combines both a high level of
theoretical analysis and field application. There are, for example, several techniques
of NDT (viz. magneto-acoustic emission, ultrasonics using attenuation and resonance,
special eddy current testing etc.) that are strongly based on high science/technology
and have, in recent times, been very useful to the shop floor operator as well.
Simultaneously, physics-based understanding of the behaviour of materials under specific
service conditions (i.e. creep, fatigue, their interaction and so on) has helped 
analyse the behaviour of plant materials and components. This has extended the 
horizon of candidate materials while contributing to improved plant performance. 

Taking all these into account, it was felt by the editors of Sadhana that a special issue
on the integrity aspects of industrial components would be very timely. A number of
experts, drawn from various specialised disciplines from India and abroad, have
contributed articles to this issue. It will perhaps be appreciated that the area of
component integrity is so wide in its scope that it could easily fill several thick
volumes. What has, however, been attempted within the confines of one issue of
Sadhana is to provide an insight into the various broad generic areas and address
important issues relating to component integrity. 

At the very root of integrity evaluation of any component lies an estimate of the
nature and quantum of damage accumulated either in service or during
manufacture/assembly. More often than not, one is not allowed to perform “destructive”
tests in many of these situations. Therefore, the first article deals with various
non-destructive methods used in component integrity evaluation, particularly
emphasizing the newer but established techniques.

Any industrial component would normally experience some ‘applied stress’ in 
operation - the nature could be monotonic or cyclic. However, presence of any residual 
stress in the component, along with the applied one, would very much affect its real-life 
behaviour and performance. Hence, the second and third articles discuss the varieties of 
residual stresses - their origin, nature and evaluation techniques. The environment 
under which a component works, primarily affects its performance, health and thus the 
integrity. A number of articles, covering different aspects of stress corrosion and 
hydrogen embrittlement, as well as the modelling of various environment conditions 
have also been included. 

Any modern industrial operation would involve joining of parts through welding. 
This is a very broad area and in its own right has the potential to cover a full issue of any 
standard journal. However, the two articles on welding primarily deal with weld joint 
failures and application of fracture toughness concepts to life prediction. 

A large number of power plant components, in particular, operate at high
(400 °C) temperatures and simultaneously under pressure. This would place them
under ‘creep’ conditions and without a knowledge-base of the creep behaviour of 
metallic materials it would not be possible even to design, let alone establish the 
integrity of these components (e.g. boiler tubes, headers, economisers, super heaters 
and so on). A set of papers on creep deal with various general aspects of creep 
life prediction as well as with the emerging concepts of creep crack growth. 

Components in a variety of machines, automobiles and aircraft experience fluctuating
stress conditions during operation, and therefore demand consideration of fatigue
for design and for damage estimation in service. Again, superimposed with temperature,
the connotations of fatigue will change. Hence, the articles on fatigue address both
high- and low-temperature situations; further they also dwell on low-cycle fatigue, 
which is experienced by line-pipes and components in a power plant apart from other 
industrial components. 

Finally, the section on applications deals with several specific situations and
components. The basic and generic concepts and their utility in engineering
components are discussed here through a number of articles.
Non-destructive testing and evaluation for structural integrity

BALDEV RAJ, T JAYAKUMAR and B P C RAO 

Division for PIE & NDT Development, Indira Gandhi Centre for Atomic 
Research, Kalpakkam 603 102, India 

Abstract. Reliable performance of a component or structure depends on 
pre-service quality of the component and in-service degradation of the 
component under operating conditions. The role of non-destructive evaluation
(NDE) in ensuring pre-service quality and also monitoring in-service
degradation to avoid premature failure of the components/structures is
ever increasing. There are many NDE techniques based on various physical
principles. The end objective of NDE is detection and characterisation of
anomalies such as defects, stresses and microstructural degradations in
materials. This is accomplished by establishing correlation between a
non-destructively measured physical/derived parameter and quantitative
information on defects/stresses/microstructures. The NDE information,
together with design parameters, is taken into consideration for evaluation
of integrity and life assessment of the components/structures. In this paper,
a brief description of the physical concepts of NDE methods and the 
physical/derived parameters that are used for assessing defects, stresses and 
microstructures are given. A few case studies highlighting the importance of 
non-destructive testing and evaluation for structural integrity assessment 
are also discussed based on the investigations carried out at the authors’ 
laboratory. Emerging concepts like intelligent processing of materials, 
expert systems, neural networks, use of multisensors with fusion of data and 
exploitation of signal analysis and imaging approaches are also addressed 
in this paper. 

Keywords. Non-destructive evaluation; structural integrity; life assessment;
defects; stresses; microstructures.

1. Introduction 

Assessment of integrity of engineering components and structures is becoming increasingly
important for both economic and safety reasons. It is now widely accepted that all
components and structures possess “defects” from the start of their service life (this 
forms an important and initial assumption in fracture mechanics) and that defect 
assessment is the way to control structural integrity reliably. Defects are those 
anomalies which when not removed or repaired might lead to ultimate loss of 
structural integrity of the material or component. Such defects include surface and 
sub-surface cracks, inclusions, pores, incompletely joined regions in welds or other
joints and residual stresses, macro- or microstructural degradations etc. in metallic 
materials and delaminated or debonded regions, fibre fracture, matrix crazing etc. in 
composites. While surface defects are of immediate concern, due consideration is also 
to be given to sub-surface defects and other anomalies. In either case, it is important to 
understand the factors that govern the likely growth of defects/anomalies or cracks 
originating from them. In particular the size of the defect, its nature, its location, the 
stress to which it is subjected and the local properties of the material in which it is 
embedded, all play a major role in determining its rate of growth (Silk & Whaphan 
1989). According to fracture mechanics, defects present in materials lead to failure by 
growing to a critical, self-propagating size. The fracture mechanics concepts allow one 
to calculate the critical sizes of defects as a function of their depth, length, active stress 
system and stress intensity and such properties of the material as its elastic modulus, 
yield strength and fracture toughness (David 1981). On the other hand, by knowing the 
dimensions of defects present in a component, it is possible to estimate both remaining 
life of the component and extent of degradation. 
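
The critical-size calculation described above can be illustrated with a minimal sketch. It assumes the standard linear-elastic relation K = Y σ √(πa) between stress intensity, applied stress and crack size; the geometry factor Y, the function names and all numerical values are illustrative assumptions and are not taken from this paper.

```python
import math


def stress_intensity(stress_mpa, crack_size_m, geometry_factor=1.0):
    """Stress intensity K = Y * sigma * sqrt(pi * a), in MPa*sqrt(m)."""
    if crack_size_m < 0:
        raise ValueError("crack size must be non-negative")
    return geometry_factor * stress_mpa * math.sqrt(math.pi * crack_size_m)


def critical_crack_size(k_ic, stress_mpa, geometry_factor=1.0):
    """Crack size a_c (in m) at which K reaches the fracture toughness K_IC."""
    if k_ic <= 0 or stress_mpa <= 0 or geometry_factor <= 0:
        raise ValueError("inputs must be positive")
    return (k_ic / (geometry_factor * stress_mpa)) ** 2 / math.pi


# Hypothetical inputs: K_IC = 50 MPa*sqrt(m), stress = 200 MPa, Y = 1.12
a_c = critical_crack_size(50.0, 200.0, 1.12)
print(f"critical crack size: {a_c * 1000:.1f} mm")
```

Under these assumptions, a defect sized by NDE can be compared directly with `a_c`; doubling the stress reduces the critical size by a factor of four, which is why the active stress system has to be known alongside the defect dimensions.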

In the pre-service scenario, defects may be present at the raw material stage or might 
have been introduced during machining, fabrication, heat treatment and assembling. 
Similarly, defects may be generated due to deterioration of the material/component/ 
structure as a result of one or a combination of operating conditions like temperature, 
stress, chemical environment and irradiation leading to mechanisms such as creep, 
fatigue, stress corrosion, embrittlement etc. These mechanisms result in deterioration of 
mechanical properties, crack initiation and propagation, leaks in pressurised components
and catastrophic failures. Hence, reliable performance of a component/structure
depends on the pre-service quality and in-service degradation of the component under 
operating conditions. The pre-service quality can be achieved by good engineering 
practices i.e. by way of selecting suitable quality raw materials and ensuring that 
harmful defects are not produced during the subsequent stages of fabrication and 
assembly. In-service degradation of the component can be assessed by quantitative 
characterisation of anomalies such as defects, stresses and microstructural degradations.
Their early detection and assessment is essential to avoid damage and loss of
structural integrity of components. In this regard, non-destructive evaluation (NDE) 
plays an important role for the reliable assessment of the anomalies by performing 
inspections at various stages. To give an example, the acoustic emission technique is 
used for detection of initiation or propagation of any microcracks, and also any fluid
leakage at inaccessible locations, by picking up signals from accessible locations. Using 
this technique, on-line monitoring of critical components and structures can give 
advance warning before occurrence of an accident or catastrophe due to material 
degradation under service conditions. 

There are many non-destructive test (NDT) techniques based on various physical 
principles. The interaction of a medium (like electromagnetic radiation, sound waves)
with various types of anomalies present in a material under study is used to
quantitatively assess structural integrity. NDE procedures involve establishing correlations
between non-destructively measured physical/derived parameters and quantitative
information about anomalies. To give an example, eddy-current NDT involves
measurement of change in impedance and determination of its correlation with anomalies
such as change in electrical conductivity or microstructure, or the presence of a crack.
These correlations, together with design parameters and inputs from fracture mechanics,
are taken into consideration for integrity and life assessment of the component. It
should be emphasised that anomaly assessment information by NDE techniques is 
more meaningful when applied together with expertise from other related disciplines 
such as metallurgy, corrosion, fracture mechanics, stress analysis, design and operation.
This approach enables evaluation of the degradation vis-a-vis performance and
life prediction of components. Application of computers and robotics during inspection 
further enhances the sensitivity for defect detection. Though engineering structures are 
fabricated from metallic substances, as well as ceramics and composites, the discussions 
in this paper essentially pertain to integrity monitoring of structures of metallic origin. 

There has been a continuous and ever-increasing demand from the industry for the 
NDE professionals in developing new techniques and procedures to achieve the 
“minimum defect component with maximum confidence and reliability” goal. There 
has been a tremendous effort put in worldwide to meet this challenge and encouraging 
results have been obtained. For example, due to fracture mechanics considerations, it is 
required to detect defects smaller than 100 microns in structures, made of engineering 
ceramics. This requirement calls for development, implementation and application of 
high resolution and high sensitivity techniques. High frequency ultrasonics and 
microtomography are the two recently evolved techniques to meet the above demands. 
At the same time, industry is more conscious of utilising the benefits of NDE towards 
better performance and extension of life of various components/structures. 

The knowledge of microstructural variations in a component is also very important 
to ensure the desired performance. Traditionally, microstructures are investigated by 
microscopic methods which are destructive in nature. At the authors’ laboratory 
pioneering work has been done to adopt NDE techniques for the investigation of 
microstructural degradations. These techniques are useful for damage assessment, life 
estimation and extension applications of ageing plants and components, in particular, 
for continued service of components subjected to unacceptable service conditions 
during system malfunction or under accidental conditions. 

In addition to the information about defects and microstructures, knowledge of 
residual stresses in a component is also essential for reliable assessment of structural 
integrity. Residual stresses are the system of stresses which exist in a material or 
component when it is free from external loads. The maximum value to which the 
residual stress can reach in a material is its elastic limit. Manufacturing processes are 
the most common causes of residual stress. In some instances, residual stress may also 
be induced later in the life of the structure during installation, or assembly or operation. 
Another common cause of residual stress is in-service repair or modification. Among 
the three types of residual stresses viz. macroscopic, microscopic and sub-microscopic, 
macroscopic residual stresses are important. They are generally caused by
inhomogeneous plastic strains in a structure. During welding, plastic deformation is
introduced mainly by thermal stresses due to large temperature gradients present at 
various stages of the welding process. Most of the engineering applications and fracture 
mechanics calculations require the knowledge of residual stresses of this type only. 
Again in this type, tensile residual stresses are generally detrimental, increasing the 
susceptibility of a component to failures due to such causes as fatigue and stress 
corrosion. Compressive residual stresses are usually beneficial, tending to reduce the 
above susceptibilities. Thus residual compressive stresses are often deliberately created 
on the surface. The role of NDE in detecting and assessing residual stresses is ever 
increasing and today well established techniques/procedures exist in this regard. 
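
The effect of the sign of the residual stress can be sketched numerically. The fragment below assumes simple linear superposition of applied and residual stress in the relation K = Y σ √(πa); the function name, the flooring of K at zero for a net compressive stress and the input values are assumptions made for illustration only.

```python
import math


def effective_stress_intensity(applied_mpa, residual_mpa, crack_size_m,
                               geometry_factor=1.0):
    """K for the superposed stress (applied + residual), in MPa*sqrt(m).

    A net compressive stress is treated as closing the crack, so K is
    floored at zero (a simplifying assumption of this sketch).
    """
    net_stress = applied_mpa + residual_mpa
    k = geometry_factor * net_stress * math.sqrt(math.pi * crack_size_m)
    return max(k, 0.0)


# Hypothetical case: 100 MPa applied stress acting on a 2 mm crack
for residual in (50.0, 0.0, -50.0):
    k = effective_stress_intensity(100.0, residual, 0.002)
    print(f"residual {residual:+.0f} MPa -> K = {k:.2f} MPa*sqrt(m)")
```

In this idealisation a tensile residual stress raises the crack driving force and a compressive one lowers it, which is consistent with the practice of deliberately introducing compressive stresses at the surface.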
In this paper, a brief description of the physical concepts of various NDT methods
and the physical/derived parameters which are used for assessing the above mentioned
anomalies, namely defects/stresses/microstructure, are given. A few case studies
highlighting the importance of non-destructive testing and evaluation for structural
integrity assessment are discussed. In many cases, the investigations carried out at the 
authors’ laboratory are given as examples. This paper also gives an account of the 
emerging concepts like expert systems, imaging, robotics, intelligent processing of 
materials, fusion of NDT data and signal analysis approaches that are responsible for 
enhanced reliability in NDE. Finally, the paper also brings out various research and 
development activities carried out at the authors’ laboratory leading to new techniques 
and procedures. 


2. NDE techniques for structural integrity monitoring 

The main aim of NDE is to detect and characterise anomalies that can adversely affect 
the performance of the component under test without impairing its intended service. 
NDE implies complete responsibility to assess the seriousness of a situation realistically
in order to enable the management to make the right decisions. Thus, NDE is
testing, evaluation and responsibility. This almost always implies something more than 
the mere detection of defects. It is now widely accepted that locations where residual 
stresses and microstructural degradations exist are potential sites for the origin of 
defects. Hence, NDE involves detection and characterisation of defects, stresses and 
microstructural degradations. Their early detection is essential to avoid damage and 
loss of structural integrity of components. In this regard, the role of NDE is ever 
increasing to ensure reliability of performance in both pre-service and in-service stages. 

Usually, to properly assess defect severity, one needs to know the type of defect, its 
location, approximate size, shape and orientation. NDT techniques available for the
assessment of integrity of structures/components prior to, during and after their 
installation, are many. Some of the widely used techniques include visual examination, 
liquid penetrant testing, leak testing, vibration monitoring, magnetic particle inspection,
ultrasonic testing, eddy current testing, gamma and X-radiography, acoustic
emission, tomography, magnetic flux leakage method, laser holography and
interferometry, infrared thermography etc. Broadly, these techniques can be classified into
two categories, depending on the nature of inspection capability, viz. static and 
dynamic techniques. While static techniques can be used to detect cracks in materials 
off-line, dynamic techniques are used for on-line monitoring. Two widely used dynamic 
NDT techniques are acoustic emission and infrared thermography. All other techniques
mentioned above come under the category of static techniques.

Since the interrogating field and the interactions are different, the minimum detectable
defect sizes are different in different NDT techniques and mainly depend on the
instrumentation. Consistency and repeatability of measured parameters have to be 
ensured. While selecting an NDT technique for a specific application, not only the 
minimum detectable defect size, but also the probability and the confidence level of 
detection are to be considered. Most NDT techniques have a wide range of applications 
and capabilities and comparison of data is valid only for a particular application, 
a specific type of defect and a particular material. For example, for detection of fatigue 
cracks in light-alloy panels, it is not surprising that eddy current and penetrant testing 
are superior to ultrasonic testing and radiography. Hence, choice of the NDT technique
depends on factors such as applicability, accessibility and suitability based on
analysis and past experience. It may sometimes be necessary to use a combination of
two or more techniques, in the best complementary way, to carry out complete and
reliable inspection. Information about material composition, properties, microstructure,
fabrication procedure and environment is essential to identify the potential sites
for origin of defects, their nature, probable size and orientation. All the above 
mentioned details would aid in selecting a suitable combination of techniques. In this 
regard, detailed mock-up studies are essential prior to actual inspection, to optimise 
equipment parameters, design and selection of sensors, selection of standard defects, 
operating conditions and procedure for recording and evaluation of NDT data to 
arrive at the desired sensitivity and reliability. Reliability is also ensured by following 
procedures as per codes and standards. A more recent concept in this direction is fusion 
of NDT data, which attempts to create a global picture of the fitness for purpose of the 
inspected zones. It involves combining data from more than one NDT method and 
displaying of a 3-D visual of the anomaly. In other words, if the “shadow” information 
from a radiograph is combined with depth information from an ultrasonic shear wave 
probe and surface breaking information from magnetic particle inspection, then the 
NDE personnel can provide a comprehensive description of the defect. This information
can be presented and manipulated in a 3-D graphical format, making it much
easier for a non-specialist to interpret the results of inspection in a reliable manner
(Edwards et al 1993).
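
The requirement, noted earlier in this section, that both the probability and the confidence level of detection be considered can be made concrete with a short sketch. It assumes the simplest demonstration scenario, in which every one of n independent flaw trials results in a detection, so that the one-sided lower bound on the probability of detection (POD) at confidence C satisfies POD^n = 1 - C. The function name and the trial counts are illustrative assumptions.

```python
def demonstrated_pod(n_trials, confidence=0.95):
    """Lower bound on POD when all n_trials flaws were detected (no misses).

    If the true POD were p, the chance of n hits in n trials is p**n, so the
    bound p_low at the given confidence solves p_low**n = 1 - confidence.
    """
    if n_trials < 1:
        raise ValueError("at least one trial is required")
    if not 0.0 < confidence < 1.0:
        raise ValueError("confidence must lie strictly between 0 and 1")
    return (1.0 - confidence) ** (1.0 / n_trials)


# Hypothetical trial counts for a given defect type, material and technique
for n in (10, 29, 60):
    print(f"{n} hits out of {n}: POD >= {demonstrated_pod(n):.3f} at 95% confidence")
```

Under these assumptions, a small number of successful mock-up trials supports only a modest POD claim, which is one reason detailed mock-up studies on representative defects are needed before the sensitivity of a technique is quoted.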

It is now well established from fracture mechanics considerations that planar defects 
(cracks, lack of fusion etc.) are more serious than volumetric defects (porosity, gas holes, 
round inclusions etc.) from the loss of strength point of view. It is now clear that the depth
or height of defect is more detrimental than its length and also surface breaking defects 
are more harmful than totally internal defects. A consequence of these findings is the 
changes in the acceptance levels for different types of defects. 

For pre-service inspection, a definite requirement exists for more accurate NDT 
techniques which could be applied with the aim of producing a structure with no flaws 
greater than a predetermined size which can be as small as the defect detectable by 
NDT methods. Advancements in the fields of electronics and computer sciences, 
development of new methodologies, better understanding of interaction between the 
interrogating media and materials and new challenging requirements are pushing the 
NDE technology to achieve significant progress. A major spin-off in the recent past in 
this direction is “intelligent processing of materials”. This concept ensures 100% 
acceptability of raw materials and fabricated components by providing feedback to the 
manufacturing/fabrication processes about the presence/occurrence of defects on-line 
thus allowing for immediate corrective action (Baldev Raj 1994b). 

Imaging techniques show promise towards enhanced detection and characterisation
of defects in components. It has often been said that an NDE image is worth
a thousand NDE signals. NDE personnel feel more comfortable in interpreting images
than analysing signals. Imaging can be done with many different media, including 
optical, infrared, X-ray, gamma-ray, ultrasound, eddy current, thermal wave, magnetic 
resonance etc. Thus, there is a broad spectrum of techniques from which selection can 
be made. Infrared imaging, ultrasonic C-scan imaging, scanning acoustic microscopy, 
scanning laser acoustic microscopy, computer aided tomography and eddy current 
imaging are some of the techniques currently being used worldwide for wide-ranging 
applications. The imaging format provides a global perspective of the inspected region
and allows a balanced interpretation. Also, imaging techniques have the potential of 
automating the measurement process, providing estimates of defect size from the image 
data, producing accurate characterisation of cracks and improving the probability of 
detection (Baldev Raj 1993a). 

The latest state of the art in NDE is the development of expert systems. Expert 
systems are interactive computer programs which attempt to simulate the expert’s 
thought processes and provide a wide range of advice. An unusual feature of these 
systems in comparison with more traditional algorithmic programs is the ability to 
handle heuristics, or rules-of-thumb. This feature makes the expert system an attractive 
tool for evaluation of NDT data and failure analysis. The use of robotic devices for 
reliable NDE of components is assuming great importance in many applications like 
the nuclear, space and chemical industries especially when there is a need for high 
probability of defect detection, reduction in radiation exposure to NDE personnel and 
limiting the operator dependency. Manipulators based on robotic principles have 
come into existence and the demand for manipulators with computerised data acquisition,
processing and evaluation is growing faster for pre-service and in-service inspections
(Baldev Raj 1992).
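
The handling of rules-of-thumb by an expert system can be sketched as a small rule table. The fragment below is only an illustration and not any particular NDE expert system: the rules loosely encode the fracture mechanics observations made earlier in this section (planar defects, surface-breaking defects and defect height being the more serious features), while the class name, the 2 mm height limit and the numerical weights are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Indication:
    kind: str               # "planar" (crack, lack of fusion) or "volumetric"
    surface_breaking: bool
    height_mm: float        # through-thickness extent of the defect


# Each rule is (description, predicate, weight); the weights are hypothetical.
RULES = [
    ("planar defect", lambda d: d.kind == "planar", 3),
    ("surface breaking", lambda d: d.surface_breaking, 2),
    ("height above 2 mm", lambda d: d.height_mm > 2.0, 1),
]


def evaluate(indication):
    """Return (total score, descriptions of the rules that fired)."""
    fired = [(desc, weight) for desc, pred, weight in RULES if pred(indication)]
    return sum(w for _, w in fired), [d for d, _ in fired]


crack = Indication(kind="planar", surface_breaking=True, height_mm=3.0)
pore = Indication(kind="volumetric", surface_breaking=False, height_mm=1.0)
print(evaluate(crack))
print(evaluate(pore))
```

Keeping the rules as data rather than as fixed program logic is what allows such a system to report which heuristics led to a given piece of advice, and to be extended as further experience is gathered.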

In the following sections, the principles, capabilities and applications of some of the 
widely used NDE techniques for the assessment of structural integrity are discussed. 
Special emphasis has been laid on discussing the physical principles of NDT techniques 
and the interaction of the interrogating medium/field with the anomalies that affect 
structural integrity. Besides, an attempt has been made to briefly discuss the recent
trends in various techniques. 

2.1 Visual inspection 

Visual inspection (VI) constitutes an important and widely used NDE method for 
structural integrity evaluation. VI is a very valuable method for the evaluation of 
discontinuities open to the surface. However, it is not suitable for detecting subsurface 
and interior discontinuities. VI is a relatively simple and effective technique that,
unlike other NDT techniques, is complete without detailed analysis of data/signals/patterns.
In VI, an NDE inspector arrives at an immediate decision solely on what he observes 
during inspection. Capabilities of VI can be enhanced considerably by using simple 
aids such as magnifying glass, microscopes etc. for viewing, dimensional measurements 
etc. The advances in the area of optics have had far-reaching consequences in the field of 
visual inspection. Industrial telescopes, usually known as borescopes or introscopes 
enable inspection of inaccessible surfaces. They are best known for in-situ examination 
of inner surfaces of tubes/piping and also for the inspection of surfaces of turbine blades 
in aero-engines for the detection of cracks and corrosion without needing dismantling 
(Halmshaw 1989). The advent of fibroscopes with rotating heads has increased the
versatility of visual inspection. With significant advances in display, storage and 
handling of images, it is now possible to record the information permanently and to 
carry out detailed investigations referring to the records of prior inspections to 
determine whether there has been crack growth, formation of additional cracks, or 
other progressive changes. The use of image processing techniques greatly enhances the
sensitivity of defect detection and helps in obtaining reliable information about their
dimensional measurements.

2.2 Liquid penetrant inspection

Liquid penetrant inspection (LPI) is another widely used NDE method for the 
detection of discontinuities open to the surface in any material/component which has 
a non-absorbent surface, irrespective of the component geometry and its material
properties. LPI utilises the natural accumulation of a fluid, called penetrant, around
a discontinuity to create a recognisable indication of a crack or other surface opening.
A typical example to understand the principle involved in LPI is the ready visibility of
cracks in concrete structures after a rain. LPI is directly concerned with the behaviour
of fluids on surfaces (Halmshaw 1989). Surface tension and capillarity are the two areas of
fluid mechanics relevant to LPI. Surface tension is the property that affects the flow of 
the fluid on the surface to be inspected. Capillary forces drive the penetrant along the 
material surface into the surface openings and, finally out of them and into the 
developer. In brief, in LPI a liquid is applied to the surface of the component for 
a certain predetermined time (dwell time) during which the penetrant seeps through the 
surface discontinuity. After this, the excess penetrant is removed from the surface. The 
surface is then dried and a developer is applied to it. The penetrant that remains in the 
discontinuity is absorbed by the developer indicating the presence as well as the 
location, size and nature of the discontinuity. Optical recognition exploiting the full 
range of light spectrum from visible to ultraviolet light, is used for the defect detection
in LPI. 
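
The capillary driving force referred to above can be estimated with a short sketch. It idealises a crack as a gap between two parallel walls, for which the capillary pressure is 2γ cos θ / w (γ: surface tension, θ: contact angle, w: crack width). The parallel-wall idealisation, the function name and the property values are assumptions for illustration and do not describe any specific penetrant.

```python
import math


def capillary_pressure(surface_tension_n_per_m, contact_angle_deg, crack_width_m):
    """Capillary pressure (Pa) drawing a liquid into a parallel-walled gap."""
    if crack_width_m <= 0:
        raise ValueError("crack width must be positive")
    theta = math.radians(contact_angle_deg)
    return 2.0 * surface_tension_n_per_m * math.cos(theta) / crack_width_m


# Hypothetical penetrant: surface tension 0.03 N/m, contact angle 10 degrees
for width_um in (10.0, 1.0, 0.5):
    p = capillary_pressure(0.03, 10.0, width_um * 1e-6)
    print(f"crack width {width_um} micron: capillary pressure {p / 1000:.1f} kPa")
```

In this idealisation the pressure rises as the crack becomes narrower and as the contact angle becomes smaller, which is consistent with the need for good wetting of the surface by the penetrant; a contact angle above 90 degrees gives a negative value, i.e. the liquid is not drawn in.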

The LPI processes are typically classified by the indicating method i.e. visible or 
fluorescent dye and the method used to remove the excess penetrant from the 
component surface. The three widely adopted LPI systems are water-washable, 
post-emulsifiable and solvent removable. Fluorescent dye penetrants have high sensitivity
and are particularly useful for detecting cracks of small widths (down to 0.5
microns). The increased sensitivity is due to the absence of any visible background 
(either white or coloured) layer covering the area under inspection which would 
otherwise absorb a part of the incident light. LPI technique is very useful for detection 
of fatigue cracks, stress corrosion cracks and other service-induced defects open to the 
surface. The absence of a volumetric interrogating medium in LPI restricts the 
technique to detect variations/abnormalities open to the surface. Microstructure and 
stresses in the components cannot be assessed by LPI. 

2.3 Magnetic methods 

Magnetic methods of NDE are among the oldest and most pervasive because of the 
dominant production and use of steel and related ferromagnetic materials in the 
industry. When a ferromagnetic component is magnetised, the magnetic lines of force 
(the magnetic flux) are predominantly inside the material. However, when there is 
a surface-breaking or sub-surface discontinuity, the field is distorted, causing local 
magnetic flux leakage fields. Effectively, the discontinuity causes a sudden localised 
change in magnetic permeability. These leakage fields are detected and recorded 
either by sprinkling finely divided ferromagnetic powders or by using sensors, 
and the information on the location, shape and extent of the discontinuity producing 
the leakage field is obtained (Mix 1987). On the basis of recording and evaluation of the 
leakage fields, the magnetic methods can be classified into magnetic particle inspection 
(MPI), magnetic leakage flux (MLF) and magnetic Barkhausen noise (MBN) 
techniques. Because of the added advantage of these techniques over LPI in detecting 
subsurface defects, it is logical to specify magnetic methods for the structural integrity 
evaluation of all ferromagnetic materials. After the inspection by magnetic methods, it 
is necessary to subject the component to demagnetisation, i.e. to reduce the flux 
density level in the component to a level at which its external field does not cause 
problems in service such as (a) wear due to pick-up of ferromagnetic parts in the case of 
rotating components, (b) arc deflection during welding and (c) interference effect on 
nearby instruments etc. 

MPI consists of magnetisation of the component, application of magnetic powder 
and examination of powder patterns. The indications of defects are preserved by 
photography or video recording or by the use of peel-off transparent adhesive films. To 
obtain higher sensitivity for defect detection, fluorescent magnetic particles suspended 
in oil or kerosene are employed rather than the dry powders. Magnetic silicone rubber 
containing magnetic particles in suspension is widely used for the inspection of 
relatively inaccessible regions e.g. bolt holes and internal threads. These areas are of 
great concern in military aircraft and detection of cracks in these regions is challenging. 
In this method, a highly viscous rubber solution is applied on the component surface 
before magnetisation. After magnetisation, sufficient time is allowed for the particles to 
migrate to the leakage fields before the solution is cured. This method is also known as 
magnetic rubber inspection (Zheng & Zhu 1993). 

One major limitation of MPI is the necessity for application of magnetic field in 
different directions. This is because of the fact that the leakage field, and in turn the 
sensitivity in MPI, is maximum only when a defect is perpendicular to the direction of 
the magnetic field. Hence, in order to detect defects in different orientations, the 
magnetic field has to be applied in more than one direction. There are a wide variety of 
methods to magnetise a component such as prod, head shot, central conductor, yoke, 
cable etc. These methods essentially aim at producing circular and/or longitudinal 
magnetisation in the component. The method selected depends on such factors as the 
geometry of the component, desired direction of magnetic field, the expected orientation 
of defects and whether or not the defects are subsurface. Both AC and DC methods 
can be used. Because of the skin depth effect, AC methods are restricted to the detection 
of near-surface defects and therefore are not very effective for detection of subsurface 
cracks. 
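
The skin depth restriction on AC magnetisation can be estimated from the standard relation delta = 1/sqrt(pi f mu0 mur sigma). The following is a minimal sketch only; the conductivity and relative permeability are assumed round numbers for a generic carbon steel and are not taken from the work cited here.

```python
import math

MU_0 = 4 * math.pi * 1e-7  # permeability of free space, H/m

def skin_depth(frequency_hz, conductivity_s_per_m, relative_permeability):
    """Standard skin depth delta = 1 / sqrt(pi * f * mu0 * mur * sigma), in metres."""
    return 1.0 / math.sqrt(
        math.pi * frequency_hz * MU_0 * relative_permeability * conductivity_s_per_m
    )

# Assumed, illustrative material constants for a generic carbon steel.
SIGMA_STEEL = 5.0e6   # S/m (assumption)
MU_R_STEEL = 100.0    # dimensionless (assumption; varies strongly with field)

for f in (50.0, 1000.0):
    d_mm = skin_depth(f, SIGMA_STEEL, MU_R_STEEL) * 1e3
    print(f"{f:7.0f} Hz -> skin depth about {d_mm:.2f} mm")
```

With these assumed constants the field is confined to a few millimetres at mains frequency and to less at higher frequencies, which is consistent with the restriction of AC methods to near-surface defects.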

Procedures based on wet fluorescent MPI techniques have been successfully developed 
at the authors’ laboratory for the structural integrity evaluation of 235 MW low 
pressure turbine rotors of PHWRs (Baldev Raj 1993b). This effort consisted of 
performing MPI on turbine blades of all five stages and fir-tree root regions of all stages 
adopting cable and yoke methods of magnetisation respectively. Current ratings and 
sensitivity were optimised with the help of field indicators and gauss meters. These 
procedures have also been successfully used to perform MPI in-situ without lifting the 
rotor from the turbine bed. Defects detected by these procedures were characterised 
using direct current potential drop technique (refer §2.8 below). Detailed in-situ 
metallography and residual stress measurements (X-ray diffraction technique) were 
carried out to supplement the MPI observations and to assess the damage at those 
regions. In-situ metallography involves polishing, etching, examination of the suspected 
regions with a portable microscope and recording of indications by the replication 
method. 

In the MLF method, the presence and extent of defects are detected by monitoring the 
surface for localised fluctuations in the magnetic field with sensors like inductive coils, 
Hall probes, magnetometers and magnetoresistive, magnetic resonance and magnetographic 
type probes (Forster 1977). As in MPI, in this technique also, two forms of 
magnetisation i.e. DC and AC are commonly used. The AC method has higher 
detection sensitivity than the DC method and defect signals are proportional to the 
depth of the defect. Since magnetisation is local, demagnetisation is not required. 
However, the DC method is capable of detecting sub-surface defects and is relatively 
insensitive to interference from the material surface noise caused by roughness, scale 
etc. (Junger & Brock 1990). 

The higher sensitivity of the flux leakage methods (close to that of magnetic particle 
inspection) and the additional advantage of automatic evaluation without a human 
inspector make the technique amenable to on-line inspection at the shop floor. The AC 
MLF technique is used to characterise stresses in high strength steels and their welds. 
One of the primary application areas for practical MLF systems is that of axisymmetric 
components i.e. tubes, pipes, rods, bars, rounds, billets etc. Components of 
these geometries are particularly amenable to fully automated and relatively high speed 
scanning in a production line during their manufacture. MLF method is also widely 
accepted for in-service inspection of underground pipes in petrochemical industries 
(Collins et al 1984). This method has been successfully applied for the detection and 
assessment of defects in highly irregular components such as helicopter rotor blade 
D-spars, gear teeth, artillery projectiles etc. Another successful application is the 
inspection of rolling element anti-friction bearings which form critical components in 
gas turbine engines, high performance aircraft, helicopters, rocket engines, satellites etc. 
(Beissner et al 1980). Advances in sensor technology and data acquisition and analysis 
have resulted in the realisation of fully automated MLF inspection methods. More recently, 
Finite Element (FE) computer programs are being used to simulate this method for 
both simple and complex crack/component configurations. For defects such as fatigue 
cracks, where there is evidence of variable permeability effects, FE modelling provides 
a realistic method of analysis (University Communique 1987). 

Magnetic methods are potential methods for evaluation of microstructural degradation. 
Magnetic flux perturbations and acoustic emissions are generated when an 
induced magnetic field in ferromagnetic materials is swept in a hysteresis loop. The 
former is referred to as Magnetic Barkhausen Noise (MBN) and the latter is called 
Acoustic Barkhausen Noise (ABN). MBN signals are produced as a result of discrete 
changes in magnetisation caused mainly by the motion of the 180° domain walls as the 
magnetic field is varied. The signals are detected by a search coil. Similarly ABN signals 
are generated during the hysteresis sweep due to the rotation of the 90° domain walls 
and the associated magnetostriction. Since MBN is related to the nucleation and 
movement of magnetic domain walls and their interactions with various microstructural 
features such as grain boundaries, precipitates, stresses etc., this technique is 
found to be useful for several applications including microstructural characterisation, 
characterisation of post-weld heat treatment in the weld joints and the measurement of 
residual stresses. Significant work has been done at the authors’ laboratory in this 
direction (Bhattacharya 1994). In the case of 17-4 PH steel, MBN technique has been 
employed to identify the stages i.e. (i) build up of coherency strains, (ii) loss of coherency 
and (iii) interlath austenite formation, which otherwise need transmission electron 
microscopy and field ion microscopy investigations (Bhattacharya et al 1992b). In the 
case of 2.25Cr-1Mo steel, precipitation of Mo2C could be identified using the MBN 
technique. Recrystallisation from martensite structure leading to ferrite formation in 
addition to coarse carbide precipitation could also be characterised. In 9Cr-1Mo steel, 
the incipient occurrence of delta ferrite could be detected by magnetic techniques. Even 
by transmission electron microscopy, it is difficult to identify the incipient occurrence. 
MBN technique has been successfully used for the characterisation of post-weld heat 
treatment in the weld joints of steam generator modules of the proposed Prototype 
Fast Breeder Reactor. These studies efficiently brought out the adaptability of MBN 
analysis for quality assurance of welds. ABN technique has been used for assessment of 
residual stresses in weld joints of structural steels (Bhattacharya et al 1992a). The rms 
voltage of the ABN signal has been used as a parameter for studying the influence of 
stress relief annealing on the stress distribution. Increase in the rms voltage of the ABN 
signal corresponding to the decrease in the residual stresses could be observed. In the 
as-welded condition, stresses are higher at the centre of the weld region and decrease 
gradually at the regions away from it. Reduction in stresses due to annealing could be 
identified in the form of increase in ABN signal. The utilisation of magnetic parameters 
derived from hysteresis loop measurement and MBN and ABN signals for assessment 
of microstructures is emerging as a very promising technique for structural integrity 
monitoring. 
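
The rms voltage used above as the ABN parameter is simply the root-mean-square of the sampled signal. A minimal sketch follows; the two sample lists are made-up numbers chosen only to illustrate the trend described in the text (larger rms after stress relief) and are not measured data.

```python
import math

def rms_voltage(samples):
    """Root-mean-square value of a sampled voltage signal (same units as the input)."""
    if not samples:
        raise ValueError("need at least one sample")
    return math.sqrt(sum(v * v for v in samples) / len(samples))

# Made-up sample values (volts) for illustration only, not measurements.
as_welded = [0.02, -0.03, 0.01, -0.02, 0.03, -0.01]
annealed = [0.05, -0.06, 0.04, -0.05, 0.06, -0.04]

print(rms_voltage(as_welded) < rms_voltage(annealed))
```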


2.4 Radiographic techniques 

Radiographic techniques (RT) are effective industrial methods for the NDE of 
components/structures. Radiographic techniques use a beam of penetrating radiation such as 
X-rays or gamma rays or atomic particles such as neutrons, protons and electrons 
(Baldev Raj & Venkatraman 1989). When the beam passes through a component, some 
of the radiation energy is absorbed and the intensity of the beam is reduced. Variations 
in beam intensity are recorded on a film or fluoroscopic screen with digital system 
coupled with video monitors, and are seen as differences in shading typical of the type 
and size of any defect present. When the defect is an absence of material such as void or 
porosity, the image will appear darker than the surroundings. On the other hand, when 
the material in the defect is more absorptive than the surrounding material, the image 
will be lighter in appearance. In other words, the density of defect indications on the 
film or other recording media mentioned above, is inversely related to the total linear 
density of the material in the beam path. Decreased densities in the appearances of the 
indications on radiographs will therefore indicate more dense material along the 
total beam path. Hence, porosity i.e., gas hole in the material, would appear on the 
radiograph as spots darker than the surrounding material and an inclusion would appear 
lighter or darker depending on the density of its material relative to the component 
(Bray & Stanley 1989b). 
Iridium-192, caesium-137, thulium-170 and ytterbium-169 are used as typical gamma 
ray sources. Recent advances in X-ray tube technology have resulted in focal spots 
which are an order of magnitude smaller, thus giving rise to a whole new classification 
of microfocus radiography techniques (Peugeot 1982). It is thus possible to detect 
minute defects in thin components, thin welds of steel, aluminium or titanium alloys 
which was not possible earlier using conventional radiography. The most impressive 
feature of the microfocus X-ray source is its ability to produce extremely sharp 
radiographs at exceedingly short film-to-focus distances (Halmshaw 1989). In X-ray 
tubes, a small focal spot size of about 10 microns could be realised by electromagnetic 
and electrostatic focusing techniques of the electron beam. An outstanding example 
that brings out this capability is the work carried out at the authors’ laboratory on the 
radiography of tube-to-tube sheet welds in 2.25Cr-1Mo steel tubes of steam generators 
of the Prototype Fast Breeder Reactor (PFBR). Using rod anodes with true radial 
panoramic emission, full circumferential radiographs can be obtained at around 80 kV 
in less than one minute on an Agfa D2 film with a sensitivity better than 1%. This 
sensitivity is much better than that obtainable with the best radio-isotope based 
technique (Venkatraman et al 1993c). The advent of X-ray image intensifiers has 
resulted in the replacement of the conventional fluoroscopy system by a real time radiography 
system. X-ray radiography is an important post-irradiation examination technique for 
the evaluation of nuclear fuel elements. This application is, however, beset with 
a problem called gamma fogging (exposing of film to gamma rays coming from fuel) 
which makes the interpretation of the radiographs difficult if not impossible. Successful 
procedures utilising post radiography image processing methods have been developed 
at the authors’ laboratory to minimise gamma fogging. 
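
The film-darkening argument given at the start of this section follows from the exponential attenuation law I = I0 exp(-mu x). The sketch below compares the transmitted fraction along a sound path, a path containing a gas pore and a path containing a denser inclusion. The attenuation coefficients and thicknesses are assumed values chosen for illustration, and scattered radiation (build-up) is ignored.

```python
import math

def transmitted_fraction(segments):
    """Fraction I/I0 transmitted through a stack of (mu_per_mm, thickness_mm) layers,
    using the narrow-beam exponential attenuation law I = I0 * exp(-sum(mu * x))."""
    return math.exp(-sum(mu * x for mu, x in segments))

MU_METAL = 0.05      # 1/mm, assumed linear attenuation coefficient of the component
MU_VOID = 0.0        # 1/mm, a gas pore attenuates negligibly
MU_INCLUSION = 0.12  # 1/mm, assumed denser inclusion

sound = transmitted_fraction([(MU_METAL, 20.0)])
with_pore = transmitted_fraction([(MU_METAL, 18.0), (MU_VOID, 2.0)])
with_inclusion = transmitted_fraction([(MU_METAL, 18.0), (MU_INCLUSION, 2.0)])

# More transmitted radiation means a darker film image at that location.
print(f"sound: {sound:.3f}  pore: {with_pore:.3f}  inclusion: {with_inclusion:.3f}")
```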

Radiography with thermal neutrons (energies between 0.01 and 0.3 eV) has emerged 
as a powerful NDT technique, called Neutron Radiography (NR), for examination of 
nuclear materials and many other non-nuclear applications as well. Thermal neutrons 
have some properties which make them especially interesting for radiography. While 
the attenuation of X-rays in materials increases with atomic number in an orderly 
manner, the mass absorption co-efficients of elements for thermal neutrons appear 
almost random since neutron absorption does not depend on the electronic structure of 
the atom, but on the interaction with the atomic nucleus. Hence, X-rays are stopped by 
dense materials and passed by light materials and in many instances neutrons have the 
reverse quality. For example, neutrons will penetrate the body of a large metal valve to 
give a good image of an internal rubber seal while X-rays will give the defects present in 
the metallic portion. It is now widely accepted that X-radiography and neutron 
radiography are complementary to each other (Baldev Raj 1994a). NR is used in the 
metallurgical examination of irradiated fuel sub-assemblies, fuel pins and control rods. 
Also it can reveal information about movement of fuel pellets in fuel pin, dimensional 
changes of pins and pellets, pin to pin spacing, and plutonium distribution in the pellets. 
In aerospace industries NR has been used for inspecting the presence of solidified oil 
and grease in lubrication holes and passages. NR has been used to detect earlier stages 
of corrosion in aircraft structures that cannot be found with X-radiography. Other 
applications of NR include inspection of rubber seals, plastic insulations, capacitors, 
potting of transformers, explosives and turbine blades made by investment casting and 
archaeological structures etc. At the authors’ laboratory, a 30 kW uranium-233 fuelled 
swimming pool type reactor called Kalpakkam Mini (KAMINI) is nearing completion. 
The flux at the core is expected to be around 10¹³ n/cm²/s and the flux at the 
radiographic site would be 3 × 10⁶ to 10⁷ n/cm²/s. Two beam tubes are available for 
radiographic purposes, one for active sample radiography and the other for non-active 
sample radiography. Special gadgetry has been designed indigenously for precise 
positioning/rotation of components in front of neutron beam, remote controlled 
cassette drive mechanism and for carrying out real time imaging. A resolution of the 
order of 30 to 40 microns is expected (Kasiviswanathan et al 1989). 

The major constraints of conventional radiography include (a) inability to accurately 
reveal the depth and height of defects so important for the structural integrity 
assessment by means of fracture mechanics, (b) poor discrimination of changes in 
density of an object being detected, and (c) confusion caused due to two-dimensional 
image of three-dimensional component. These limitations are overcome by Computer-aided 
Tomography (CT), which is a useful concept for NDE. In CT, the transmission of 
radiation beams (X-ray, gamma-ray or neutron) through an object in a single plane is 
measured under various object angles and positions. From the transmitted intensities 
for many beam orientations, a CT image representing a map of absorption coefficients 
in the measured cross-sections is calculated and presented in visual form on a video 
display using a graphic system and colour-coded scales that represent various densities 
as different colours (Vontz et al 1988). Cracks, pores and density fluctuations in the 
material can be displayed by a change of colour. CT images can be obtained in digital form 
and can be further mathematically processed to extract the required information 
(Halmshaw 1989). A typical interesting application of CT is the assessment of engineering 
ceramics. Engineering ceramics have a number of excellent properties, in particular, 
high strength at elevated temperatures and oxidation resistance, which make them 
attractive materials for a variety of applications. On the other hand, the monolithic 
ceramics are very brittle (i.e. they possess very low fracture toughness), and the 
permitted defect is as small as 20 microns in certain applications. CT has been applied 
to the examination of advanced ceramics and has proven to be a very efficient method 
within the required detection limits (Arnold 1990). While gamma rays and X-rays are used for 
general applications, the former are preferred for the inspection of thicker components. 
Neutrons are also being used for CT especially for the evaluation of nuclear fuel 
sub-assemblies and fuel pellets. 
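
The calculation of an absorption map from ray measurements at several orientations can be illustrated with a toy algebraic reconstruction. This is a sketch under simplifying assumptions, not the reconstruction method of the cited systems: the cross-section is a hypothetical 2 x 2 pixel grid, every ray is given unit path length in each pixel it crosses, and the Kaczmarz iteration is used as one simple iterative solver.

```python
def kaczmarz(rays, measurements, n_pixels, sweeps=200):
    """Algebraic reconstruction (Kaczmarz iteration) for the ray-sum system A x = b.

    rays         -- rows of A; rays[i][j] is the path length of ray i in pixel j
    measurements -- measured ray sums b, i.e. -ln(I/I0) for each ray
    """
    x = [0.0] * n_pixels
    for _ in range(sweeps):
        for row, b in zip(rays, measurements):
            norm2 = sum(a * a for a in row)
            residual = b - sum(a * xi for a, xi in zip(row, x))
            step = residual / norm2
            # Project the current estimate onto the hyperplane of this ray equation.
            x = [xi + step * a for xi, a in zip(x, row)]
    return x

# Hypothetical cross-section, pixels ordered
# [top-left, top-right, bottom-left, bottom-right]; the low value mimics a pore.
true_mu = [1.0, 1.0, 1.0, 0.2]

rays = [
    [1, 1, 0, 0], [0, 0, 1, 1],   # horizontal rays
    [1, 0, 1, 0], [0, 1, 0, 1],   # vertical rays
    [1, 0, 0, 1], [0, 1, 1, 0],   # diagonal rays (unit weights assumed)
]
measurements = [sum(a * m for a, m in zip(row, true_mu)) for row in rays]

reconstructed = kaczmarz(rays, measurements, n_pixels=4)
print([round(v, 3) for v in reconstructed])
```

Horizontal and vertical rays alone do not determine the four pixel values uniquely; the diagonal rays supply the missing information, which is why measurements at many orientations are needed.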

The latest state-of-the-art is development of expert systems using artificial intelligence 
for radiography. Efforts are underway at the authors’ laboratory to develop 
an expert system for optimisation of procedure for radiography (Venkatraman et al 
1993b). 

2.5 Ultrasonic testing 

Ultrasonic testing (UT) uses sound waves having frequencies usually in the megahertz 
range. A sound wave travelling through a material will lose energy when it encounters 
an anomaly. This is manifested by a loss in pulse amplitude as well as a change in its 
appearance. There are three basic processes that account for loss of pulse energy, 
namely beam spreading, absorption and scattering. Beam spreading is primarily 
a geometric function where the intensity decreases as the inverse square of the distance 
travelled. Absorption accounts for the mechanical energy converted to heat energy as 
the wave propagates. Scattering results from reflection at grain boundaries, small 
cracks and other anomalies. This phenomenon is utilised to non-destructively measure 
grain size in materials (Bray & Stanley 1989a). 

Two basic methods in UT are pulse-echo and through transmission. While the 
former method makes use of a single transducer, the latter makes use of two. In the 
pulse-echo method, a transducer made of piezoelectric material (also called probe) 
transmits a pulse of mechanical energy into the component. The energy passes into the 
material, reflects from the back surface, and is detected by the same transducer, yielding 
a signal on an oscilloscope with a time-base. The oscilloscope would normally show the 
original pulse of the ultrasonic transducer (front surface echo), the back reflection and 
any extra blip indicating a reflection from a defect in the material. From the oscilloscope 
timing, the depth of the defect below the surface can be determined. Alternatively, 
in the transmission method, transducers are placed on opposite sides of the component 
and any reduced intensity sensed by the receiving transducer indicates the defect 
shadowing part of the ultrasonic radiation. The location of the defect cannot be 
obtained. Both pulse-echo reflection and transmission methods are in use and their 
selection depends on the accessible surface of the component. UT can detect defects 
oriented both in the plane of and normal to the surface of components using normal 
beam or angle beam transducers. By suitable design of ultrasonic transducer, ultrasonic 
beams can be introduced into a material at almost any angle. There are several forms of 
ultrasonic waves, the most widely used in NDT being compressional (longitudinal) and 
transverse (shear) waves. In a specific application of tube testing for detecting defects 
normal to the wall, the beam is converted to a shear wave which is propagated around 
the circumference or along the axis. In order to detect defects efficiently by UT, it is 
necessary to make the wavelength comparable to the expected defect size. Hence, for 
detection and assessment of smaller defects, it is necessary to use high frequency. The 
UT data can be displayed in three modes viz. A-scan, B-scan and C-scan (Szilard 1982; 
Baldev Raj et al 1994a). 
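
Two of the relations used in this paragraph are easily evaluated: the wavelength is velocity divided by frequency, and in pulse-echo the reflector depth is half the round-trip path. A minimal sketch follows; the velocity is an assumed typical value for longitudinal waves in steel and the echo time is hypothetical.

```python
def wavelength_mm(velocity_m_per_s, frequency_mhz):
    """Wavelength = velocity / frequency, returned in millimetres."""
    return velocity_m_per_s / (frequency_mhz * 1e6) * 1e3

def reflector_depth_mm(velocity_m_per_s, round_trip_time_us):
    """Pulse-echo depth: the pulse travels to the reflector and back, so depth = v * t / 2."""
    return velocity_m_per_s * (round_trip_time_us * 1e-6) / 2.0 * 1e3

V_LONG_STEEL = 5900.0  # m/s, assumed typical longitudinal velocity in steel

for f_mhz in (1.0, 5.0, 10.0):
    print(f"{f_mhz:4.1f} MHz -> wavelength {wavelength_mm(V_LONG_STEEL, f_mhz):.2f} mm")

# Hypothetical echo arriving 6.0 microseconds after the front surface signal.
print(f"depth {reflector_depth_mm(V_LONG_STEEL, 6.0):.1f} mm")
```

The wavelength shrinks in proportion to the frequency, which is the reason high frequencies are needed for small defects.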

Ultrasonic attenuation can be used to monitor material properties. Empirical 
correlations have been obtained between ultrasonic attenuation and the impact 
strength, fracture toughness, grain size and tensile strength of steels. Similarly, ultrasonic 
velocity measurements can be used to measure residual stresses in materials. This 
methodology uses shear waves polarised in two mutually perpendicular directions. 
These waves have slightly different velocities and so interfere, so that as the transducer 
is rotated, the interference vanishes when the polarising planes are parallel and 
perpendicular to the stress axis. Once this axis is known, the actual stress can be 
computed from the velocities (Halmshaw 1989). Stress measurements using ultrasonic 
technique are also dependent on the acousto-elastic effect i.e. strain-induced ultrasonic 
wave velocity variations in materials. By precise measurement of ultrasonic velocity, 
information about stress can be obtained. Using the pulse-echo overlap method, an accuracy 
of transit time measurements of the order of ± 1 nanosecond has been achieved by us 
(Palanichamy et al 1992a). 
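
To give a sense of scale to a ± 1 nanosecond timing accuracy, the sketch below converts it into a relative velocity resolution for a pulse-echo measurement. The specimen thickness and transit time are hypothetical, and the thickness is assumed to be known exactly, so that dv/v = dt/t.

```python
def velocity_and_resolution(thickness_mm, round_trip_time_ns, timing_error_ns=1.0):
    """Return (velocity in m/s, relative velocity uncertainty) for a pulse-echo measurement.

    Velocity v = 2 * d / t; with the thickness taken as exact, dv/v = dt/t.
    """
    velocity = 2.0 * (thickness_mm * 1e-3) / (round_trip_time_ns * 1e-9)
    relative_error = timing_error_ns / round_trip_time_ns
    return velocity, relative_error

# Hypothetical 10 mm thick specimen with a 3390 ns round-trip time.
v, rel = velocity_and_resolution(10.0, 3390.0)
print(f"velocity about {v:.0f} m/s, resolvable change about {rel * 100:.3f} %")
```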

Ultrasonic methods are also widely used for detection and characterisation of defects 
in structural welds (Pelseneer & Louis 1974). We have developed procedures for 
reliable and automated detection of defects down to 5% wall thickness in the end cap 
weld joints of thin-walled fuel elements of PHWRs (Subramanian et al 1991). In view of 
the difficulties encountered during conventional UT due to upsets produced during 
welding process, easily implementable signal analysis approaches have been used 
(Baldev Raj et al 1992). High sensitivity defect detection and characterisation in 
weldments of maraging steels used for rocket motor casings in the aerospace industry is 
of great importance. Tight cracks (3 mm × 1 mm) produced in these welds by fatigue 
loadings were detected by us using cluster and pattern analysis of UT data. Detection of 
such small defects enhances the payload capacity of the rocket (Kalyanasundaram et al 
1991a). 

Austenitic stainless steel (AUSS) components and structures find extensive applications 
in various industries and they enjoy a dominant position (75% approximately) 
among other stainless steels. This is chiefly because of high level of fabricability and 
corrosion resistance. UT is the most widely used NDE technique for the assessment of 
AUSS and their welds. UT of AUSS welds and castings is much more difficult than that 
of other stainless steels because of the following: (a) severe attenuation at high 
frequencies, (b) presence of anisotropic grain structure (dendrites) which cause the 
ultrasonic beam to bend (beam skewing), and (c) noisy behaviour of the material 
masking the pulses from defects (Juva & Haarvisto 1977). A whole range of new UT 
methodologies have been investigated in order to improve the UT inspection of AUSS 
welds and castings. Some of them include ultrasonic spectroscopy, acoustic holography 
and SAFT etc. Automated systems are now available for the inspection of pipe welds in 
critical components in nuclear, oil and other industries with limited access. An 
integrated prototype rule based expert system (ES) which helps in taking quick and 
reliable decisions in the UT procedures to be adopted for efficient NDE of AUSS welds 
has been developed at the authors’ laboratory (Rajagopalan et al 1992). This ES 
considers the following details before making a decision: dendritic microstructure of 
the weld, weld dimensions, geometry, joint type, welding procedure, metallurgical 
history, reflective characteristics of various types of defects, equipment parameters etc. 
Inspection of inaccessible regions and complex geometries would become simpler if the 
ultrasonic beam path is known. Visualisation software for simulating the path taken 
by an acoustic/ultrasonic wave inside the weld is currently under development at the 
authors’ laboratory. 

A wide variety of conventional UT methods are available to measure crack depth 
like (i) defect echo height method, (ii) decibel drop method, (iii) edge echo method, 
(iv) scattering method and (v) composite aperture method. However, characterisation 
and sizing of defects require sophisticated instrumentation, methodologies and computer-based 
automation. A few techniques/concepts that have proven capabilities for 
defect and material characterisation include ultrasonic phased arrays, ALOK, time of 
flight diffraction (TOFD) and synthetic aperture focusing technique (SAFT). Electromagnetic 
acoustic transducers (EMAT), being of the non-contact type, offer the 
potential advantage of performing inspection of high temperature components without 
the need for any physical coupling. Similarly, laser-based ultrasound generation 
and interferometric detection techniques show great promise in this regard for 
characterisation of materials at high temperatures or in hazardous environments. This 
technique has been used to detect laminar defects in red-hot steel billets. Space 
limitations preclude full-fledged discussion of these advanced techniques. However, the 
reader may refer to the literature available on these topics (Silk 1984; Muller et al 1986; 
Bohn et al 1987). 

Ultrasonic phased arrays hold out considerable promise. By using a number of 
elements excited with different time delays, it is possible to both steer and focus the 
beam in transmission and to image on reception of the beam. The ability to electroni¬ 
cally control the beam characteristics of the phased array probe has an attraction in 
automated systems where a large number of different probe angles may be utilised for 
reliable inspection (Cochran et al 1991). 
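
The time delays that steer and focus a phased array beam follow from simple geometry: each element is fired so that its pulse reaches the chosen focal point at the same instant as the others. The sketch below is illustrative only; the element count, pitch, focal point and velocity are assumed values, and the elements are treated as point sources on a flat surface.

```python
import math

def focal_delays_us(n_elements, pitch_mm, focus_x_mm, focus_z_mm, velocity_m_per_s):
    """Firing delays (microseconds) that make all element pulses arrive at the focus together.

    Elements lie on the line z = 0, centred on x = 0. The element farthest from the
    focus fires first (zero delay); nearer elements are delayed by the path difference.
    """
    v_mm_per_us = velocity_m_per_s * 1e-3  # m/s -> mm/us
    centre = (n_elements - 1) / 2.0
    paths = [
        math.hypot(focus_x_mm - (i - centre) * pitch_mm, focus_z_mm)
        for i in range(n_elements)
    ]
    longest = max(paths)
    return [(longest - p) / v_mm_per_us for p in paths]

# Assumed illustrative probe: 8 elements, 1 mm pitch, longitudinal waves in steel.
delays = focal_delays_us(8, 1.0, focus_x_mm=10.0, focus_z_mm=20.0, velocity_m_per_s=5900.0)
print([round(d, 3) for d in delays])
```

Changing the focal point changes only the delay table, which is what allows many beam angles to be used electronically without changing the probe.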

ALOK (Amplituden und Laufzeit Orts Kurven), meaning “amplitude and time of 
flight locus curves” is a recent development with its origin in Germany. This method 
incorporates all the features required for automated inspection of pressurised components 
in any industry viz. high sensitivity and resolution, reliability and reproducibility, 
quick information on the condition of the component and ability to analyse a crack 
without re-examination. These inspection systems have the capability to detect cracks 
even in complex geometries. In contrast to conventional assessment of amplitude 
information, the ALOK method uses the time-of-flight of the echo signals to get 
complete information about a crack, such as its position, size, shape and orientation. 
The time-of-flight locus curves are obtained from the time-of-flight (e.g. echo from 
a reflector) vs. probe position during its transit. To get the time-of-flight (and the 
corresponding amplitude) information, only the peak amplitudes from an A-scan and 
their times-of-flight (selected out of the rf signal) are recorded. This results in significant 
reduction in data without any loss of information. The data are also displayed in the 
form of B- and C-scans. Flaw reconstruction is made by geometric planar triangulation 
or by iterative methods. 
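
A time-of-flight locus curve and a planar triangulation can be sketched for the simplest possible case. The assumptions here are mine and not part of the ALOK system description: a single point reflector, a probe that radiates in all directions from a point on a flat surface, a constant assumed shear-wave velocity and hypothetical reflector coordinates.

```python
import math

def time_of_flight_us(probe_x_mm, reflector_x_mm, reflector_z_mm, velocity_m_per_s):
    """Round-trip time of flight from a probe on the surface (z = 0) to a point reflector."""
    v_mm_per_us = velocity_m_per_s * 1e-3
    return 2.0 * math.hypot(probe_x_mm - reflector_x_mm, reflector_z_mm) / v_mm_per_us

def triangulate(x1_mm, t1_us, x2_mm, t2_us, velocity_m_per_s):
    """Locate a point reflector from times of flight at two probe positions.

    Each time of flight defines a circle of radius r = v * t / 2 around the probe;
    the reflector lies where the two circles intersect below the surface.
    """
    v_mm_per_us = velocity_m_per_s * 1e-3
    r1 = v_mm_per_us * t1_us / 2.0
    r2 = v_mm_per_us * t2_us / 2.0
    x0 = (r1**2 - r2**2 + x2_mm**2 - x1_mm**2) / (2.0 * (x2_mm - x1_mm))
    z0 = math.sqrt(max(r1**2 - (x1_mm - x0) ** 2, 0.0))
    return x0, z0

V = 3230.0  # m/s, assumed shear-wave velocity in steel
# Locus curve (probe position, time of flight) for a hypothetical reflector
# at x = 12 mm and a depth of 15 mm.
locus = [(x, time_of_flight_us(x, 12.0, 15.0, V)) for x in range(0, 25, 4)]
(xa, ta), (xb, tb) = locus[0], locus[1]
print(triangulate(xa, ta, xb, tb, V))
```

Any two points of the locus curve suffice in this idealised case; with real data the remaining points provide redundancy against noise.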

SAFT is another promising technique in ultrasonic NDE. The basic idea of SAFT is 
to measure the complete sound field scattered by a crack on an orbit around the crack. 
During reconstruction, this sound field is calculated back into that region where the 
scattering occurred making use of well-known wave propagation formulae. The result 
is the three-dimensional amplitude distribution of the sound field inside the scanned 
orbit. In an ideal case, sound field will be non-zero only on the surface of the scatterer, 
thus obtaining an image of the crack by describing its surface. Because a 360° scanning 
around a defect may be possible only in a few cases like rods or shafts, one is forced to 
reduce the scanned aperture to only the part corresponding to the accessible surface. 
Therefore, only that part of the crack which could be interrogated by the ultrasonic 
beam would be imaged. To maximize the interrogated area of the crack, probes with 
large beam opening angles have to be used i.e. small probes or the defocused part of the 
far field probes. In addition, it is possible to synthesize scanning around the defect using 
different beam angles and different inspection techniques like pitch-catch or tandem. 
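
The back-calculation of the sound field in SAFT is commonly implemented as a delay-and-sum over the recorded A-scans. The sketch below shows that idea for a heavily idealised case: a linear pulse-echo scan along a flat surface, a single point scatterer, and A-scans reduced to unit impulses. The velocity, sampling rate, grid and scatterer position are all assumed values.

```python
import math

V_MM_PER_US = 5.9     # assumed longitudinal velocity in steel, mm/us
FS_MHZ = 50.0         # assumed sampling rate, samples per microsecond
N_SAMPLES = 600

def sample_index(probe_x_mm, x_mm, z_mm):
    """Sample index of the round-trip echo from point (x, z) for a probe at (probe_x, 0)."""
    t_us = 2.0 * math.hypot(x_mm - probe_x_mm, z_mm) / V_MM_PER_US
    return int(round(t_us * FS_MHZ))

def simulate_ascans(probe_positions, scatterer):
    """Idealised A-scans: one unit impulse per probe position at the scatterer's echo time."""
    scans = []
    for px in probe_positions:
        scan = [0.0] * N_SAMPLES
        scan[sample_index(px, *scatterer)] = 1.0
        scans.append(scan)
    return scans

def saft_image(probe_positions, scans, xs, zs):
    """Delay-and-sum: for each pixel, add the sample each A-scan holds at that pixel's delay."""
    image = []
    for z in zs:
        row = []
        for x in xs:
            total = 0.0
            for px, scan in zip(probe_positions, scans):
                k = sample_index(px, x, z)
                if k < N_SAMPLES:
                    total += scan[k]
            row.append(total)
        image.append(row)
    return image

probes = [float(p) for p in range(0, 21, 2)]   # scan line on the surface, mm
xs = [float(x) for x in range(0, 21)]          # image grid, mm
zs = [float(z) for z in range(5, 26)]
scans = simulate_ascans(probes, scatterer=(8.0, 14.0))
image = saft_image(probes, scans, xs, zs)
# Largest pixel value together with its (x, z) position.
peak = max((val, x, z) for row, z in zip(image, zs) for val, x in zip(row, xs))
print(peak)
```

Only at the scatterer position do the contributions from all probe positions add up, which is the sense in which the reconstructed field is non-zero on the scatterer alone.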

Another innovation in the recent past is the time-of-flight diffraction (TOFD) 
technique. TOFD is widely used for accurate location and sizing of cracks. Interaction 
of an acoustic wave with a crack results in the generation of diffracted waves. It is 
important to note that the diffracted energy can produce a significantly large pulse. 
While this pulse will be smaller than the largest that can be obtained with reflection 
techniques, it will usually be comparable with pulse echoes obtained in practice. As 
a result of diffraction, a cylindrical wave front effectively originates at the crack tip. By 
suitable measurements of this diffracted wave, it is possible to locate the origin of the 
cylindrical wave, or in other words, to locate the tip of the crack. This leads to the 
concept of the use of diffracted ultrasound to size cracks. In order to achieve this, the 
most obvious measurements to make on the diffracted pulses are their arrival times. 
This, in turn, couples ‘diffraction’ and ‘time of flight’ together. The technique was 
originally pioneered on surface-breaking cracks. But, with improvements in data 
processing techniques, it has been possible to evaluate subsurface and internal defects. 
The technique produces a significant improvement in accuracy of defect sizing as 
compared to conventional pulse echo data (Temple 1987). 
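
The way arrival times translate into crack size can be sketched for the common two-probe arrangement. The geometry is an assumption not spelled out in the text: transmitter and receiver on the same surface, separated by 2S, with the crack tips midway between them and probe delays ignored. The velocity, separation and tip depths are hypothetical.

```python
import math

def tip_depth_mm(arrival_time_us, half_separation_mm, velocity_m_per_s):
    """Depth of a diffracting crack tip midway between transmitter and receiver.

    Path length = 2 * sqrt(S^2 + d^2) = v * t, hence d = sqrt((v * t / 2)^2 - S^2).
    """
    half_path = velocity_m_per_s * 1e-3 * arrival_time_us / 2.0  # mm
    if half_path < half_separation_mm:
        raise ValueError("arrival earlier than the direct surface path")
    return math.sqrt(half_path**2 - half_separation_mm**2)

def crack_height_mm(t_top_us, t_bottom_us, half_separation_mm, velocity_m_per_s):
    """Through-wall extent from the arrival times of the upper and lower tip signals."""
    return (tip_depth_mm(t_bottom_us, half_separation_mm, velocity_m_per_s)
            - tip_depth_mm(t_top_us, half_separation_mm, velocity_m_per_s))

V = 5900.0   # m/s, assumed longitudinal velocity in steel
S = 30.0     # mm, assumed half probe separation
# Hypothetical arrival times computed for tips at 10 mm and 15 mm depth.
t_top = 2.0 * math.hypot(S, 10.0) / (V * 1e-3)
t_bottom = 2.0 * math.hypot(S, 15.0) / (V * 1e-3)
print(f"crack height about {crack_height_mm(t_top, t_bottom, S, V):.2f} mm")
```

Because the size comes from arrival times rather than echo amplitudes, it does not depend on how strongly the crack reflects.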

The technique that has established itself as a powerful tool for defect and material 
characterisation during PSI and ISI is acoustic microscopy. Acoustic microscopy 
produces very high resolution and high magnification images of internal features in 
a component through the use of short wavelength (very high and ultrahigh frequencies) 
ultrasound. There are three types of acoustic microscopy systems. They are scanning 
acoustic microscopy (SAM), scanning laser acoustic microscopy (SLAM) and C-SAM. 
In SAM, a point source of ultrasound is produced in a fluid such as water by a highly 
focused transducer. The transducer is pulsed and alternately acts as a transmitter and 
receiver. The transducer scans in a rectilinear raster pattern to obtain the image. In 
SLAM, a focused laser beam is employed as an ultrasonic wave detector by virtue of the 
fact that displacement at an interrogated surface modulates light reflected from the 
material. The displacement is localised within acoustic dimensions. The image of the 
sample is produced by using plane waves of ultrasound and scanning by laser over the 
field of view at the rate of 30 images per second which is about 300 times faster than 
SAM. SLAM is generally used in transmission mode whereas SAM is used in reflection 
mode. C-SAM is somewhat analogous to the C-scan ultrasonic technique. The acoustic 
microscopy techniques have been used for detection and characterisation of cracks in 
a variety of materials such as ceramics, metals, polymers and composites and they have 
the capability to be utilised from initial characterisation of the base materials to the 
final quality check of the precision fabricated assemblies. They have been found useful 
in crack detection in thin, flat or smoothly curved surfaces. With the use of phased array 
and a solid coupling system, acoustic microscopy could become an effective method for 
inspecting components/structures both during pre-service and in-service. This method 
has been used to detect and characterise surface and sub-surface cracks down to 100 
microns in ceramics (Baldev Raj et al 1994b). 

Fibre-reinforced composite materials are being increasingly used in defence, space 
and commercial applications due to their low weight, high strength and stiffness 
properties. This necessitates the development and standardisation of NDE techniques 
for meeting fitness-for-purpose requirements. UT enjoys the status of being a reliable 
method for evaluation of composite structures in spite of the difficulties posed by 
composite structures, for example high attenuating behaviour, heterogeneous nature 
and anisotropic properties. Different kinds of defects that can occur in fibre-reinforced 
plates during fabrication and in-service are contaminants, voids, unpolymerised resin, 
resin-rich and resin-starved areas, ply gaps, foreign inclusions, delaminations, crazing, 
surface scratches etc. Detection and characterisation of some of these defects has been 
carried out by us using an automatic ultrasonic scanning system. Two different 
experimental methods (ultrasonic immersion and dry coupling) have been used. For 
effective detection and characterisation of defects, procedures have been developed 
using advanced signal processing and analytical approaches such as auto-power 
spectrum, total spectral energy content and demodulated auto-correlation pattern etc. 
(Kalyanasundaram et al 1991b). 

In the area of microstructural characterisation, we have successfully adopted 
ultrasonic velocity and attenuation measurements in the following applications: 
(i) characterisation of precipitation behaviour and estimation of volume fraction of γ′ in 
Nimonic alloy PE 16, (ii) assessment of microstructural degradation (dissolution of 
tempered martensite structure to ferrite and carbides) in 9 Cr-1 Mo steel, 
(iii) characterisation of precipitation hardening behaviour in 17-4 PH steel, (iv) characterisation of 
various microstructures in A533B pressure vessel steel, (v) estimation of porosity in 
aluminium alloy weldments, (vi) determination of texture coefficients in cold worked 
AISI type 304 stainless steel, and (vii) assessment of creep damage in low alloy ferritic 
steels (Jayakumar et al 1992a). Many of these results would be useful in developing 
procedures for microstructural degradation and damage assessment applications 
utilising ultrasonic techniques (Bhattacharya et al 1993; Jayakumar et al 1993, 1994). 

Ultrasonic velocity measurements are useful for determining several important 
material parameters such as moduli, Poisson’s ratio, residual stresses, texture and porosity, 
and for characterising secondary phases in the microstructure. Some of these parameters 
influence the velocity only by the order of 0.1%. In order to study these variations, high 
resolution techniques for ultrasonic velocity measurements are necessary. We have 
devised a methodology for precise ultrasonic velocity measurements using the 
cross-correlation technique. The accuracy in the velocity measurements is enhanced by 
adopting cubic spline fitting of the cross-correlation function. The resolution in time 
delay measurements required for computing the velocity was of the order of 0.5 ns. This 
methodology has been applied for finding the influence of microstructure on ultrasonic 
velocity in 9 Cr-1 Mo ferritic steel and 17-4 PH stainless steel (Rao et al 1993b). In the 
next sub-section, the possibility of using ultrasonic velocity measurements to aid 
quality control during fabrication and damage assessment during service is discussed. 
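
The idea of estimating a time delay from the peak of a cross-correlation function can be
sketched as follows. This is only an illustration under stated assumptions: the two gated
echoes are synthetic tone bursts, the sampling interval and all names are hypothetical, and
a three-point parabolic fit around the peak is used as a simpler stand-in for the cubic
spline fit mentioned above.

```python
import math

def echo(t, t0, f0=15e6, width=50e-9):
    """Synthetic echo: Gaussian-windowed 15 MHz tone burst centred at t0 (s)."""
    return math.exp(-((t - t0) / width) ** 2) * math.cos(2 * math.pi * f0 * (t - t0))

def cross_correlation(a, b, max_lag):
    """r[k] = sum_i a[i] * b[i + k] for k = -max_lag .. +max_lag (equal lengths)."""
    n = len(a)
    return [
        sum(a[i] * b[i + k] for i in range(max(0, -k), min(n, n - k)))
        for k in range(-max_lag, max_lag + 1)
    ]

def time_delay(a, b, dt, max_lag):
    """Delay of b relative to a, refined to a fraction of the sampling interval."""
    r = cross_correlation(a, b, max_lag)
    p = max(range(len(r)), key=lambda k: r[k])
    frac = 0.0
    if 0 < p < len(r) - 1:
        y0, y1, y2 = r[p - 1], r[p], r[p + 1]
        curvature = y0 - 2.0 * y1 + y2
        if curvature != 0.0:
            # Vertex of the parabola through the three points around the peak.
            frac = 0.5 * (y0 - y2) / curvature
    return (p - max_lag + frac) * dt

# Hypothetical digitiser: 1 ns sampling, two 400-sample gates.
dt = 1e-9
times = [i * dt for i in range(400)]
true_delay = 7.3e-9
first = [echo(t, 200e-9) for t in times]
second = [echo(t, 200e-9 + true_delay) for t in times]
delay = time_delay(first, second, dt, max_lag=20)
print(f"estimated residual delay = {delay * 1e9:.3f} ns")
```

In a pulse-echo measurement the velocity would then follow as twice the specimen thickness
divided by the total transit time, that is, the known offset between the two gates plus the
residual delay estimated here.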

2.5a Ultrasonic velocity measurements for microstructural evaluation in 9 Cr-1 Mo 
steel - A case study : 9 Cr-1 Mo steel is a candidate material for boiler/super heater 
steam circuitry of power plants. Ultrasonic velocity measurements were carried out on 
various heat-treated 9 Cr-1 Mo ferritic steel specimens with a view to study the 
influence of microstructure on ultrasonic velocity. Heat treatment consists of aging 
treatment at 823, 923 and 1023 K for durations of 2, 10, 50, 200, 500 and 1000 h. 
Ultrasonic velocity measurements were carried out on these specimens using a 15 MHz 
longitudinal transducer. The microstructure after standard treatment (normalising at 
1323 K for 30 min followed by tempering at 1023 K for 1 h) is martensite with M2X 
carbides at lath boundaries and small amounts of ferrite due to partial dissolution of 
martensite (reference specimen). The reference specimen has the lowest velocity compared 
to all the aged specimens. At all the temperatures, it is observed that as the aging 
time increases, velocity first increases and then decreases or remains almost constant 
after attaining a peak depending on the temperature. The increase in velocity is lower at 
1023 K as compared to that at 823 and 923 K. This is attributed to the formation of 
a smaller volume fraction of M2X at 1023 K due to simultaneous formation of ferrite and 
M23C6 by dissolution of some of the martensite. It can also be observed that the 
maximum in velocity has occurred at lower durations of aging with increase in aging 
temperatures due to faster precipitation of carbides at higher temperatures. The time 
periods for maximum in velocity at 823, 923 and 1023 K are 200, 100 and 10 h 
respectively. At 1023 K beyond 10 h of aging, the velocity is found to decrease steadily 
whereas at 823 and 923 K the velocity remains almost constant or decreases slightly 
after attaining the peak. The steady decrease in velocity at 1023 K after 10 h of aging is 
attributed to the continuous dissolution of martensite forming ferrite and M23C6. 
Since very little or no ferrite formation takes place even up to 1000 h at 823 and 923 K, 
the velocity remains constant after attaining the peak. 

These measurements clearly indicate that the formation of M2X precipitates and of ferrite 
influences the ultrasonic velocity. Any increase in velocity with time at a given location 
on a component indicates formation of precipitates of M2X. Similarly, a decrease in 
velocity with time clearly indicates formation of ferrite, which reduces the strength 
properties. In the same way, the influence of an unwanted phase on the velocity can be 
established for other materials, and in-service degradation can be assessed 
periodically by performing velocity measurements. 

2.6 Acoustic methods 

Acoustic methods (AM), also known as sonic testing, vibration testing and mechanical 
impedance testing, depend on exciting vibrations in a specimen by local impact and 
then measuring some properties of the vibrations such as resonance frequency, decay 
time etc. A simple example of an acoustic method is the inspection of a railway wheel by 
tapping it with a hammer and detecting the vibrations with a transducer. There are 
several other important applications of acoustic methods, the most important being 
testing of cast iron specimens where it has been shown that the nature and size of 
carbon content can be related directly to the resonance frequency of the casting. This 
method is extensively used for quality control of malleable iron castings. Bonded joints, 
defects in composites and plastic components have been located and assessed using this 
technique. Testing of adhesive bonding in laminates and honeycomb structures for 
aerospace applications is nowadays carried out by these methods. Using much lower 
frequencies of the order of a few kilohertz, masonry structures have been tested by these 
methods. The structure is excited by a blow with a steel-tipped hammer and the 
frequency of the propagated signal is analysed through a transient recorder in 
conjunction with a computer. When the method was applied to masonry bridge 
structures, the signal was found to be sensitive to significant defects such as large voids 
and cracks (Halmshaw 1989). 
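
As an illustration of how a resonance frequency might be extracted from a recorded tap
response, the sketch below picks the strongest bin of a plain discrete Fourier transform.
It is a minimal example under stated assumptions: the ring-down signal is synthetic, the
sampling rate and decay time are hypothetical, and the function name is not taken from any
instrument or library.

```python
import cmath
import math

def dominant_frequency(samples, sample_rate):
    """Frequency (Hz) of the largest-magnitude DFT bin, excluding the DC bin."""
    n = len(samples)
    best_bin, best_mag = 1, 0.0
    for k in range(1, n // 2):
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                  for i, x in enumerate(samples))
        if abs(acc) > best_mag:
            best_bin, best_mag = k, abs(acc)
    return best_bin * sample_rate / n

# Hypothetical tap response: a 1 kHz resonance decaying with a 10 ms time constant,
# sampled at 8 kHz for 256 points (frequency resolution 31.25 Hz).
fs = 8000.0
ring = [math.exp(-(i / fs) / 0.010) * math.sin(2 * math.pi * 1000.0 * i / fs)
        for i in range(256)]
resonance = dominant_frequency(ring, fs)
print(f"dominant frequency = {resonance:.1f} Hz")
```

A production check would compare the measured resonance with the value obtained from an
acceptable reference casting; the acceptance band is application specific and is not given
in the text.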

Vibration monitoring of operating plant and machinery to give early warning of 
impending failure or potential fault conditions forms a popular NDE method. Even the 
most simple machinery has a number of sources of vibration and these vibrations are 
transmitted through the machine structure. The vibration spectrum depends on the 
machine speed, loading, lubrication etc. The vibration is characterised by amplitude, 
frequency and phase and is usually measured by a small electromechanical transducer 
that senses displacement, velocity or acceleration. Periodic measurements carried 
out on a machine provide information on any significant changes that are indicative of 
deterioration in the machinery. These measurements may be continuous, on-line or 
periodic. 

2.7 Eddy current testing 

Eddy current testing (ECT) methodology uses the principles of electromagnetic 
induction to inspect electrically conducting components for detection, characterisation 
and classification of abnormalities. In ECT, an alternating current (frequency range 
kHz-MHz) is made to flow in a coil (also called probe) which, in turn, produces an 
alternating magnetic field around it. This coil, when brought close to the electrically 
conducting surface of the material to be inspected, induces an eddy current flow in it 
due to electromagnetic induction. These eddy currents are generally parallel to the 
direction of coil winding. The presence of any defect or discontinuity in the material 
disturbs the eddy current flow. These eddy currents, in turn, generate an alternating 
magnetic field (in the opposite direction) which may be detected either as a voltage 
across a second coil or by the perturbation of the impedance of the original coil (Libby 
1971). The impedance change in the probe coil is mainly affected by the electrical 
conductivity, magnetic permeability and geometry of the material, the test frequency, the 
spacing between the coil and the material (also called lift-off or fill factor), and also by 
the presence of any defect or discontinuity in the material. The impedance change can 
be measured and can be correlated with changes in the above parameters. When an 
eddy current probe is scanned over a component containing a defect, the amplitude of 
the eddy current signal (impedance change trajectories) provides information about the 
severity of the defect, and the phase angle provides information about its depth. 
Natural cracks or sets of artificial defects made by electric-discharge machining (EDM) 
or specially produced fatigue cracks are used to calibrate the ECT instruments and to 
characterise cracks/defects in a reliable manner. 
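
The amplitude and phase angle referred to above are simply the polar form of the impedance
change in the impedance plane. A minimal sketch, assuming the coil impedance is available
as a complex number (resistance plus j times reactance) and that the phase is measured from
the resistive axis; in practice the phase is usually referred to a chosen reference such as
the lift-off direction. The names and values are hypothetical.

```python
import cmath
import math

def signal_amplitude_phase(z_defect, z_baseline):
    """Amplitude (ohm) and phase (degrees) of the impedance change."""
    delta = z_defect - z_baseline
    amplitude, phase = cmath.polar(delta)
    return amplitude, math.degrees(phase)

# Hypothetical coil impedances on sound material and over a defect.
z_sound = complex(20.0, 50.0)
z_flaw = complex(20.3, 50.4)
amplitude, phase_deg = signal_amplitude_phase(z_flaw, z_sound)
print(f"amplitude = {amplitude:.3f} ohm, phase = {phase_deg:.1f} deg")
```

Relating the phase angle to defect depth still requires the calibration defects described
in the text; the code only extracts the two quantities that are compared with the
calibration.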

A change in microstructure is generally associated with a change in either electrical 
conductivity or magnetic permeability. Hence, ECT can be used to assess microstructural 
degradation of components. While dislocation structures and residual 
stresses created in the materials affect the electrical conductivity, the formation of 
magnetic phases affects the magnetic permeability. Studies were carried out by us on 
solution annealed AISI 304 stainless steel plates subjected to different degrees of cold 
work. A good correlation was observed between degree of cold work and the ECT 
output. Formation of magnetic phases beyond 25% cold work was also observed and 
confirmed qualitatively by ferritoscope and microscopy (Kasiviswanathan et al 1986). 
These studies clearly indicate the possible use of ECT for detection and quantification 
of microstructural degradation. Other important applications of ECT include metal 
sorting, thickness measurement, corrosion monitoring, fatigue crack detection, liquid 
sodium level monitoring, conductivity measurements etc. 

During the last two decades, significant progress has been made in the theoretical 
and practical aspects of ECT (Shyamsunder & Rao 1992). Axi-symmetric finite 
difference and finite element models (FEM) have enabled understanding the physics of 
electromagnetic field/defect interactions. At our laboratory, the work has shown 
significant promise. A 2D FEM code capable of operating on a personal computer has 
been developed. This model has been used to predict the impedance changes, the 
magnetic flux line and eddy current density contours to typify the interactions in ECT. 
The model was used to optimise probe design for garter spring location in the coolant 
channels of Pressurised Heavy Water Reactors, the details of which would be given in 
the case study described at the end of this section. Signal trajectories were predicted for 
various coil designs (of same cross-sectional area) and optimum design parameters viz. 
width of coil, depth of coil and spacing between two differential coils of the probe were 
arrived at, considering the maximum amplitude signal with differential nature. Very 
good agreement between the predicted and experimental trajectories was observed for 
ECT probes with varying design parameters due to the presence of a garter spring (Rao 
et al 1990). 

Stress corrosion cracking is the major failure mode observed in the case of heat 
exchanger tubes, since they are subjected to continuous flow of high temperature fluids, 
steam, and other aggressive environments. The deterioration of the tube material leads 
to the possibility of leaks. ECT is the most popular technique for periodic monitoring of 
these tubes because of its ease of operation, versatility and reliability. Conventional 
single frequency ECT is inadequate when interferences arise from such sources as 
support plates and other structures. Often, it is under the support 
plates that corrosion damage takes place. Multifrequency techniques, which 
eliminate signals from unwanted parameters like support plates, probe wobble etc., are 
widely used. These techniques involve simultaneous excitation of the ECT probe at more 
than one frequency and processing of the corresponding analog signals to 
eliminate signals due to unwanted parameters. We have standardised procedures for 
condition monitoring of steam generators and condensers using two-frequency ECT 
and successfully carried out inspection of such components in power and petrochemical 
industries. Procedures have also been developed by us to eliminate three parameters 
(support plates, probe wobble and sodium deposits) that would be present during 
ECT of heat exchangers of fast breeder reactors (Rao et al 1989). 
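
The principle of suppressing an unwanted parameter by combining two frequencies can be
sketched digitally as follows. This is a simplified illustration, not the analog
processing used in the procedures above: it assumes that the support plate signal at the
second frequency is a rotated and scaled copy of that at the first, so that a single
complex coefficient fitted on a defect-free support plate scan cancels it. All signal
shapes, coefficients and names are hypothetical.

```python
import math

def fit_mix_coefficient(primary, secondary):
    """Least-squares complex c minimising sum |primary - c * secondary|^2."""
    num = sum(p * s.conjugate() for p, s in zip(primary, secondary))
    den = sum(abs(s) ** 2 for s in secondary)
    return num / den

def mix(primary, secondary, c):
    """Mixed output channel: primary signal minus the scaled, rotated secondary."""
    return [p - c * s for p, s in zip(primary, secondary)]

# Synthetic scan positions and signal shapes (real-valued lobes).
plate = [math.exp(-((i - 20) / 5.0) ** 2) for i in range(60)]
flaw = [math.exp(-((i - 22) / 2.0) ** 2) for i in range(60)]

# Calibration scan: support plate only, seen at both frequencies.
cal_f1 = [complex(1.0, 0.5) * g for g in plate]
cal_f2 = [complex(0.4, -0.2) * g for g in plate]
c = fit_mix_coefficient(cal_f1, cal_f2)

# Inspection scan: a defect under the support plate. The defect responds
# differently at the two frequencies, so it survives the mixing.
scan_f1 = [complex(1.0, 0.5) * g + complex(0.2, 0.3) * d for g, d in zip(plate, flaw)]
scan_f2 = [complex(0.4, -0.2) * g + complex(0.02, 0.01) * d for g, d in zip(plate, flaw)]
mixed = mix(scan_f1, scan_f2, c)
print(f"peak residual after mixing = {max(abs(m) for m in mixed):.3f}")
```

Each additional unwanted parameter generally needs one more test frequency, which is
consistent with the move from two-frequency procedures to the elimination of three
parameters described above.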

ECT of ferromagnetic heat exchanger tubes is difficult due to their high and 
continuously varying magnetic permeability. These variations produce high amplitude 
ECT signals that mask the signals from defects. The use of magnetic saturation can 
overcome these difficulties to a large extent. Ferromagnetic tubes can be satisfactorily 
inspected by ECT if they are magnetically saturated. Magnetic saturation essentially 
indicates a condition where all magnetic dipoles are aligned in the same direction 
parallel to each other and the material starts behaving non-ferromagnetically. Magnetic 
saturation is easily achieved during tube manufacture using an external DC 
saturation coil and eddy current testing is carried out using an encircling eddy current 
probe. However, the same cannot be implemented during in-service inspection (ISI) of 
heat exchangers or steam generators due to limited space from the boreside of the tube. 
In-service inspection of ferromagnetic tubes by eddy current testing (ECT) is a major 
challenge. In this area, we have successfully designed a high strength SmCo5 permanent 
magnet-based eddy current probe, and the results were evaluated using a calibration tube 
with artificial defects (Shyamsunder et al 1993). 

For pre-service and in-service inspection of ferromagnetic tubes, a new technique 
that is showing great promise and potential is remote field eddy current testing 
(RFECT). The primary advantages of this technique are the ability to inspect tubular 
products with equal sensitivity to both internal and external metal loss or other 
anomalies, a linear relationship between wall thickness and the measured phase lag, and 
the absence of lift-off problems. The technique can inspect both ferromagnetic and 
non-ferromagnetic materials with equal sensitivity to internal or external anomalies 
(Kilmore & Ramachandran 1989). Pioneering work has been carried out by us with 
respect to the development of an RFECT instrument and the computer simulation of 
the technique. Wall loss down to 15% was detected using an indigenously developed 
RFECT instrument. The presence of transition and remote field zones and the effect of 
tube diameter and wall thickness on them have been studied using a 2D-FEM code 
(Rao et al 1993a). Efforts are underway to optimise the frequency and other parameters 
to obtain better detection sensitivity. 
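
The linear relationship between wall thickness and phase lag lends itself to a simple
calibration-and-inversion step. The sketch below fits a straight line to calibration
readings and inverts it for a measured phase lag. The calibration values, nominal wall
thickness and function names are hypothetical and serve only to show the arithmetic.

```python
def fit_line(x, y):
    """Ordinary least-squares slope and intercept of y against x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def wall_thickness_from_phase(phase_lag_deg, slope, intercept):
    """Invert the calibration line: thickness that gives the measured phase lag."""
    return (phase_lag_deg - intercept) / slope

# Hypothetical calibration tube: wall thickness (mm) against phase lag (degrees).
cal_thickness = [1.0, 1.5, 2.0, 2.5]
cal_phase = [20.0, 30.0, 40.0, 50.0]
slope, intercept = fit_line(cal_thickness, cal_phase)

nominal = 2.0
thickness = wall_thickness_from_phase(34.0, slope, intercept)
wall_loss = 100.0 * (nominal - thickness) / nominal
print(f"remaining wall = {thickness:.2f} mm, wall loss = {wall_loss:.1f} %")
```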

At our laboratory, procedures employing ECT for integrity evaluation of small diameter 
thin wall fuel cladding tubes of fast reactors (Fast Breeder Test Reactor, 5.1 mm outer dia 
and 0.38 mm thick; Prototype Fast Breeder Reactor, 8.2 mm outer dia and 0.5 mm thick) 
have also been standardised. Longitudinal and transverse notches with a depth of seven 
percent of the wall thickness, and natural defects produced during the manufacturing stage, 
have been successfully characterised (Barat et al 1982). Complementary capabilities of ECT 
and ultrasonic testing techniques have been established. These procedures are applicable 
for carrying out NDE during manufacturing stages, and prior to and after irradiation. The 
special probes and gadgetry required for carrying out inspection of irradiated fuel cladding 
tubes, necessitated by radioactivity and other associated problems, have been developed in-house. 





One recent technique that has shown tremendous potential in detecting defects in 
expansion transition zones and rolled joints of heat exchangers, with equal sensitivity to 
both longitudinal and transverse defects, is Phased Array ECT (PAECT). This system 
uses a substantially different bridge circuit and probes that produce a constant 
magnitude rotating magnetic field which is insensitive to expansion transition zone, 
tube sheet or support plate. We have designed and developed a tandem PAECT probe 
that minimises the insensitive zones, in turn, leading to enhanced sensitivity for 
inspection of tubes (Shyamsunder et al 1994). 

ECT has benefited from the availability of relatively low cost Personal Computers 
(PCs) and the accompanying revolution in digital technology (Rao et al 1992). A recent 
outcome of these advances in the area of ECT is eddy current imaging (ECI). We have 
developed a computer-based ECI system to scan the object surface and create impedance 
images in the form of gray levels or pseudo colours. This consists of a PC-controlled 
X-Y scanner which scans the component point-by-point in a raster fashion with an 
ECT probe, acquires data using a 12-bit analogue to digital converter card, and finally 
processes and displays data in the form of images. Using ECI, it is possible to obtain 
images of defects in two dimensions enhancing the defect detection capability. Fatigue 
cracks, corrosion pits, EDM notches and other types of defects have been imaged and 
characterised (Rao et al 1993c). In the case of fatigue cracks, ECI is found to be capable 
of revealing the orientation of the crack which is an important feature for fatigue crack 
growth and fracture mechanics studies. ECI has also been used to detect the weld centre 
line in stainless steel welds with an accuracy of 0.1 mm. This could enable scanning of 
the welds during angle beam UT and obtaining accurate information on defect 
location. Other developmental works carried out include: optimisation of probe design 
for ECI and implementation of filtering and image processing methodologies to 
improve the quality of images. The above discussions clearly indicate the potential of 
ECI in detecting and characterising defects in metallic materials effectively. 
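
The step from raster-scanned ADC readings to a gray-level image can be sketched as follows.
It is a minimal illustration, assuming that the 12-bit samples arrive row by row in a
unidirectional raster and that a linear mapping to 8-bit gray levels is adequate; the
function names are hypothetical and do not describe the actual ECI software.

```python
def adc_to_gray(code, bits=12):
    """Map an unsigned ADC code (0 .. 2**bits - 1) to an 8-bit gray level."""
    full_scale = (1 << bits) - 1
    code = min(max(code, 0), full_scale)   # clamp out-of-range readings
    return round(code * 255 / full_scale)

def raster_to_image(samples, columns):
    """Reshape a row-by-row list of ADC codes into rows of gray levels."""
    if columns <= 0 or len(samples) % columns:
        raise ValueError("number of samples must be a multiple of the row length")
    return [[adc_to_gray(s) for s in samples[start:start + columns]]
            for start in range(0, len(samples), columns)]

# Hypothetical 2 x 3 scan: a high reading marks the position of a defect.
image = raster_to_image([100, 120, 110, 105, 4000, 115], columns=3)
for row in image:
    print(row)
```

A pseudo-colour display would replace the gray-level mapping with a look-up table indexed
by the same 8-bit value.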

NDE plays a major role in ensuring safe and reliable operation of Indian nuclear 
reactors. One of the important condition monitoring techniques used in these reactors is 
ECT. In the following section, one such application of ECT, i.e. structural integrity 
evaluation of pressure tube (PT)/Calandria tube (CT) assemblies of Pressurised Heavy 
Water Reactors (PHWRs), is discussed. 

2.7a ECT for structural integrity of PT/CT assemblies - A case study : Pressurised 
Heavy Water Reactors are the backbone of India’s nuclear power programme. These 
reactors use natural UO2 as fuel and heavy water as moderator (which slows down the 
neutrons for their effective participation in the heat generating fission reaction). These 
reactors are located at Kota, Rajasthan (RAPS), Kalpakkam, Tamilnadu (MAPS), 
Narora, Uttar Pradesh (NAPS) and Kakrapara, Gujarat (KAPP). A typical 235 MW 
PHWR consists of 306 channels. Each of these channels consists of a pressure tube 
(9220 mm long, 82.6 mm dia and 4 mm thick) made of Zircaloy-2 which is positioned 
horizontally between two tube sheets. Each PT contains 12 fuel bundles and heavy 
water coolant passes through it. The outlet from all 306 PTs forms the primary heat 
transport system. Each PT is surrounded by a Calandria tube (107.8 mm diameter and 
1 mm thick), again made of Zircaloy-2, leaving a nominal gap of 8.5 mm. Outer regions 
of the CT form the moderator system. Garter springs (two in earlier reactors and four in 
more recent reactors) are positioned in each of the annular channels so that gaps 
between the PTs and CTs are maintained (i.e. they do not touch each other during 
reactor operation) to prevent heat transfer from the primary heat transport system to the 
moderator system. The garter springs are made of an alloy of zirconium, niobium and 
copper. The material of construction of PTs and CTs has been changed to zirconium-2.5% 
niobium alloy from Kakrapara Atomic Power Project (KAPP) onwards because of its 
better creep and corrosion properties. 

Subsequent to the failure of a PT in the Pickering-2 reactor (similar to a PHWR) in 
Canada in 1983, it was realised that the garter springs could get displaced during hot 
conditioning and/or during the operation of the reactor (Field et al 1984). This 
displacement increases the unsupported length of the PTs. This increase, coupled with 
the in-reactor creep of the PTs, may make the outer surface of the PTs (at a maximum 
temperature of 548 K) touch the cold inner surface of the CTs (at a temperature of 333 K), 
leading to the formation of a cold spot and, further, to the generation of massive hydrides 
through intense hydrogen diffusion to the cold spot region. This, in turn, results in formation of 
blisters and cracks leading to ultimate failure of the PT. In view of this, the structural 
integrity of PTs needs to be considered not only with respect to their own properties, 
but together with the CTs and the garter springs as well. This can be accomplished by 
performing NDE at various stages viz. during pre-hot conditioning, post-hot conditioning 
and in-service to monitor the following: 

(a) location of garter springs between PT and CT; 

(b) profiling of gap between PT and CT; 

(c) defect detection in PT. 

ECT has been proven to be one of the best techniques to fulfil the above mentioned 
requirements. Development of procedures, design of the required ECT probes and training 
of personnel for carrying out the above mentioned assignments have been undertaken 
by the authors’ laboratory. An FEM model has been developed and used to optimise probe 
design for garter spring location. Such probes have been successfully used for carrying 
out PT inspections at various PHWRs. The accuracy of garter spring locations has 
been ±5 mm and that of the gap profile ±0.5 mm. Significant work has also been done at 
the authors’ laboratory for the development of a PC-based data acquisition system for 
acquiring data from ECT equipment and a database software package for documentation, 
evaluation and analysis of garter spring data from different reactors during 
various inspection campaigns (Rao et al 1991). The defect detection in the PTs is mainly 
concerned with the detection of localised hydride blisters, if any. However, other types 
of defects may also be present. In a full-scale mock-up assembly containing various 
artificial notches, a 5% wall thickness deep longitudinal notch and an 8% wall thickness 
deep transverse notch were detected unambiguously. 

2.8 Potential drop techniques 

The potential drop technique is widely used for crack depth measurements in metallic 
materials. A crack detected by other NDT techniques can be reliably assessed for its 
length and orientation by this technique. This technique is also used to monitor the 
growth of a crack during the service life of a component. The instruments required for 
this technique are simple and portable. Basically, there are two types of techniques, 
namely the Direct Current Potential Drop (DCPD) technique and the Alternating Current 
Potential Drop (ACPD) technique. While the former is useful for detecting and 
assessing surface and subsurface defects, the latter is useful for detecting and assessing 
only surface defects, since the current is confined to the surface layers by the classical 
skin effect (Baldev Raj et al 1994b). 
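
How thin the current-carrying surface layer is can be estimated from the standard skin
depth expression, delta = 1 / sqrt(pi f mu sigma). The short sketch below evaluates it in
SI units; the material properties and frequencies are hypothetical values chosen only to
show the order of magnitude and are not taken from the text.

```python
import math

MU_0 = 4e-7 * math.pi  # permeability of free space, H/m

def skin_depth(frequency_hz, conductivity_s_per_m, relative_permeability=1.0):
    """Standard depth of penetration (m) of an alternating current."""
    mu = MU_0 * relative_permeability
    return 1.0 / math.sqrt(math.pi * frequency_hz * mu * conductivity_s_per_m)

# Hypothetical ferritic steel: conductivity 5e6 S/m, relative permeability 100.
for f in (5e3, 50e3):
    depth = skin_depth(f, 5e6, 100.0)
    print(f"{f / 1e3:.0f} kHz: skin depth = {depth * 1e3:.3f} mm")
```

The expression diverges as the frequency tends to zero, which corresponds to the direct
current case where the current is no longer confined to the surface.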

In these techniques, current is sent into the component perpendicular to the crack 
through two current electrodes, and the voltage drop across the crack is measured 
using two voltage electrodes (or the current electrodes themselves; the electrode 
assembly is also called the probe) and evaluated for parameters of the crack such as 
depth and inclination to the surface. When the two electrodes of the probe, separated by 
a distance D, straddle a crack of depth h, the surface current path between the electrodes 
lengthens from D to D + 2h (down one crack face and up the other), producing a potential 
difference of V2 as against V1 for crack-free regions. The 
potential differences V1 and V2 are measured and the crack depth is computed. The 
instrument, therefore, consists essentially of a power supply unit to set up the field and 
a sensitive voltmeter and a suitable probe with two current points spaced by a known 
distance. The DCPD technique is not affected by the magnetic permeability, but only 
affected by the electrical resistivity and the geometry of the specimen. In order to 
evaluate the crack parameters from the measured voltages, calibration charts are 
needed. We have used this technique to size the fatigue cracks detected by magnetic 
particle inspection technique in a turbine rotor. 
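
For the alternating current case, the depth computation from the two voltage readings can
be sketched with the usual thin-skin idealisation: the current follows the surface, runs
down one crack face and up the other, and the voltage is proportional to path length, so
that V2/V1 = (D + 2h)/D. The function name and the readings below are hypothetical, and
real geometries need the calibration charts mentioned above.

```python
def acpd_crack_depth(v_reference, v_crack, probe_spacing):
    """Crack depth h from h = (D / 2) * (V2 / V1 - 1), in the units of D."""
    if v_reference <= 0:
        raise ValueError("reference voltage must be positive")
    return 0.5 * probe_spacing * (v_crack / v_reference - 1.0)

# Hypothetical readings: 10 mm probe spacing, 2.0 mV on sound material,
# 3.0 mV with the probe straddling the crack.
depth_mm = acpd_crack_depth(v_reference=2.0, v_crack=3.0, probe_spacing=10.0)
print(f"estimated crack depth = {depth_mm:.2f} mm")
```

Only the ratio of the two voltages enters the result, so the absolute current level and
the material resistivity cancel as long as they are the same for both readings.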

Numerical modelling offers the possibility of obtaining calibration charts for 
specimens and cracks with complicated geometries and with variations in material 
properties. Because of the need for good electrical contact, the ACPD technique requires 
extensive cleaning of the component surface. This makes it difficult to deploy in the 
field. Also, the ACPD technique is really suited only for the sizing of cracks already 
found by other techniques. These drawbacks can be overcome by adopting a new 
technique called alternating current field measurement (ACFM). ACFM involves the 
measurement of magnetic field disturbances rather than electrical fields. The electrical 
field is introduced by induction (and therefore requires no electrical contact) and the 
probe is designed to introduce a uniform input field approximately perpendicular to 
the expected orientation of the crack. With two orthogonal field sensors, it is possible to 
both detect and size surface breaking defects without calibration. This technique has 
been successfully used for reliable and cost-effective inspection of a wide range of metallic 
components including offshore platforms/structures, pressure vessels, pipe supports, 
threads, sub-sea structures etc. (Topp 1994). 

2.9 Acoustic emission technique 

Acoustic Emission Technique (AET) has established its rightful place among the 
available NDE techniques. Acoustic emissions (AE) are the spontaneously generated 
elastic waves created by discrete movements within the volume of a material being 
stressed. AE can be detected from movements as small as those created by phase 
transformations within a crystalline structure or from movements as large as the 
propagation of a fatigue crack under low cycle fatigue conditions. Thus, an AE source is 
the site of a dynamic change in the material which responds to stimuli by releasing part 
of the energy in the form of elastic stress waves (McIntire & Miller 1987). This dynamic 
nature of AET makes it a highly promising technique for monitoring the structural 
integrity of components/structures in various industries with respect to identifying 
growing cracks in them. AET enables detection and location of defects even in complex 
geometries. The detection of AE is usually accomplished with a piezoelectric crystal 
sensor coupled to the part under test. The sensor output is amplified through a high 
gain, low noise preamplifier, filtered in order to remove any extraneous low frequency 
hardware noise, conditioned and displayed. Conventional displays of AE data depict the 
rate of AE, total cumulative counts, amplitude of AE and r.m.s. voltage versus other 
environmental parameters such as load, temperature, time etc. Various AE signal 
parameters helpful in evaluating cracks include ringdown counts, events, peak amplitude 
and their distributions, r.m.s. value, rise time and event duration. Signal analysis, 
pattern recognition and neural network approaches offer promising possibilities for 
enhancing the usefulness of this technique. 
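
The signal parameters listed above can be extracted from a digitised burst with a fixed
detection threshold. The following is a minimal sketch using common working definitions
(ringdown counts as upward threshold crossings, rise time from the first crossing to the
peak, duration from the first to the last crossing). The definitions used by a particular
instrument may differ, and the sample values, threshold and names are hypothetical.

```python
import math

def ae_hit_parameters(samples, dt, threshold):
    """Return basic parameters of one AE burst, or None if nothing crosses."""
    crossings = [i for i in range(1, len(samples))
                 if samples[i - 1] < threshold <= samples[i]]
    if not crossings:
        return None
    first, last = crossings[0], crossings[-1]
    # Largest positive excursion at or after the first threshold crossing.
    peak_index = max(range(first, len(samples)), key=lambda i: samples[i])
    rms = math.sqrt(sum(x * x for x in samples) / len(samples))
    return {
        "ringdown_counts": len(crossings),
        "peak_amplitude": samples[peak_index],
        "rise_time": (peak_index - first) * dt,
        "duration": (last - first) * dt,
        "rms": rms,
    }

# Hypothetical burst sampled every microsecond, threshold of 1.0 (arbitrary units).
burst = [0.0, 0.5, 1.2, 0.3, -1.0, 0.2, 2.0, 0.1, -0.5, 1.1, 0.0]
hit = ae_hit_parameters(burst, dt=1e-6, threshold=1.0)
print(hit)
```

Distributions of these quantities over many hits, rather than single values, are what the
text refers to for evaluating cracks.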

AET is particularly useful for monitoring structural integrity as it can detect growing 
cracks down to a size of 25 microns, even in noisy environments. Also, 
cracks originating from the inner surface are as easily detectable and distinguishable as 
those from the outer surface. AET assesses the behaviour of defects under test 
conditions, which is information of major significance for performance evaluation. Further, 
AET is very useful in predicting fatigue failures in advance (Moore & Tsang 1977). AET 
has been successfully applied to the detection and monitoring of corrosion fatigue, 
hydrogen embrittlement and stress corrosion cracking. An inherent advantage of AET 
is that it is non-localised: it is not necessary to examine specific regions of a structure, and a 
large volume can be tested at one time. Many components in high performance aircraft 
structures are fabricated from structural alloys which, while having high strength levels, 
have relatively poor crack arresting capability. Small cracks, which have a greater 
potential to remain undetected, can approach critical size with very little extension. For 
these types of structures, only AET offers reliable capability to detect any increment of 
crack growth during structural loading. AET is best suited for on-line monitoring of 
welds, as it can detect and locate weld defects as they occur. This possibility enables 
control of quality and any repairs required can be carried out expeditiously and 
economically. Event rate and r.m.s. voltage of AE signals could be used to detect, locate 
and differentiate weld cracking, inclusions and process equipment malfunctions like 
short circuit, current variations and arc drifting etc. (Holroyd 1986). 

In our laboratory, we have emphasized the use of AET for both structural integrity 
monitoring and materials research applications (fracture, deformation, phase 
transformations, oxidation etc.) (Khanna et al 1987; Baldev Raj et al 1989, 1990). AET has 
been used to assess the integrity of pressurised components in operating plants by way 
of detecting leaks. Pressurised pipes, such as gas pipelines and heat transport piping in 
power stations, are generally designed to satisfy the leak-before-break concept (i.e. 
a crack in a tube grows to cause a small leak long before reaching the critical crack size 
for catastrophic pipe rupture). However, this concept requires that a leak is detected at 
an early stage. Thus, AE monitoring is a potential technique for continuous surveillance 
for detection of cracks (Jayakumar et al 1992b). 

AET has been applied at the authors’ laboratory for understanding deformation and 
fracture process in austenitic alloys (AISI type 316 stainless steel and Nimonic alloy 
PE-16), oxidation behaviour in ferritic steels, glass-crystalline transition temperature 
determination and structural relaxation studies in metallic glasses and martensite 
transformation in unstable austenite (AISI type 304 stainless steel). In the case of 
Nimonic PE-16 alloy, various AE parameters have been used to correlate the AE 
behaviour with the particle shearing and Orowan looping processes taking place in the 
presence of gamma prime, and the decohesion and fracture of MC occurring during tensile 
deformation (Jayakumar et al 1991). AET has been used by us to study martensite 
formation in cold worked AISI type 316 stainless steel during cooling after aging and to 
determine the temperature range for its formation during cooling (Mukhopadhyay 
et al 1994). 

As the AE signals generated in austenitic stainless steel are weak, an innovative 
technique has been developed by us to amplify the AE signals (Baldev Raj et al 1990). 
The technique is based on the understanding that simultaneous external injection of 
ultrasonic waves is expected to interact with subcritical AE sources and give rise to 
enhanced AE signals which otherwise would not have been released at those stress 
levels. Amplification factors of 2 to 20 could be obtained for peak-to-peak voltages of 
0.2 to 0.7 V of the injected signals (Baldev Raj 1990). AET has also been applied for 
finding possible growth of a repaired crack during service of a pipeline carrying steam 
at high pressure and temperature. Comparison of AE data near the crack region and at 
a region away from it showed events with higher mean peak amplitude near the crack 
region, indicating growth of the crack. Subsequent radiographic examination revealed a 
missing root-pass weld which might have acted as the source of stress concentration 
and crack initiation. In the following section, a case study highlighting the successful 
application of AET for detection and location of leak paths in the end shield and 
pressure tubes of PHWRs is discussed. 

2.9a AET for leak detection in PHWR components : AET has been successfully 
applied for detection and location of leak paths present on an inaccessible side of an end 
shield of unit 1 of Rajasthan Atomic Power Station (RAPS) (Kalyanasundaram et al 
1989). This methodology was based on the fact that air and water-leak AE signals have 
different characteristic features. Baseline data was generated from a sound end shield of 
a PHWR for characterising the background noise. A mock up end shield system with 
saw-cut leak paths was used to verify the validity of the methodology. AET was carried 
out to detect light-water leakage from one of the end shields. Time-domain analysis of 
AE signals obtained by air pressurisation of the end shield to 0.124 MPa was used for 
detection and location of leak paths. However, time-domain analysis could not be applied 
for detection of subsequent leaks, found after repair and operation of the system, as the 
maximum air pressurisation of the end shield was limited to 0.035 MPa. Since the 
time-domain analysis did not provide any meaningful results, a frequency spectral analysis 
approach was used. Autopower spectra showed the presence of a characteristic frequency 
associated with the air leaks. It was also observed that the characteristic frequency of the 
signal was different for the two leak paths; this was attributed to the size, shape and 
morphology of the leak paths. In another instance, for finding a leaking pressure tube 
of MAPS, another PHWR type of reactor, advanced frequency analysis approaches were 
adopted (Kalyanasundaram et al 1992). In this case, a simple frequency spectrum could 
not be used for detection of the leaking pressure tube. Only by using the spectral energy 
ratio at two different frequency bands and its change with pressure was it possible to 
identify the leaking pressure tube. The above studies indicate the advantage of 
frequency-domain analysis over time-domain analysis for detection and location of leaks 
occurring at relatively low pressures and under noisy environments. 
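
The spectral energy ratio mentioned above can be sketched in a few lines of Python. This is only an illustration of the general idea, not the procedure actually used in the cited work: the function names, the band limits and the test signal are hypothetical, and the bands used in the reported studies are not given in the text.

```python
import numpy as np

def band_energy(signal, fs, f_lo, f_hi):
    """Energy of the signal in the band [f_lo, f_hi), from its power spectrum."""
    spectrum = np.fft.rfft(signal)
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / fs)
    power = np.abs(spectrum) ** 2
    mask = (freqs >= f_lo) & (freqs < f_hi)
    return power[mask].sum()

def spectral_energy_ratio(signal, fs, band_a, band_b):
    """Ratio of the energies in two frequency bands, each given as (f_lo, f_hi) in Hz."""
    return band_energy(signal, fs, *band_a) / band_energy(signal, fs, *band_b)

# Hypothetical record: two tones whose amplitudes differ by a factor of 2.
fs = 1.0e6
t = np.arange(1000) / fs
record = 2.0 * np.sin(2 * np.pi * 100e3 * t) + np.sin(2 * np.pi * 300e3 * t)
ratio = spectral_energy_ratio(record, fs, (50e3, 150e3), (250e3, 350e3))
```

Evaluating such a ratio at several pressures, channel by channel, gives the kind of trend that could be compared between a suspect pressure tube and its neighbours. Because it is a ratio, it is insensitive to overall gain differences between sensors.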

2.10 Infrared thermography 

Infrared thermography (IRT) refers to the mapping of the temperature distribution over 
the surface of an object. This technique utilises the infrared spectral band of electromagnetic 
radiation. With the aid of a suitable detector, infrared radiation can be converted into 
an image and visualized for interpretations. The method uses a special television 
camera with an IR sensitive detector and a lens which transmits IR radiation. Such 
cameras can operate at conventional video rates. The temperature variations in the 
object under investigation are displayed as shades of gray or can be converted into 
a pseudo-colour image. Temperature variations as small as 0.2 degrees can be detected. 
At ambient temperatures and above, all objects emit electromagnetic radiation 
predominantly in the infrared region, which is invisible to the eye. Variations in the 
temperature of the surface of the object can be converted into a thermal image and 
visualised. This helps detection of deviations from normal temperature distribution, 
from a distance. It is this advantage that the technique exploits. The basic idea behind 
thermographic NDE is that if a pulse of heat is applied to one side of a specimen, the 
spatial distribution of the heat flux on the opposite surface will depend on the 
homogeneity of the specimen (i.e. the presence of internal defects), the diffusivity (which 
is related to thermal conductivity and volumetric heat capacity) and time. The difference 
between thermographic methods and other methods using electromagnetic radiation 
is that in the IRT method the heat is propagated by a diffusion process and not 
instantaneously. The difference in temperature on the surface facing the detector can be 
measured and related to the size of the internal defect. Infrared thermography can be 
classified basically into two categories, viz. passive and active. In the passive technique, the 
natural heat distribution is measured over the surface of a hot structure. This is 
generally used for temperature monitoring of structures for assessing their integrity. In 
the active technique, heating or cooling is induced or applied to part or the whole of the 
surface and the redistribution of heat or temperature across the test surface is measured. 
This is generally used in NDE (Baldev Raj et al 1990a). 
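
The role of diffusivity and time in the active technique can be made concrete with a short Python sketch. It uses the standard definition of thermal diffusivity (conductivity divided by the product of density and specific heat) and the order-of-magnitude estimate that heat needs a time of about L²/α to diffuse across a thickness L. The function names are hypothetical, and the property values are rounded illustrative numbers of the order of those of an austenitic stainless steel, not data from this work.

```python
def thermal_diffusivity(conductivity, density, specific_heat):
    """alpha = k / (rho * c_p); m^2/s when all inputs are in SI units."""
    return conductivity / (density * specific_heat)

def diffusion_time(thickness, diffusivity):
    """Order-of-magnitude time for heat to diffuse across a thickness: L^2 / alpha."""
    return thickness ** 2 / diffusivity

# Illustrative (assumed) values: k = 16 W/m.K, rho = 8000 kg/m^3, c_p = 500 J/kg.K
alpha = thermal_diffusivity(16.0, 8000.0, 500.0)   # about 4e-6 m^2/s
t_plate = diffusion_time(0.005, alpha)             # 5 mm plate: a few seconds
```

An internal defect that locally impedes heat flow delays or reduces the temperature rise on the observed surface over this time scale, which is why the observation time has to be matched to the thickness and diffusivity of the part.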

IRT has several important features which make it attractive for inspection of 
materials, components and plants. The method is non-contact, dry, safe, fast and 
displays results of inspection in a convenient visual format. IRT has established 
a unique position for testing and evaluation of circuit boards, transmission lines, 
furnaces, air-conditioned enclosures, gas and fluid transfer lines etc. This technique is 
ideally suited for wear determination of refractories in blast furnaces, hot blast stoves 
and steel stoves and in the inspection of rotary kiln lining. It has been proved to be 
a very important tool for predictive maintenance in electrical systems. It can locate any 
impending failures or locate points which are likely to cause trouble. The technique, 
being noncontact, is very promising for remote applications. 

In the field of materials evaluation, IRT has been applied for monitoring fatigue 
damage and for mapping of stress fields during cyclic mechanical testing. The technique 
has also found applications in the control of hot rolling and welding processes, and in detection 
of surface cracks and surface defects in induction-heated hot-rolled plates (Kubiak 
1968). Recent applications of thermography include on-line detection of hot spots, 
fatigue damage and weld defects in components. Detection of initiation and growth of 
cracks is also possible using this technique (Cielo et al 1987). 

Thermography is used extensively to monitor the structural integrity of electrical 
plants and substations (Venkatraman et al 1992). In this direction, various condition 
monitoring campaigns have been successfully performed by the authors’ laboratory, 
including those on the 230 kV main transformer of PHWRs, the 33 kV switchyard at Indira 
Gandhi Centre for Atomic Research (IGCAR) and the concrete wall of the turbine building of 
the Fast Breeder Test Reactor (FBTR). Similarly, IRT has proved to be indispensable for 
predictive maintenance in the petrochemical industry. In a campaign at a petrochemical 
plant, hot oil heaters, charge heaters and stack duct-to-air preheaters were inspected 
by us. Hot spots were observed in certain zones on the hot oil heaters. They were 
attributed to thinning down of the refractory lining inside the oil heaters, and maintenance 
was recommended during the subsequent shutdown. The above-mentioned campaigns 
have clearly established the capability of the IRT technique for condition 
monitoring. 

Apart from this, thermography is being used as a valuable tool for the investigation of 
weld pools. At our laboratory, studies are being carried out on the adaptability of 
thermography for on-line monitoring of welds (Venkatraman et al 1993a). Thermographic 
methods are widely used in aerospace and transport industries for the testing of 
non-metallic materials such as composites and laminates where conventional NDE 
techniques like RT and UT produce results which are difficult to interpret. IRT has 
proven to be a valuable tool for detection of impact damage, debonds etc. in composites 
(McLaughlin et al 1980). 

2.11 Laser holography 

Holography is becoming more and more popular in NDE as its capabilities are 
understood better. When light is incident on an object, the envelope of the scattered light 
(called the object wavefront) contains the visual information of the object. Holography is 
a technique for recording and reconstructing this object wavefront. The holographic 
interferometry (HI) technique essentially involves the interference of the object beam 
(laser light scattered from the component under investigation) with a reference beam, and 
recording of the interference pattern on a high resolution photographic plate (hologram). 
The information in the hologram is extracted by reconstruction of the image by 
illuminating it with a wave similar to the reference beam. The HI technique, being non-contact, is 
finding place as a complementary or sometimes unique NDE technique in many 
specialised applications. It is being routinely used in the aircraft industry for inspection 
of aircraft tyres, microwave antenna etc. This method has shown promising results in 
the NDE of composite panels, honeycomb structures and bond assessment in adhesively 
bonded structures (Babu Rao & Baldev Raj 1990). HI technique can be used to 
investigate the deformation states on the surfaces of pressure vessels. The defects in 
such components can be differentiated on the basis of their severity. Fringe pattern 
density is used to find the nature of defects (symmetric or asymmetric etc.). We have 
used such methodology for the inspection of pressure vessels (Baldev Raj et al 1986). 
Using this method, the effect of stimuli (like load, temperature etc.) can also be assessed. 
Fabrication and welding processes alter the mechanical properties of the material and 
introduce residual stresses leading to manifestation of non-uniform deformations. HI 
technique is well suited to measure these deformations, establish fitness-for-purpose 
and suggest changes in the manufacturing process and design. HI has been used to 
establish accept/reject criterion based on measurement of residual stress introduced 
during pressure testing in a high energy rate formed stainless steel pressure vessel. 
Good quality vessels are characterised by small and reasonably uniform deformation 
fields. On the other hand, poor quality vessels exhibit large local yielding and much 
greater overall residual strains (Halmshaw 1989). Holographic NDT is particularly 
useful on complex shapes where other NDE methods such as UT and RT are difficult or 
time consuming to apply. 
We have developed a new methodology employing project interferometry for profiling 
of stainless steel fuel cladding tubes of fast reactors. Diameter variations of the order of 
a few microns have been reliably measured (Babu Rao et al 1992). This work established 
the adaptability of this methodology for quality control of tubes towards enhanced 
structural integrity. 

2.12 X-ray diffraction technique 

X-ray diffraction (XRD) technique is widely used for measuring residual stresses (RS) in 
components. RS are generally caused by inhomogeneous plastic strains in a structure. 
Tensile residual stresses are generally detrimental, increasing the susceptibility of 
a component to failures due to such causes as fatigue and stress corrosion. In 
contrast, compressive residual stresses are usually beneficial, tending to reduce these 
susceptibilities (Palanichamy et al 1992b). Thus residual compressive stresses are 
often deliberately created on the surface. The role of NDE in detecting and assessing 
residual stresses is ever increasing. NDE techniques that have shown capability include 
UT, MBN and X-ray diffraction (Palanichamy et al 1992b). 

XRD technique measures the interplanar spacing of the lattice to arrive at elastic 
strains on the surface layers (generally up to 20-30 microns depending on the density) 
of the material and converts these data into stresses. To find stress, it is not necessary to 
obtain the separations of the crystallographic lattice planes in the unstressed condition. 
The X-ray technique can rapidly measure applied and residual stresses on a small area 
on the surface of polycrystalline materials. This technique is based on the fact that the 
shift of location of the diffraction profile (also called the peak position) is proportional 
to the stress value. Thus peak position determination is of fundamental importance in 
X-ray stress analysis. PC-based systems are now available for reliable and rapid 
measurement of stresses (Kurita 1987). 
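
The statement that the unstressed lattice spacing is not required can be illustrated with the widely used sin²ψ form of the multiple exposure method. The Python sketch below is an illustration under stated assumptions only: a biaxial surface stress state, isotropic elastic constants and a linear plot of d against sin²ψ. The function name and the numerical values are hypothetical. The spacing at ψ = 0 (the fitted intercept) is used in place of the unstressed spacing, which introduces an error only of the order of the elastic strain.

```python
import numpy as np

def stress_from_sin2psi(psi_deg, d_spacing, youngs_modulus, poisson_ratio):
    """Surface stress from lattice spacings measured at several tilt angles psi.

    Fits d = intercept + slope * sin^2(psi) and returns
    sigma = E / (1 + nu) * slope / intercept  (same units as E).
    """
    x = np.sin(np.radians(psi_deg)) ** 2
    slope, intercept = np.polyfit(x, d_spacing, 1)
    return youngs_modulus / (1.0 + poisson_ratio) * slope / intercept

# Synthetic check with assumed values: E = 210 GPa, nu = 0.3, sigma = -200 MPa.
E, nu, sigma_true = 210e9, 0.3, -200e6
psi = np.array([0.0, 15.0, 25.0, 35.0, 45.0])
d_normal = 1.1702  # spacing at psi = 0, in angstrom (illustrative)
d_psi = d_normal * (1.0 + (1.0 + nu) / E * sigma_true * np.sin(np.radians(psi)) ** 2)
sigma_fit = stress_from_sin2psi(psi, d_psi, E, nu)
```

A negative slope, as in this synthetic example, corresponds to a compressive stress. In practice the spacings come from the measured peak positions through Bragg's law, so the accuracy of the result rests on the peak position determination discussed above.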

We have successfully used this technique to assess residual stress and post weld heat 
treatment (PWHT) in autogenous butt weld joints of 2.25Cr-1Mo steel tubes which 
will be used in the steam generator assemblies of the prototype fast breeder reactor. The 
XRD data have shown the nature of stress at the weld centre. XRD studies 
have confirmed that the proposed PWHT at 988 K for 30 min is sufficient to remove the 
residual stresses (Bhattacharya et al 1992c). This has been further validated by 
microhardness measurements in weld and parent metal regions. Residual stresses in 
explosively welded aluminium and stainless steel joints have also been analysed (Rai 
et al 1993). Residual stress measurements have been carried out on AISI type 304L 
stainless steel dished ends which failed due to transgranular stress corrosion cracking 
(Rai et al 1994). Residual stress measurements have also been carried out in another 
important structural material, i.e. carbon steel, using XRD, UT and ABN techniques 
(Palanichamy et al 1993). The above studies clearly reveal the capability and potential 
of the XRD technique for the assessment of residual stresses in components, which is essential 
information for structural integrity evaluation. 

3. Concluding remarks 

Table 1. Summary of applicability and capability of various NDT techniques used for assessment of anomalies in materials. 

An overview of the capabilities and limitations of various NDT techniques that can be 
used for assessing the structural integrity of components has been presented. A few case 
studies related to structural integrity, based on the investigations carried out by us, have 
been described. The conventional NDT techniques called the big five, viz. magnetic particle, 
radiography, ultrasonic, eddy current and liquid penetrant techniques, can be adapted for 
many situations. Newer techniques like acoustic emission and thermography are found 
to be extremely useful and effective for assessment of integrity of components because 
of their dynamic nature. Table 1 provides information about the capability and 
applicability of various NDE techniques for the assessment of defects, stresses and 
microstructural degradations in materials. Improvements in the existing 
techniques/methodologies and development of new techniques have paved the way for 
obtaining quantitative information about anomalies such as defects, stresses and 
microstructural degradations for easy, sensitive and reliable structural integrity evaluation. 
A few examples are acoustic microscopy, SAFT, TOFD, eddy current imaging, 
microtomography, Barkhausen noise analysis (magnetic and acoustic) methods, X-ray 
stress analysis etc. In recent years, NDT has benefited much with the advent of digital 
computers, compatible cards and interfacing boards. It is now possible to perform 
computer modelling to simulate real-life situations to study the interactions between 
interrogating fields and anomalies. This enables optimised equipment parameters and 
NDT sensor design for enhancing the sensitivity for anomaly detection. Expert 
systems, neural network analysis and signal processing and analysis procedures along 
with the newer NDT procedures provide added impetus to meet the challenges of 
quality assurance, in-service inspection and failure analysis. However, it is necessary to 
fully appreciate the capabilities and limitations and the complementary nature of 
various NDE techniques to enable accurate, reliable and cost-effective inspection. 


We wish to thank Dr Placid Rodriguez for his keen interest and constant encouragement 
during the course of this work. 

Residual stresses of types II and III and their estimation 

P KESAVAN NAIR and R VASUDEVAN 

Department of Metallurgical Engineering, Indian Institute of Technology, 
Madras 600036, India 

Abstract. Residual stresses (RS) are induced in metallic materials by 
a variety of working and fabrication processes. They are generally classified 
into three categories (types I, II and III) depending upon their range of 
influence. Separation of type I and II stresses often requires stress measurements 
on thin specimens. Type III stresses are related closely to grain 
fragmentation and micro-strains associated with plastic deformation. Perhaps 
the oldest and the most rigorous method of estimating these stresses is 
still the Warren-Averbach analysis developed during the late forties/early 
fifties. Other techniques involving integral breadths and variance of the 
profiles have also been developed. In all these methods developed during 
the early stages, prior to the seventies, a major requirement was well-separated 
non-overlapping profiles. The late sixties and early seventies saw a dramatic 
increase in computational capabilities with the advent of powerful 
electronic computers. This led to the introduction of curve-fitting procedures 
into the field of X-ray diffraction. The most remarkable achievement 
of this period is the development of the Rietveld Method. Although 
this method was initially developed to tackle the neutron diffraction 
profiles, which are as a rule nearly symmetric and Gaussian in nature, the 
method saw rapid developments during the eighties. At present, techniques 
based on concepts developed by Rietveld could be applied to essentially 
asymmetric and non-Gaussian multiple spectral components of X-ray 
diffraction profiles. Pattern decomposition techniques which separate composite 
powder diffraction profiles into individual profiles are now available. 
In combination with single line profile analysis techniques, this provides 
a powerful tool in the hands of researchers. A typical example of such 
a single line profile analysis is given. 

Keywords. Residual stresses; microstresses; microstrain; line profile 
analysis; profile fitting. 


1. Introduction 

Figure 1. Schematic representation of residual stresses of types I, II and III. 

Residual stresses present in a material can be broadly classified into two major 
categories: macro- and micro-residual stresses. The macro-residual stresses are sometimes 
referred to as stresses of type I. Following the latter scheme, the micro-residual 
stresses are further classified into two: stresses of types II and III. The basis of this 
classification is shown in figure 1. The evaluation of type I or macro-residual stresses is 
by now well established (Macherauch et al 1973; Hauk & Macherauch 1984). This 
paper gives an introduction to the more widely used methods for the evaluation of 
stresses of types II and III. Although both these stresses are commonly referred to as 
micro-residual stresses, the estimation of these two types of stresses follows different 
principles. While the evaluation of type III stresses has achieved a relatively high degree 
of sophistication, that of type II stresses is still relatively difficult; accurate and reliable 
methods comparable to those available for the evaluation of the other two types are still 
to be evolved. The principle of X-ray evaluation of type I stresses is based on an accurate 
measurement of interplanar spacings of a particular set of crystallographic planes. 
Since stresses of both types I and II affect these spacings in a similar manner, any attempt to 
evaluate the former without taking into account the effect of the latter is bound to be 
inaccurate. The estimation of type II stresses is discussed in this specific context. 


2. Evaluation of type II stresses 

In general, all the three types of stresses are likely to be present in a polycrystalline bulk 
sample. Stresses of types II and III will directly affect the lattice strains. A possible 
definition of type II stresses could be that they are those stresses existing in the crystals even 
after complete removal of type I stresses. Indeed, strictly speaking, type II stresses will 
vary greatly from grain to grain. In a fine-grained polycrystalline material it becomes 
rather difficult to evaluate such stresses on a grain-to-grain basis. This necessarily 
means that only an average value can be obtained. The method of measurement is the 
usual multiple exposure X-ray diffraction (XRD) technique. Since both type I and type 
II stresses affect the diffraction profile in a similar fashion, the results obtained through 
a routine residual stress measurement through the multiple exposure technique are, in 
general, a sum of these two stresses. The proportion in which they contribute to the 
total can vary greatly. 

Type II stresses can be defined in a more precise way as 

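$$\sigma_{II} = \left[ \frac{\int \sigma \, dA}{\int dA} \right]_{\text{one crystallite}} - \sigma_{I} \qquad (1)$$

where 

$$\sigma = \sigma_{I} + \sigma_{II} + \sigma_{III} \qquad (2)$$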

and the subscripts I, II, and III denote the stresses of types I, II, and III. 

The separation of type I and type II stresses is possible only if the distribution is of 
a specific type, in specimens conforming to definite geometrical requirements (Hauk 
& Nikolin 1988): for example, in strips and sheets, if the distribution is symmetrical, it is 
possible to estimate the distribution across the depth. This stipulation arises from the 
fact that individual estimation of type I or type II stresses can be done only through 
deflection methods. As is well known, the principle behind the estimation of residual 
stresses through deflection methods depends upon relating the dimensional or shape 
changes of a specimen caused by removal of material to the relieved stresses. This in 
turn requires that the geometry of the specimen be simple (Treuting & Read 1951). 

Taking the case of thick strips with a symmetrical stress distribution of type I and 
type II stresses, a procedure having the following steps can be adopted: (1) Successive 
layers are removed from both top and bottom surfaces of the specimen in a symmetrical 
fashion, which ensures that the removal as such will not have any significant effect 
on the subsequent measurements. Consequently, a routine measurement employing 
a multiple exposure technique on the surface layers after symmetrical etching can 
provide type I + type II stress distributions as functions of depth in this type of 
specimen. (2) A straightforward stress estimation employing the deflection methods can 
provide the distribution of type I stresses alone. In this case the removal of material is 
only from one side. The deflection produced can be related to the stresses of type 
I present in the material. (3) Thin specimens are carefully removed from various 
positions of the sample in such a way that the removed thin slice is essentially free of 
type I stresses (Hauk 1983). A routine residual stress measurement employing multiple 
exposure technique can provide the distribution of type II stresses. The essential 
features of the above three steps are illustrated in figure 2. If step 3 could be successfully 
completed, a direct evaluation of type II stresses is obtained, which can be cross 
checked employing the additive nature of type I and type II stresses obtained in steps 
1 and 2. However, it is to be remembered that steps such as the one just detailed are 
rather specific in application. A general method to evaluate the type II stress distribution 
is not yet available. 
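
The additive cross-check between the three steps can be written out as a short Python sketch. It assumes that all three measurements are available at the same depths; the function names, the tolerance and the depth profiles (in MPa) are hypothetical and serve only to show the bookkeeping.

```python
import numpy as np

def type_ii_by_difference(sigma_i_plus_ii, sigma_i):
    """Type II stress from step 1 (type I + type II) minus step 2 (type I alone)."""
    return np.asarray(sigma_i_plus_ii, dtype=float) - np.asarray(sigma_i, dtype=float)

def steps_agree(sigma_ii_direct, sigma_i_plus_ii, sigma_i, tolerance):
    """Cross-check the direct thin-slice result of step 3 against steps 1 and 2."""
    difference = np.asarray(sigma_ii_direct, dtype=float) - type_ii_by_difference(
        sigma_i_plus_ii, sigma_i
    )
    return bool(np.max(np.abs(difference)) <= tolerance)

# Hypothetical depth profiles (MPa) at four depths below the surface.
step1_total = [-120.0, -60.0, 10.0, 40.0]   # X-ray after symmetrical etching
step2_type_i = [-150.0, -80.0, 0.0, 50.0]   # deflection method
step3_type_ii = [28.0, 23.0, 8.0, -12.0]    # X-ray on thin slices
sigma_ii = type_ii_by_difference(step1_total, step2_type_i)
consistent = steps_agree(step3_type_ii, step1_total, step2_type_i, tolerance=5.0)
```

The tolerance would in practice be set from the combined measurement uncertainty of the X-ray and deflection methods.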

It is clear from the above discussions that the estimation of type II stresses involves 
residual stress measurement employing multiple exposure X-ray diffraction techniques. 
All the limitations applicable to the evaluation of residual stresses employing 
this technique, such as linearity of the D vs $\sin^2\psi$ plots, existence of sharp stress 
gradients normal to the surface, coarse grain size etc., can cause problems in these cases 
as well (Hauk 1986). 
Figure 2. Evaluation of various types of stresses on samples with symmetrical distribution of residual stresses across the thickness. (a) Geometry of specimen with residual stress distribution. (b) Location of specimen for evaluation of $\sigma_{I} + \sigma_{II}$ after symmetrical etching. (c) Location of specimen for evaluation of $\sigma_{II}$. (d) Deflection of specimen on etching on one side for estimation of $\sigma_{I}$. 


3. Type III stresses and their evaluation 

Type III stresses, as illustrated in figure 1, have a range of action relatively small when 
compared to the other two. As a consequence, they are estimated as average values. 
Their effect on the diffraction profile is also different. If present, they tend to change the 
shape of the profile, rather than the position. Unfortunately, there are quite a number of 
factors which can affect the shape of the diffraction profile: the dimensions and 
morphology of the coherently diffracting domains, non-stoichiometry, faults, dislocations, 
instrument geometry etc., apart from internal stresses. This list is only illustrative 
and not exhaustive. Although a number of parameters are required to characterize the 
shapes of diffraction profiles, the one most widely used is the breadth of the line profile. 

It is generally accepted that the observed diffraction profile (the $h(2\theta)$ profile) is 
a convolution of the instrument aberration (the $g(2\theta)$ profile) and the pure diffraction 
profile which is due to the specific micro features of the sample (the $f(2\theta)$ profile) (Wilson 
1963; Warren 1969). This can be written as 


$$h(2\theta) = g(2\theta) * f(2\theta) = \int g(2\theta')\, f(2\theta - 2\theta')\, d(2\theta') \qquad (3)$$

Since each point in the observed profile is assumed to be the result of a summation of 
the product of g and f over all possible values of $f(2\theta)$, the computations involved are 
quite time consuming. One option would be to do this computation in Fourier space; 
alternatively it can be done analytically, rather than numerically, through a proper 
choice of functions to represent the individual profiles. Chronologically, the Fourier 
methods found wide acceptance, much before the second option was implemented. 
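
The equivalence of the direct summation and the Fourier-space route can be checked numerically with a minimal Python sketch. The profile shapes, widths and grid below are hypothetical choices made only for illustration; the function names are likewise not from the source.

```python
import numpy as np

def convolve_direct(g, f, step):
    """Convolution integral evaluated as a direct sum of products of g and f."""
    return np.convolve(g, f, mode="full") * step

def convolve_fourier(g, f, step):
    """Same result through the convolution theorem (product of Fourier transforms)."""
    n = len(g) + len(f) - 1  # zero-pad so the circular convolution does not wrap around
    product = np.fft.rfft(g, n) * np.fft.rfft(f, n)
    return np.fft.irfft(product, n) * step

# Hypothetical profiles on a common 2-theta grid (degrees).
step = 0.01
x = np.arange(-2.0, 2.0 + step / 2, step)
g = np.exp(-0.5 * (x / 0.05) ** 2)   # Gaussian instrumental profile
f = 1.0 / (1.0 + (x / 0.10) ** 2)    # Cauchy (Lorentzian) pure profile
h_direct = convolve_direct(g, f, step)
h_fourier = convolve_fourier(g, f, step)
```

The two results agree to rounding error, while the Fourier route scales far better with the number of points. Recovering f from measured h and g (deconvolution) amounts to dividing the transforms instead of multiplying them, which is the basis of the Fourier methods referred to above.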

The components contributing to the diffraction profile as mentioned above are 
discussed individually below. 

3.1 The pure diffraction profile f 

The principal contributor to this profile is the ‘particle size’ or the coherently 
diffracting domain size. This size D is related to the breadth of the size profile through 
the relation named after Scherrer: 

$$\beta(D) = \frac{k\lambda}{D \cos\theta}, \qquad (4)$$

where $\theta$ is the Bragg angle, $\lambda$ the wavelength of the radiation, and k a constant close to 1. 

The second contributing factor to the f profile is the microstrain or the residual stresses of 
type III. This also broadens the diffraction profile according to the relation: 

$$\beta(e) = 4e \tan\theta, \qquad (5)$$

where e is the microstrain. 

It is generally believed that both these profiles are symmetrical in shape. The former, due 
to particle size, is assumed to have a Cauchy (Lorentzian) profile and the latter a Gaussian 
profile (Halder & Wagner 1966; Nandi & Sengupta 1978; de Keijser et al 1982). 
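
Relations (4) and (5) can be applied directly once a breadth has been attributed entirely to size or entirely to strain. The Python sketch below assumes breadths expressed in radians on the 2θ scale and already corrected for instrumental broadening; the function names and the numerical values are hypothetical.

```python
import math

def scherrer_size(breadth_rad, theta_rad, wavelength, k=1.0):
    """Domain size D from the size breadth: D = k * lambda / (beta * cos(theta))."""
    return k * wavelength / (breadth_rad * math.cos(theta_rad))

def microstrain(breadth_rad, theta_rad):
    """Microstrain e from the strain breadth: e = beta / (4 * tan(theta))."""
    return breadth_rad / (4.0 * math.tan(theta_rad))

# Round-trip example with assumed values (wavelength in angstrom, Bragg angle 20 degrees).
wavelength = 1.5406
theta = math.radians(20.0)
beta_size = wavelength / (500.0 * math.cos(theta))   # breadth of a 500 angstrom domain
beta_strain = 4.0 * 0.002 * math.tan(theta)          # breadth for a strain of 0.002
size = scherrer_size(beta_size, theta, wavelength)
strain = microstrain(beta_strain, theta)
```

The different angular dependences (1/cos θ for size, tan θ for strain) are what make it possible in principle to separate the two contributions when several reflections are available.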

It is to be remembered that other factors such as compositional inhomogeneities 
might also affect the breadths of this profile. But generally such effects are assumed to 
be negligible. 

Although the term particle size is used to refer to the coherently diffracting domain 
size, the terminology is valid perhaps only in the case of inorganic materials where 
strain effects are negligible. The apparent domain size derived from the breadth of the 
diffraction profile is a mean length normal to the diffracting planes, averaged over the 
entire irradiated diffracting volume of the sample. It is to be remembered that size 
broadening depends strongly on the shape or form of the domain. Consequently the 
breadth of the profile is likely to be anisotropic in the sense that there might be 
considerable variations depending upon the reflections considered. 

The breadth is usually either the full width at half maximum (FWHM) or the
integral breadth (IB). While the former is not amenable to any direct physical
interpretation, the latter, in reciprocal space, can be thought of as the reciprocal of the
volume-weighted average crystallite size, i.e.

$$D^{-1} = \beta^{*} = \beta\cos\theta/\lambda, \tag{6}$$

where $\beta$ is the integral breadth on a $2\theta$ scale and $\beta^{*}$ is the integral breadth on a $d^{*}$ (i.e.
$\sin\theta/\lambda$) scale.

3.2 The strain broadening 

As mentioned before, the broadening due to the lattice strain is considered to be nearer
to Gaussian in shape. However, due to the anisotropy of elastic properties, the breadths
need not vary smoothly with Bragg angle (Delhez et al 1980). The breadth will thus
depend strongly upon the particular reflection considered.

3.3 The instrumental profile g 

The instrumental profiles vary smoothly with angle (Cox et al 1988). The profile shape 
may also vary between pure Gaussian and pure Cauchy. At low angles, geometric 
aberrations are predominant and at high angles, effects due to wavelength distributions 
predominate. Usually the g profile is experimentally obtained as the profile from 
a well-annealed defect-free standard sample. 

3.4 Separation of size and strain effects 

3.4a Methods based on integral breadths: Classical or conventional methods require
at least two orders of the same reflection in order to separate the size and
strain effects. However, before attempting such a separation it is essential that the
instrumental effects are eliminated from the observed profile. For this purpose the
experimentally obtained $g$ profile from an appropriate standard material is used. As
a first step the effect due to the doublets is eliminated through analytical procedures
such as Rachinger's method (Pease 1948; Rachinger 1948), and the breadth of the pure diffraction
profile $f$ is obtained through any of the following relationships (Jagodzinski 1949;
Anantharaman & Christian 1956; Taylor 1961):

$$\beta = B - b, \tag{7}$$

$$\beta = B - b^{2}/B, \tag{8}$$

$$\beta^{2} = B^{2} - b^{2}, \tag{9}$$

where $\beta$ is the corrected breadth, and $B$ and $b$ are the breadths of the observed and instrumental
profiles after eliminating the wavelength dispersion effects. Equation (7) assumes that
both the observed and instrumental profiles are Lorentzian and (9) assumes that both are Gaussian in
shape. Equation (8) yields a value in between. This corrected breadth $\beta$ includes the
effects of both size and strain.
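The three corrections (7)-(9) can be compared side by side with a small helper. This is an illustrative sketch; the function name and the input check are assumptions, not part of the source.

```python
import math

def corrected_breadths(B, b):
    """Return the corrected breadth according to eqs (7), (8) and (9).

    B: observed breadth, b: instrumental breadth (same units, B > b > 0).
    """
    if not B > b > 0:
        raise ValueError("expected B > b > 0")
    lorentzian = B - b                    # eq. (7): both profiles Lorentzian
    intermediate = B - b * b / B          # eq. (8): intermediate case
    gaussian = math.sqrt(B * B - b * b)   # eq. (9): both profiles Gaussian
    return lorentzian, intermediate, gaussian
```

For any valid pair the value from (8) lies between those from (7) and (9), in line with the statement above.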

The procedure for obtaining the strain or particle size from this corrected breadth
involves the assumption that the breadth of the size profile $\beta_D$ and that of the strain
profile $\beta_e$ are related through (Anantharaman 1959; Rama Rao & Anantharaman 1963)

$$\beta^{2} = \beta_D^{2} + \beta_e^{2}. \tag{10}$$

This, when combined with (4) and (5) above, yields a linear relationship between
$(\beta\cos\theta/\lambda)^{2}$ and $(\sin\theta/\lambda)^{2}$ with a slope of $16e^{2}$ and an intercept equal to the reciprocal of the
square of the apparent crystallite size. It is the necessity of plotting (10) to obtain the
slope and intercept that dictates that at least two orders of a reflection be available.
Alternatively, if the sample is reasonably isotropic, one can use the breadths of other
reflections as well for the above plot. The size obtained in this manner includes effects
due to stacking faults, dislocations etc.
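The plot described above amounts to a straight-line fit. The following sketch performs an ordinary least-squares fit of $(\beta\cos\theta/\lambda)^{2}$ against $(\sin\theta/\lambda)^{2}$, taking $k = 1$; the function name and the error handling are assumptions added for illustration.

```python
import math

def size_strain_from_breadths(two_theta_deg, beta_deg, wavelength):
    """Fit (beta cos(theta)/lambda)^2 = 1/D^2 + 16 e^2 (sin(theta)/lambda)^2.

    two_theta_deg: peak positions in degrees 2-theta (at least two reflections)
    beta_deg:      corrected breadths in degrees 2-theta
    wavelength:    same length unit as the returned size D
    Returns (D, e).
    """
    xs, ys = [], []
    for tt, b in zip(two_theta_deg, beta_deg):
        theta = math.radians(tt / 2.0)
        beta = math.radians(b)
        xs.append((math.sin(theta) / wavelength) ** 2)
        ys.append((beta * math.cos(theta) / wavelength) ** 2)
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = my - slope * mx
    if slope < 0 or intercept <= 0:
        raise ValueError("breadths are not consistent with eq. (10)")
    return 1.0 / math.sqrt(intercept), math.sqrt(slope / 16.0)
```

With only two orders of a reflection the line passes exactly through both points, so the result is as sensitive to errors in the breadths as the plot itself.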

Yet another method of obtaining the pure diffraction profile in a more direct fashion
is the well-known Stokes correction (Stokes 1948; Warren 1969). Here the removal of
effects due to the instrumental profile is carried out in Fourier space by expressing the
observed profile and the experimentally obtained $g$ profile as Fourier series. The Fourier
coefficients of the $f$ profile can be shown to be given by

$$F(n) = H(n)/G(n), \tag{11}$$

where $n$ is the harmonic number and $F(n)$, $G(n)$ and $H(n)$ are the respective Fourier
coefficients of the $f$, $g$ and $h$ profiles. This method is completely rigorous and makes no
special assumptions regarding the profile shape. Once the coefficients of the pure
diffraction profile are obtained, the latter can be synthesized and treated in a similar
fashion as above. However, a more elegant way of separating the strain and size effects
is the Warren-Averbach method, discussed below (Warren & Averbach 1950, 1952).
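Equation (11) can be illustrated on synthetic data. The sketch below uses a plain discrete Fourier transform on periodic, equally spaced samples; the helper names, the synthetic Gaussian profiles and the small cut-off `floor` (which avoids dividing by numerical noise) are assumptions and not part of the Stokes procedure as described in the text.

```python
import cmath
import math

def dft(samples):
    """Plain discrete Fourier transform (O(n^2), adequate for a short profile)."""
    n = len(samples)
    return [sum(s * cmath.exp(-2j * math.pi * k * i / n) for i, s in enumerate(samples))
            for k in range(n)]

def stokes_coefficients(h, g, floor=1e-12):
    """Eq. (11): F(n) = H(n) / G(n) for profiles sampled on the same grid.

    Harmonics where |G(n)| is below `floor` are set to zero (assumed cut-off).
    """
    H, G = dft(h), dft(g)
    return [Hn / Gn if abs(Gn) > floor else 0j for Hn, Gn in zip(H, G)]

def periodic_gaussian(n, sigma):
    """Gaussian centred on sample 0 and wrapped around the n-point interval."""
    return [math.exp(-0.5 * (min(i, n - i) / sigma) ** 2) for i in range(n)]

def circular_convolution(g, f):
    """Periodic convolution, used here only to build a synthetic observed profile."""
    n = len(g)
    return [sum(g[j] * f[(i - j) % n] for j in range(n)) for i in range(n)]

n = 64
f_true = periodic_gaussian(n, 3.0)    # synthetic pure profile
g = periodic_gaussian(n, 1.5)         # synthetic instrumental profile
h = circular_convolution(g, f_true)   # synthetic observed profile
F = stokes_coefficients(h, g)         # recovered coefficients of the pure profile
```

With measured data the high harmonics of $G(n)$ become small and noisy, so the division amplifies noise; the cut-off used here is only one simple way to handle that.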

3.4b The Warren-Averbach method: The real Fourier coefficients of the pure
diffraction profile can be related to the coefficients of the size and strain profiles through

$$A(L) = A^{D}(L)\,A^{e}(L), \tag{12}$$

where $L$ refers to a distance normal to the diffracting planes and the superscripts $D$ and
$e$ to particle size and strain respectively. For small values of $L$, the coefficients of the size
and strain profiles can be approximated by

$$A^{D}(L) = 1 - L/D_{hkl}, \tag{13}$$

$$A^{e}(L) = 1 - 2\pi^{2}L^{2}\left[\langle e^{2}\rangle - \langle e\rangle^{2}\right]S_0^{2}, \tag{14}$$

where $D_{hkl}$ is the domain size normal to the reflecting planes $hkl$ and $e$ is the microstrain
or type III strain. $S_0^{2}$ is given by

$$S_0^{2} = (2\sin\theta/\lambda)^{2} = (h^{2} + k^{2} + l^{2})/a^{2}, \tag{15}$$

the second equality holding for a cubic crystal with lattice parameter $a$.
Combining (12)-(14) for small values of $L$, one obtains

$$\ln A(L) = \ln A^{D}(L) - 2\pi^{2}L^{2}\left[\langle e^{2}\rangle - \langle e\rangle^{2}\right]S_0^{2}. \tag{16}$$

Thus a plot of $\ln A(L)$ against $S_0^{2}$ will be linear, with the intercept giving $\ln A^{D}(L)$,
the particle size coefficient. The slope of this plot is a measure of the microstrain, given
by $[\langle e^{2}\rangle - \langle e\rangle^{2}]$:

$$\langle e_L^{2}\rangle^{1/2} = \left[\langle e^{2}\rangle - \langle e\rangle^{2}\right]^{1/2} = \frac{\left[\ln A^{D}(L) - \ln A(L)\right]^{1/2}}{\sqrt{2}\,\pi L S_0}, \tag{17}$$

where $e_L$ is the component of the strain normal to the reflecting plane, averaged over
the distance $L$. This strain is a function of the averaging distance and usually decreases
with $L$, reaching an asymptotic value for large $L$. An accurate and acceptable
interpretation of this plot is still not available.
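For a single value of $L$, the separation in (16) reduces to a straight-line fit of $\ln A(L)$ against $S_0^{2}$ over the available orders of a reflection. The sketch below assumes the Fourier coefficients have already been obtained; the function name and argument layout are hypothetical.

```python
import math

def warren_averbach_at_L(L, s0_squared, ln_A):
    """Least-squares line through (S0^2, ln A(L)) for one value of L, eq. (16).

    Returns (A_D, rms_strain): the size coefficient A^D(L) from the intercept
    and the root-mean-square strain from the slope -2 pi^2 L^2 [<e^2> - <e>^2].
    """
    n = len(s0_squared)
    mx = sum(s0_squared) / n
    my = sum(ln_A) / n
    sxx = sum((x - mx) ** 2 for x in s0_squared)
    sxy = sum((x - mx) * (y - my) for x, y in zip(s0_squared, ln_A))
    slope = sxy / sxx
    intercept = my - slope * mx
    strain_variance = -slope / (2.0 * math.pi ** 2 * L ** 2)
    # A slightly positive slope (noise) is clipped to zero strain.
    return math.exp(intercept), math.sqrt(max(strain_variance, 0.0))
```

Repeating the fit for a series of $L$ values gives $A^{D}(L)$ and the strain as functions of $L$; the initial slope of $A^{D}(L)$ then yields $D_{hkl}$ through (13).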

3.5 Limitations on the application of integral breadth and Fourier methods 

Both integral breadth methods and Fourier methods presume the existence of well-separated
and independent reflections from the sample. All too often situations arise where the peaks
are extensively overlapped by neighbouring reflections. Very often even moderately
well-resolved reflections might be absent. Such complex patterns are very common in
modern materials such as composites, ceramics, polymers etc. The reflections may arise
not only from well-crystallized grains, but also from quasicrystalline, nanocrystalline or
amorphous materials. Analysis of such patterns is becoming a necessity in the present-day
context. Fortunately, pattern decomposition methods are now available which
enable one to handle such situations with a reasonable degree of confidence.

3.6 Methods based on profile refinement


As mentioned above, most of the classical methods require well-resolved powder
diffraction profiles and input data of high quality. These two requirements often lead to
a dead end. Profile analysis gained new life, and interest in it was
renewed, with the introduction of least-squares structure refinement based on concepts
originally proposed by Rietveld (1967, 1969). The basic idea is to refine the parameters
by least squares until the best fit is obtained between the entire observed diffraction
pattern and a calculated pattern based on simultaneously refined parameters
which affect the diffractogram in one way or another. Factors such as crystal
structure, instrumental effects, specimen imperfections or any other effect can be
included so long as they can be modelled. An important feature of this method is the
feedback during the refinement process itself between the improving goodness of fit and
the relevant structural parameters. What is refined are the parameters themselves and
not the profile. Although the method was originally proposed for the analysis of
neutron diffraction profiles, it was quickly realized that its domain of application is
much wider. In the present context the method has high potential to provide data on the
structural imperfections of the material to be analysed.

An essential requirement for this purpose is to ascribe a suitable analytical function
to the line profile. Early workers, as already mentioned, assumed a wholly Gaussian or
a wholly Lorentzian profile. However, it was later shown that deficiencies exist in both these
cases (Suortti et al 1979; Young 1980). At present it is recognized that a pseudo-Voigt or
a Pearson VII profile shape function is the most suitable for the
purpose (Hall et al 1977; Young et al 1982; Thompson et al 1987). The Voigt function is
the result of an analytical convolution of a Gaussian and a Lorentzian,

$$V(x) = \int C_V(z)\,G_V(x - z)\,\mathrm{d}z, \tag{18}$$

with

$$C_V(x) = \left[1 + \pi^{2}x^{2}/\beta_C^{2}\right]^{-1} = \left[1 + 4x^{2}/(2w_C)^{2}\right]^{-1}$$

and

$$G_V(x) = \exp\left(-\pi x^{2}/\beta_G^{2}\right) = \exp\left[(-4\ln 2)\,x^{2}/(2w_G)^{2}\right],$$

where $C_V$ and $G_V$ indicate the Cauchy and Gaussian components having integral
breadths of $\beta_C$ and $\beta_G$ respectively, with $2w_C$ and $2w_G$ as the corresponding full widths at
half maximum. $I_0$ is the peak maximum intensity (at $x = 0$) and $\beta$ the integral breadth of
the Voigt profile $V(x)$. As can be seen, the range of profiles that can be handled varies
from pure Gaussian to pure Cauchy, depending on the ratio of the component widths.
The pseudo-Voigt function, on the other hand, provides for a mixing parameter in the
form of fractions of superposed Lorentzian and Gaussian components needed to fit an
observed profile (Hindeleh & Johnson 1972),

$$pV(x) = I_0\left[C_{pV}(x) + G_{pV}(x)\right], \tag{19}$$

where

$$C_{pV}(x) = \eta\left[1 + \pi^{2}x^{2}/\beta_{C'}^{2}\right]^{-1},$$

$$G_{pV}(x) = (1 - \eta)\exp\left[-\pi x^{2}/\beta_{G'}^{2}\right].$$

The subscripts $C'$ and $G'$ denote that the respective integral breadths refer to Cauchy
and Gaussian profiles respectively. $\eta$ is the fraction of the Cauchy component required
to obtain the best fit during the least-squares process. In most of the modern computer
programs this fraction itself is one of the parameters to be refined during execution.
Since the Gaussian is a relatively broad profile and the Lorentzian a narrow one, the
mixing of the two can represent a symmetric profile. The Pearson VII is also
a symmetric profile function which can equally well represent a symmetric diffraction
profile:

$$pVII(x) = I_0\left(1 + x^{2}/ma^{2}\right)^{-m}, \quad 1 \le m \le \infty, \tag{20}$$

with

$$a = \beta\,\Gamma(m)\big/\left[(m\pi)^{1/2}\,\Gamma(m - 1/2)\right],$$

where $m = 1$ implies a pure Cauchy profile and $m = \infty$ a pure Gaussian one. The
Gamma function is denoted by the symbol $\Gamma$. Both Voigt and Pearson VII functions
can represent asymmetrical profiles by employing the respective split-type functions.
In this case, the diffraction profile is split into a low-angle side and a high-angle side
at the peak maximum. Two different Pearson VII or Voigt functions, as the case may
be, are then fitted on either side. In other words, the profile is represented by two
functions having a common peak maximum position and intensity (Snyder 1993).
However, they may have different values of full width at half maximum, decay rates,
and other profile parameters. Further, the functions denoted by $P(x)_{\mathrm{low}}$ and $P(x)_{\mathrm{high}}$,
representing the low- and high-angle sides of the profile, should satisfy the following
condition:



$$\int_{-\infty}^{0} P(x)_{\mathrm{low}}\,\mathrm{d}x + \int_{0}^{+\infty} P(x)_{\mathrm{high}}\,\mathrm{d}x = 1. \tag{21}$$


In most of these refinement procedures a number of profile parameters like FWHM, 
the integrated intensity, the peak position etc. are simultaneously refined and final 
optimized values obtained. Once an optimized profile fit is obtained, the Gaussian and 
the Cauchy components of the profile can be separated, and the size and strain 
calculated based on (4) and (5) assuming the Cauchy breadth for size and the Gaussian 
breadth for strain (de Keijser et al 1982). 
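The profile shape functions (19) and (20) can be written down directly. The sketch below is illustrative only: the function names and argument order are assumptions, and `math.lgamma` is used instead of the Gamma function itself simply to avoid overflow for large $m$.

```python
import math

def pseudo_voigt(x, i0, eta, beta_c, beta_g):
    """Eq. (19): Cauchy fraction eta and Gaussian fraction (1 - eta).

    beta_c and beta_g are the integral breadths of the two components.
    """
    cauchy = eta / (1.0 + (math.pi * x / beta_c) ** 2)
    gauss = (1.0 - eta) * math.exp(-math.pi * (x / beta_g) ** 2)
    return i0 * (cauchy + gauss)

def pearson_vii(x, i0, beta, m):
    """Eq. (20), with the width parameter a expressed through the integral breadth beta."""
    # lgamma avoids overflow of Gamma(m) when m is large
    a = beta * math.exp(math.lgamma(m) - math.lgamma(m - 0.5)) / math.sqrt(m * math.pi)
    return i0 * (1.0 + x * x / (m * a * a)) ** (-m)
```

In a refinement, `eta` (or `m`) would be one of the adjustable parameters alongside the peak position, intensity and breadths. Integrating (19) term by term gives an integral breadth of $\eta\beta_{C'} + (1 - \eta)\beta_{G'}$ for the pseudo-Voigt.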

Another approach is to obtain the Cauchy and Gaussian integral breadths from
the experimental values of the integral breadth ($\beta$) and the ratio ($\phi$) of the full width
at half maximum to the integral breadth of the $g$ and $h$ profiles. This can be done
graphically (Langford et al 1988) or using empirical equations such as (de Keijser
et al 1982),

$$\beta_C = \beta\left(2.0207 - 0.4803\,\phi - 1.7756\,\phi^{2}\right), \tag{22}$$

$$\beta_G = \beta\left[0.6442 + 1.4187\,(\phi - 2/\pi)^{1/2} - 2.2043\,\phi + 1.8706\,\phi^{2}\right]. \tag{23}$$

Figure 3. The experimental profile from electroless Ni-P deposit with 4.4
wt.% phosphorus after annealing at 400°C for 2 h.


Once the Cauchy and Gaussian breadths of the experimental $g$ and $h$ profiles are known,
those of the $f$ profile can be calculated employing equations similar to (7) and (9).
Further, the crystallite size and strain can be estimated using the Cauchy and
Gaussian breadths of the $f$ profile in (6) and (5) respectively.
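The two steps just described, splitting each measured breadth with the empirical equations (22) and (23) and then removing the instrumental part with (7) and (9), can be sketched as follows. The coefficients are copied as printed above; the function names and the guard against a slightly negative square-root argument are assumptions.

```python
import math

def cauchy_gauss_breadths(beta, fwhm):
    """Eqs (22) and (23): Cauchy and Gaussian integral breadths of a Voigt profile.

    beta: integral breadth, fwhm: full width at half maximum (same units).
    """
    phi = fwhm / beta
    beta_c = beta * (2.0207 - 0.4803 * phi - 1.7756 * phi ** 2)
    root = math.sqrt(max(phi - 2.0 / math.pi, 0.0))  # guard: phi slightly below 2/pi
    beta_g = beta * (0.6442 + 1.4187 * root - 2.2043 * phi + 1.8706 * phi ** 2)
    return beta_c, beta_g

def pure_profile_breadths(h_breadths, g_breadths):
    """Remove the instrumental part from (Cauchy, Gaussian) breadth pairs.

    Cauchy breadths subtract linearly, as in eq. (7); Gaussian breadths
    subtract in quadrature, as in eq. (9).
    """
    (hc, hg), (gc, gg) = h_breadths, g_breadths
    return hc - gc, math.sqrt(max(hg ** 2 - gg ** 2, 0.0))
```

In the limiting cases the empirical fits return, to within about one per cent, a purely Cauchy breadth for $\phi = 2/\pi$ and a purely Gaussian breadth for $\phi = 2(\ln 2/\pi)^{1/2}$.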

Alternately, if two orders of the reflection are available, the integral breadths of the 
total profile can be straightaway made use of to obtain the size and strain components. 

A cross-comparison between the results of a multiple line analysis and single line
analysis might not produce identical values for profile parameters. This is mainly
because the weightage of the Gaussian and Cauchy contents for different orders of
reflection will not, in general, be the same (de Keijser et al 1983).

Figure 4. The experimental profile from electroless Ni-P deposit with 4.4
wt.% phosphorus after annealing at 400°C for 2 h (enlarged intensity scale).

A typical example of a single line size-strain analysis is given below in order to 
demonstrate the capabilities of the approach. 

3.7 Profile separation and single line profile analysis: An example 

In figure 3 an experimental profile obtained from a Ni-P electroless deposit on a mild
steel substrate is presented. The diffractogram was obtained after annealing the sample
for 2 h at 400°C. The P content of the deposit was 4.4%. As is evident, there is extensive
overlap between the main high-intensity Ni(111) profile and the emerging Ni$_3$P peaks.
The same pattern is shown on an enlarged intensity scale in figure 4, to give a better
appreciation of the extensive overlap between the profiles. The profile was deconvoluted with a
least-squares fitting procedure employing the pseudo-Voigt function (Toraya 1986). The
separated profiles are presented in figure 5. As mentioned earlier, a number of
parameters are simultaneously refined. The final refined parameters are presented
in table 1.

For the purpose of single line analysis, annealed pure Ni powder was used as 
a standard. The refined parameters obtained from this profile (not shown) were used for 
correcting the observed profile breadths and the size and strain calculated employing 
(4) and (5). 
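The last step of this example can be retraced from the numbers in table 1. The sketch below assumes $k = 1$ in (4), the alpha-1 peak position and the L1 wavelength from the table; with these assumptions the tabulated particle size and microstrain appear to be reproduced to the quoted precision. The function name is hypothetical.

```python
import math

def single_line_size_strain(beta_cauchy_deg, beta_gauss_deg, two_theta_deg, wavelength):
    """Size from the Cauchy breadth, eq. (4) with k = 1, and strain from the
    Gaussian breadth, eq. (5). Breadths are in degrees 2-theta; the size is
    returned in the unit of the wavelength."""
    theta = math.radians(two_theta_deg / 2.0)
    size = wavelength / (math.radians(beta_cauchy_deg) * math.cos(theta))
    strain = math.radians(beta_gauss_deg) / (4.0 * math.tan(theta))
    return size, strain

# Values read from table 1: Cauchy and Gaussian breadths of the pure diffraction
# profile, alpha-1 peak maximum position, and wavelength L1 (angstrom).
size, strain = single_line_size_strain(0.1888990, 0.2057786, 52.504, 1.788965)
print(f"particle size = {size:.0f} angstrom, microstrain = {strain:.3e}")
```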

Figure 5. The deconvoluted individual diffraction profiles and total profile
corresponding to figure 4.

Table 1. Parameters from profile fitting and results of single line analysis.

Parameters from profile fitting
(parameters are in degrees, two theta scale)

Profile shape: pseudo-Voigt
Radiation used: cobalt
L1 = 1.788965, L2 = 1.792850, K = 0.4970
Theta range: 48.00-56.00    Step: 0.03
B1 = 30.03    B2 = 0.00    B3 = 0.00

Profile identification: Ni(111)

Integrated intensity                 5179.866000
Peak intensity (A1 + A2)            10018.730000
Peak intensity position                52.529820
Integral breadth (A1 + A2)              0.517018
Alpha 1 and alpha 2 separation          0.122791

                              Alpha 1        Alpha 2
Integrated intensity      3453.244000    1726.622000
Peak intensity            7273.720000    3612.031000
Peak maximum position       52.504000      52.626790
Integral breadth             0.474756       0.478020
FWHM                         0.357625       0.357625
Cauchy content               0.519539       0.537562
Gaussian content             0.480461       0.462438
Cauchy breadth               0.309947       0.320140
Gaussian breadth             0.249624       0.244629

Results of single line analysis

Profile identification: Ni(111)

Integral breadth of diffraction profile     0.3875569
FWHM of diffraction profile                 0.3038880
Cauchy breadth of diffraction profile       0.1888990
Gaussian breadth of diffraction profile     0.2057786
Cauchy content of diffraction profile       0.5129336
Gaussian content of diffraction profile     0.4870664

Particle size = 605 Å
Micro strain = 1.821E-03

4. Concluding remarks

Procedures for separating type I and type II stresses are discussed. The separation
procedure is a combination of X-ray diffraction and mechanical dissection type of
techniques. Consequently a complete distribution of type II stresses can be obtained
only under limited conditions. The estimation of type III stresses has made remarkable
advances in recent years. Introduction of the Rietveld concepts into profile analysis has
given greater flexibility in their estimation.

Creep crack growth in power plant materials

Abstract. Economic considerations have made it desirable to extend the
30 to 40 year operating life of power plants by another 10 to 20 years. Crack
growth at elevated temperatures is an important consideration in estimating
the remaining life, determining operating conditions and deciding
inspection criteria and intervals for power plant materials. This paper
presents an overview of high-temperature crack growth phenomena in
such materials. The focus is on various techniques used for characterizing
creep crack growth (CCG) and creep-fatigue crack growth (CFCG) in
high-temperature materials. The collection of data, their analysis and the
interpretation of results are discussed in detail, especially for CFCG laboratory
testing. The discussion is primarily focussed on creep-ductile materials
such as those used in power plant applications. Special considerations for
elevated temperature crack growth in weldments are also presented. Finally,
the application of these concepts to the life prediction of power plant
components is also discussed.

Keywords. Creep crack growth; power plant applications; creep-fatigue 
crack growth; high temperature crack growth; life of power plants. 

1. Introduction 

A significant number of power plants all over the world have been in operation for 
durations close to their design lives of 30 to 40 years. Due to economic considerations it 
is desirable to extend their operating life by another 10 to 20 years. According to 
economic studies, the cost of life extension of a typical power plant is only 20% to 30% 
of the cost of building a new plant whereas the benefits are much higher (Dooley 
& Viswanathan 1987). In order to ensure that power plant components are retired for
cause rather than on the basis of their 'design life', a remaining-life assessment must be
made. Crack growth at elevated temperatures becomes an important consideration in
the estimation of remaining life, in the determination of operating conditions and in
deciding inspection criteria and intervals. Fracture mechanics approaches that have
been developed over the past few decades have proven useful in providing a rigorous
framework for accomplishing these goals.

In the next few sections, an overview is provided of the materials commonly used in fossil power
plant components, the service conditions they are exposed to and the types of failures
that are commonly encountered during service. This information is used as
the basis to provide motivation for the use of remaining crack growth life assessment
technology for life prediction and for deciding safe inspection intervals for components in
these power plants. The relevance of high-temperature crack growth and fracture
mechanics approaches in this context is highlighted.

1.1 Materials used in power plant components 

Power plant components are made of a variety of materials which are capable of 
withstanding a wide range of operating conditions in the plant, depending upon the 
local conditions of pressure, temperature and chemical environment (Viswanathan 
1989). Low-alloy ferritic steels containing carbon, molybdenum and/or vanadium 
constitute the bulk of the material used in steam power plants (Viswanathan 1989). 
High-alloyed tempered martensitic steels containing ~ 12% Cr along with Ni and Co 
based superalloys are the other widely used materials. Table 1.1 in Altekan (1980) gives 
the properties required for various components in steam power plants and the 
materials used for their construction. 

The materials mentioned above are subjected to demanding pressure and temperature
conditions, which are summarized by Jaffee (1979) for a steam power plant. These
demanding conditions lead to the degradation of these materials and consequently to
the partial or complete failure of the component.

1.2 Definitions of failure of components 

While catastrophic fracture or rupture is the most evident form of failure, table 1
(Viswanathan & Dooley 1986) shows the various failure criteria and definitions used by
the industry to retire a component before such failures occur. Failure of a component
may generally be defined as the inability to perform its intended function reliably,
economically and/or safely (Viswanathan 1989).

Table 1. Failure criteria and definitions of component life.

History-based criteria
    30 to 40 years have elapsed
    Statistics of prior failures indicate impending failure
    Frequency of repair renders continued operation uneconomical
    Calculations indicate life exhaustion

Performance-based criteria
    Severe loss of efficiency indicating component degradation
    Large crack manifested by leakage, severe vibration, or other malfunction
    Catastrophic burst

Inspection-based criteria
    Dimensional changes have occurred, leading to distortions and changes in clearances
    Inspection shows microscopic damage
    Inspection shows crack initiation
    Inspection shows large crack approaching critical size

Criteria based on destructive evaluation
    Metallographic or mechanical testing indicates life exhaustion

From Viswanathan & Dooley (1986)

1.2a Crack growth at elevated temperatures: A number of crack growth mechanisms 
operate at the elevated temperatures to which power plant components are subjected. 
These include creep, creep-fatigue, oxidation, stress corrosion cracking, corrosion- 
fatigue and thermal-fatigue. Often more than one of these operate simultaneously and 
probably have synergistic effects on the rate of crack growth and, consequently, on the 
material degradation. Hence, the study of elevated temperature crack growth is vital for 
understanding failure of power plant components. 

1.2b Retirement of components from service: Economics rather than technical need 
has often formed the basis for decisions to retire components from service in the past 
because an investigation into the cost to benefit ratio of keeping the component in 
service may prove expensive. This is due to the high cost of carrying out remaining-life 
calculations based on the operating history, regular inspections and material testing. 
Further, if the decision arrived at after expensive analysis is not favorable, the economic
penalties can be severe. Life prediction methods must therefore be both accurate and
inexpensive to perform.

1.3 Motivation for life prediction 

Due to the high expenses associated with a forced plant outage, the conservative 
practice has been to replace certain components, or those which have a history of 
failure, after a fixed interval of time regardless of their actual condition. However, with 
the recent developments in areas such as fracture mechanics, materials testing and 
NDE techniques, confidence has increased in the application of these concepts for life 
prediction and consequently extending component life beyond their ‘design life’ and up 
to their ‘actual life’. Some other factors which have contributed to the desire of 
extending plant life and consequently encouraging the use of remaining-life-assessment 
technology are (Viswanathan 1989): 

(a) the escalating cost of new construction and diminishing capital resources; 

(b) extended lead times in plant construction; 

(c) uncertainties in projected demand growth rates; 

(d) limited availability of suitable sites for new construction; 

(e) increasingly stringent environmental, safety and other regulations. 

A very important use of life-prediction procedures, besides extending plant life, is the
determination of safe intervals for inspection, repair and maintenance. Thus, in order to
carry out these steps of remaining-life-assessment technology, the study of high-temperature
crack growth in power plant materials becomes very important.

1.4 Objectives of this paper 

In keeping with the current needs of the power generation industry to extend the
operating life of existing power plants, a study of high-temperature crack growth
phenomena in power plant materials is presented. Various techniques developed for
characterizing elevated temperature crack growth are discussed with emphasis
on data collection, analysis and interpretation of results. Essentially, fracture mechanics
approaches are presented. Creep crack growth (CCG) and creep-fatigue
crack growth (CFCG) are discussed in detail with emphasis on the latter. The
discussion is primarily focussed on creep-ductile materials$^1$. Special considerations for
high-temperature crack growth in weldments are also discussed. Finally, all
these concepts are applied to life prediction of power plant components like
steam-turbine casings. This presentation is intended to be application oriented. Some
recommendations for future research in this area are also provided.


2. Review of time-dependent crack-tip parameters 

Fracture mechanics approaches are extrapolative i.e. the results obtained using small 
laboratory specimens can be “extrapolated” to make predictions in actual components. 
Various crack-tip parameters have been proposed which characterize the state of stress 
at the crack tip. The underlying idea behind these crack-tip parameters is that they act 
as a ‘transfer function’ from a specimen in the laboratory to actual structural behaviour. 
Identical conditions of stress and deformation at the crack tip result in the same value 
of these crack-tip parameters regardless of the shape and size of the specimens or the 
structures. 

When a cracked body is subjected to constant load at elevated temperatures, the
initial stress-strain response due to sudden loading is similar to that under subcreep
temperatures. A $1/r^{1/2}$-type and an HRR-type (Hutchinson-Rice-Rosengren type) singular
stress field are formed in elastic and elastic-plastic materials, respectively. However,
with the passage of time, creep strains (inelastic in nature) begin to accumulate. Their
magnitude near the crack tip is relatively large because of the high level of initial
stresses. This causes the crack-tip stresses to relax, and the initial strong singularity at
the crack tip is relaxed to a weaker one governed by material creep behaviour.

Under the aforementioned conditions, the stress-intensity parameter, $K$, and the
J-integral cannot be used to uniquely characterize the crack growth. This has led to the
extensive development of time-dependent fracture mechanics (TDFM) concepts by
analogy with the elastic-plastic fracture mechanics (EPFM) concepts (Riedel 1987;
Saxena 1989). Three regimes of crack growth (small scale, transient and steady-state)
are observed in creeping materials, depending on the size of the crack-tip creep zone
relative to the specimen dimensions. Figure 1 (Saxena 1986) schematically defines these
regimes. The reader is referred to Saxena (1986, 1991) for definitions and details. The
best example of an analogy between TDFM and EPFM concepts is the relation
between the creep strain rate and the stress for the steady-state creep regime in
a creeping material. This relation is analogous to the relation between instantaneous
plastic strain and stress in an elastic-plastic (EL-PL) material under fully plastic
conditions. The first crack-tip parameter defined in TDFM is C*, which specifically
addresses the steady-state (large-scale) creep crack growth regime. Crack-tip parameters
used in TDFM are discussed in the following sections.


$^1$ Creep-ductile materials are defined as those in which the crack growth rate is negligible compared to
the creep zone expansion rate, i.e. these materials are highly susceptible to creep deformation.

Figure 1. Stages of creep deformation in cracked bodies subjected to primary and
secondary creep deformation. (a) Small-scale primary creep conditions. (b) Secondary
creep zone growing within the primary creep zone, which is now comparable in
size to the uncracked ligament. (c) Extensive primary creep conditions with a substantial
secondary creep zone. (d) Extensive secondary creep conditions (from
Saxena 1986).

2.1 The C* parameter


C* was defined (Landes & Begley 1976; Nikbin et al 1976) as a path independent 
integral with direct analogy to the J-integral (Rice 1968) in EPFM. 

C* = j W*dy— T i (du i /dx)ds, (1) 

where W* is the strain-rate-energy density given by 


W* = 






( 2 ) 


In (1), 7) is the traction vector defined by the outward unit normal vector nj along the path 
T, which is taken from any point along the lower crack surface, counter-clockwise around 
the crack tip to end at the top surface. and e y are the stress and strain-rate tensors, u f is 
the deflection-rate vector along the direction of the traction and ds is the length along F. 
The physical significance of C* lies in that it represents the power difference between two 
identically loaded bodies having incrementally different crack lengths, i.e. 


C*=-(l/B)(dU*/da), (3) 

where U* is the power (energy rate) input to the cracked body to grow the crack by an 
increment da. Equation (3) is also analogous to the physical significance of J. This makes 
C* measurable as a physical entity at the loading pins where the external load is applied. 
Further, its path-independence definition ensures that a unique er-e relationship holds in 
the near field of the crack tip as well as in the far field where the external load is applied. The 
stress and strain-rate fields at the crack tip are characterized by C* as follows: 


$$ \sigma_{ij} = \left( \frac{C^*}{A I_n r} \right)^{1/(n+1)} \hat{\sigma}_{ij}(\theta), \tag{4} $$

$$ \dot{\varepsilon}_{ij} = A \left( \frac{C^*}{A I_n r} \right)^{n/(n+1)} \hat{\varepsilon}_{ij}(\theta). \tag{5} $$

In (4) above, C* represents the intensity of the stress singularity at the crack tip
(Goldman & Hutchinson 1975). r is the radial distance from the crack tip and θ is the
angle from the plane of the crack. I_n is a nondimensional constant and σ̂_ij(θ) and ε̂_ij(θ)
are angular functions listed in Shih (1983). A and n are the steady-state creep constant
and exponent, respectively, in the following constitutive law used to describe the unique
relationship that exists between stress (σ) and strain-rate (ε̇) for a material undergoing
secondary creep,

$$ \dot{\varepsilon} = A\sigma^n. \tag{6} $$
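
The scaling in (4) to (6) can be checked numerically. The sketch below is a minimal illustration, not part of the original analysis: it evaluates only the radial amplitudes of the stress and strain-rate fields (the angular functions are omitted), and all numerical inputs are assumed, illustrative values in a consistent unit system.

```python
def crack_tip_amplitudes(c_star, A, n, I_n, r):
    """Radial amplitudes of the fields in (4) and (5), angular functions omitted."""
    group = c_star / (A * I_n * r)
    sigma_amp = group ** (1.0 / (n + 1.0))        # stress amplitude, eq. (4)
    eps_rate_amp = A * group ** (n / (n + 1.0))   # strain-rate amplitude, eq. (5)
    return sigma_amp, eps_rate_amp


# Assumed, illustrative inputs (not taken from the text)
A, n, I_n = 1.0e-20, 8.0, 4.5
c_star, r = 1.0e-3, 1.0e-3

sigma_amp, eps_rate_amp = crack_tip_amplitudes(c_star, A, n, I_n, r)

# The two amplitudes obey the secondary creep law (6)
norton_rate = A * sigma_amp ** n
# Their product varies as 1/r, with C* setting the intensity
energy_rate_density = sigma_amp * eps_rate_amp
```

The amplitudes satisfy (6) identically, and their product equals C*/(I_n r), which is the sense in which C* sets the intensity of the crack-tip singularity.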

In brief, C* satisfies the following characteristics which make it an attractive choice 
for a crack-tip parameter under large-scale (steady-state) creep conditions. 

(a) It is associated with a path-independent integral, (1); 

(b) it has a physical interpretation, (3), and can be measured at the loading pins; 

(c) it characterizes the crack-tip stress and strain-rate fields, (4), (5). 

However, due to its definition being analogous to the J-integral used under fully- 
plastic conditions, the usefulness of C* for characterization of CCG is limited because it 
is not applicable under small-scale-creep conditions (SSC conditions) and transition 
creep conditions. Since most components in service are made from materials designed 
to resist large-scale creep, C* cannot be used to characterize creep crack growth in these 
components because SSC conditions dominate. 

2.2 C(t)-integral

Equation (6) can be modified to represent the creep behaviour of elastic-creeping
materials as follows:

$$ \dot{\varepsilon} = \frac{\dot{\sigma}}{E} + A\sigma^n, \tag{7} $$

where E is the elastic modulus of the material. The dots represent the derivatives w.r.t.
time. In accordance with (7), under transient creep conditions (non-steady state 
conditions) in elastic-creeping materials, strain-rate is a function of both time and stress 
at every point inside or outside the creep zone boundary because stress also varies with 
time. Hence, the integral defined in (1) is no longer path independent i.e. it varies for 
each contour, and even for the same contour the value of the integral varies with time 
resulting in a non-unique σ-ε̇ relationship. Further, it no longer has the energy rate
interpretation. Since crack-tip stresses also vary with time, C* does not uniquely 
characterize them under these conditions. Thus, it loses its utility as a crack-tip 
parameter. 

The C(t)-integral was defined to extend C* to the small-scale creep regime via its
path-integral definition, (1). This involves selecting an integration path in the small
creep strain dominated region, just ahead of the crack tip, contained within the elastic
strain dominated fields. Along such a path, the integral defining C*, (1), is path-independent and also
characterizes the crack tip stress and strain rate singularities at any given time. This
integral is the C(t)-integral. Under extensive creep conditions C(t) = C*. These attributes
make C(t) a candidate parameter for characterizing CCG and CFCG. Details
regarding the definition and evaluation of the C(t)-integral can be found in the
literature (Ohji et al 1979; Riedel & Rice 1980; Ehlers & Riedel 1981). Recent numerical
studies (Leung et al 1988; McDowell & Leung 1988; Bassani et al 1989) have confirmed
the validity of the expression proposed by Ehlers & Riedel (1981) for evaluating C(t) in
the transition creep regime. However, the C(t)-integral cannot be related to the
stress-power dissipation rate and cannot be measured at the loading pins. This limits
the practical use of C(t) as a crack-tip parameter.

2.3 C_t parameter

In order to overcome the aforementioned limitations of the C* and C(t) parameters, C*
was extended into the transient creep regime via its stress-power dissipation rate
definition, (3), by Saxena (1986). C_t represents the instantaneous stress-power dissipation
rate during the transient while C* represents the steady state rate. C_t is defined as
follows:

$$ C_t = \lim_{\Delta a \to 0} \left( -\frac{1}{B} \frac{\Delta U_t^*}{\Delta a} \right) = -\frac{1}{B} \frac{\partial U_t^*}{\partial a}, \tag{8} $$

where U_t* is the instantaneous stress-power (energy-rate). The subscript t denotes that the
value of U* is at a fixed time t. Thus, C_t is defined in terms of the difference in the
energy-rates (or power) supplied to two cracked bodies with identical creep deformation
histories, but having cracks differing in length by Δa, as they are loaded to different
deflection-rate levels. In other words, C_t is the power supplied to a cracked body,
undergoing creep deformation, at any instant of time to grow the crack infinitesimally.
Further details are presented by Saxena (1986). Under transient creep conditions, the
C_t parameter does not characterize the singular crack tip stress fields but is uniquely
related to the creep zone expansion rate (Saxena 1986).

Methods for evaluation of C t under small-scale and steady-state creep conditions are 
discussed in subsequent sections. 


2.4 Crack tip parameters for CFCG 

Under creep-fatigue loading, using trapezoidal or other waveshapes (figure 2)
(Saxena 1993), elastic, plastic and creep deformation can occur at the crack tip. The
extent of creep deformation during the hold period depends on the length of the hold,
while the extent of creep deformation during the loading part of the cycle is governed
by the rate of loading. Stress redistribution
occurs due to creep during the hold time and causes change in the elastic strains
(Saxena 1993). If the hold time is sufficiently long, extensive creep conditions may occur,
whereupon the stress for a stationary crack becomes constant and further deformation
occurs only by creep (Riedel & Rice 1980; Saxena 1980; Bassani & McClintock 1981).
For the case of a trapezoidal waveform involving fast loading followed by a significant 
period of sustained loading, the normalized crack tip stress as a function of normalized 
time at a fixed distance ahead of the crack tip is given as in figure 3, taken from the work 
of Riedel (1993). Formulating analytical approaches for crack-tip stress analysis under 
CFCG conditions is not an easy task because the combinations of loading and material 
behaviour involved are complex, including effects due to cyclic plasticity mixed with 
primary and secondary creep deformation. Consequently, numerical approaches are 
frequently employed (Saxena 1993). Also, the experimental studies carried out
for explaining CFCG behaviour outnumber the analytical studies. These experimental
studies have resulted in the identification of relevant crack tip parameters. The
discussion here will focus on the ΔJ_c (Ohtani et al 1981, 1989; Ohji 1986; Kuwabara et al
1988) and the (C_t)_avg (Saxena & Gieseke 1990) parameters.

Figure 2. Typical loading waveforms used during creep-fatigue crack growth testing in
the laboratory.

Figure 3. Normalized crack tip stress as a function of normalized time at a fixed distance
ahead of the crack tip for fast loading followed by sustained load (from Riedel 1983).
σ_ss = steady-state stress.

For a trapezoidal waveform with rapid loading/unloading as compared to the hold
time, ΔJ_c is given by (Ohtani et al 1981, 1989; Ohji 1986; Kuwabara et al 1988)

$$ \Delta J_c = \int_0^{t_h} J^* \, dt, \tag{9} $$

where J* is identical to C* described in §2.1 and t_h is the hold time.

(C_t)_avg is defined as (Saxena & Gieseke 1990; Yoon et al 1992, 1993)

$$ (C_t)_{avg} = \frac{1}{t_h} \int_0^{t_h} C_t \, dt, \tag{10} $$

for materials showing 'creep reversal', i.e. the creep deformation is significantly reversed
during every cycle because these materials have low resistance to cyclic plasticity. For
materials that do not exhibit creep reversal effects, C_t is not reinstated after each cycle.
The following expression is used to calculate (C_t)_avg for such materials (Adefris 1993),


$$ (C_t)_{avg} = \frac{1}{t_h} \int_{(N-1) t_h}^{N t_h} C_t \, dt. \tag{11} $$


As mentioned in the previous section, under extensive creep conditions, C_t and C* (or
J*) are the same by definition. Hence, (9) and (10) imply that ΔJ_c and (C_t)_avg are
uniquely related in the extensive creep regime as:

$$ \Delta J_c = t_h \, (C_t)_{avg}. \tag{12} $$

Under conditions of small-scale creep, C_t and C* (or J*) are not the same. However, in
the case of correlations based on experimental measurements of these parameters,
differences in the numerical values of the two are not significant (Saxena 1986; Bassani
et al 1989). On the other hand, if the parameters are calculated, significant differences
could result if the influence of small-scale-creep conditions is not included in the
calculations of C_t. Thus, calculated values of (C_t)_avg and ΔJ_c can differ significantly
under small-scale creep and, in general, are not related by (12).

In summary, the (C_t)_avg and ΔJ_c parameters are equivalent under extensive creep
conditions but not under small-scale-creep conditions where ΔJ_c can only be operationally
defined (Saxena 1993). The operational definition of ΔJ_c is based on experimental
load-line deflection measurements (Saxena 1993). Thus, (C_t)_avg is the more widely
applicable parameter.
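
As a brief numerical illustration of (9), (10) and (12), the sketch below averages a sampled C_t history over the hold time with the trapezoidal rule. The function names and the sample values are hypothetical; the only assumption about the physics is that, under extensive creep, C_t equals C* and stays constant during the hold.

```python
def trapezoid(y, x):
    """Trapezoidal-rule integral of samples y taken at points x."""
    return sum(0.5 * (y[i] + y[i + 1]) * (x[i + 1] - x[i]) for i in range(len(x) - 1))


def ct_avg(times, ct_values):
    """(C_t)_avg per (10): time average of C_t over the hold period."""
    t_hold = times[-1] - times[0]
    return trapezoid(ct_values, times) / t_hold


def delta_jc(times, cstar_values):
    """Delta J_c per (9): time integral of J* (identical to C*) over the hold period."""
    return trapezoid(cstar_values, times)


# Assumed example: extensive creep, so C_t = C* = constant during the hold
t_h = 2.0                                   # hold time (illustrative)
times = [t_h * i / 10 for i in range(11)]   # sampling instants within the hold
c_star = 0.05                               # illustrative constant value
history = [c_star] * len(times)

avg = ct_avg(times, history)
djc = delta_jc(times, history)
```

For this constant history the two quantities satisfy (12), that is, djc equals t_h times avg. For a decaying C_t history typical of small-scale creep, the same functions can be applied to measured samples, but (12) would no longer be expected to hold.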


3. Test methods and crack growth data 

Test methods for conducting CCG and CFCG tests are discussed in the following
sections along with the data collection and data reduction steps. Insights into the
analysis of these data are also provided. An accelerated test procedure (Grover & Saxena
1994) recently proposed for CFCG testing is discussed in some detail.

3.1 Creep crack growth testing and data analysis 

The objective of creep crack growth testing, according to the ASTM (1992) standard on 
creep crack growth testing, is to determine the relationship between the time rate of 
crack growth, da/dt, and the applied value of the C*(t)-integral, where C*(t) is the 
time-dependent value of C* defined in (1) and has been defined to extend the validity of 
C* into the extensive primary creep regime (Saxena 1989a, 1991). The creep crack 
growth rates expressed as a function of C*(t ) characterize the resistance of a material to 
crack growth under extensive creep conditions. However, it has been shown (Riedel 
& Detampel 1987; Saxena 1989a), as discussed later, that C t correlates the da/dt data 
better than C*(t) and it characterizes the creep crack growth resistance of a material 
under a wide range of creep conditions, including small-scale creep and transition creep 
conditions. Hence, C t is the crack-tip parameter of choice in this study. 

A variety of specimen geometries are used in fracture mechanics studies to simulate the 
service loading conditions in the laboratory. Some of these are illustrated in figure 4. 
Figure 4. Typical specimens used in fracture mechanics testing, after Anderson (1991).
(a) Compact specimen, (b) Disk shaped compact specimen, (c) Single edge notched bend
(SENB) specimen, (d) Arc shaped specimen, (e) Middle tension (MT) specimen.


Compact type (CT) and the centre cracked tension (CCT) specimens are commonly 
used. However CT specimens are the preferred geometry for CCG and CFCG studies, 
to be discussed later. They require lower load levels and the transition time for 
steady-state conditions to develop from SSC conditions is longer for CT specimens 
than for CCT specimens having comparable widths. Thus, CT specimens are better 
suited for longer elevated temperature tests. Further, a displacement gauge can be 
conveniently placed at the load-line of a CT specimen for measuring the load-line 
deflection. The schematic in figure 5 (ASTM 1992) shows a deflection gauge placed at 
the load-line in a CT specimen. 

Specimens are pre-cracked using fatigue loading. They are subjected to dead-weight 
loading at test temperature in suitable furnaces. The crack length and the load-line 
displacements are continuously recorded as a function of time. The electric potential drop
method (DC or AC) is usually used to measure crack lengths. These data can be
collected digitally using a suitable data-acquisition system, or they can be recorded on
a strip-chart recorder. The latter is usually used as a backup even when a data- 
acquisition system is used. Monitoring of the temperature is required to ensure that it 
remains constant within allowable limits during the test. A record of load versus time 
must be maintained if servo-mechanical loading systems are used to achieve constant 
load conditions (ASTM 1992). 

The load, load-line displacement and the crack length data are subsequently
processed to obtain the correlations between the creep crack growth rates and a crack
tip parameter, C_t in this study. Expressing da/dt as a function of C_t generally provides
results that are independent of the specimen size and planar geometry for the same
crack-tip stress state. This is useful in comparing data from a variety of specimen
configurations and loading conditions. Assuming similitude, i.e. that cracks of different
length subjected to the same nominal value of C_t advance by equal increments of
crack extension per unit time, the data obtained from these tests can be utilized in
the design and evaluation of engineering structures operated at elevated temperatures.
Material selection criteria and inspection requirements for damage tolerant applications
can also be established using these data. However, results from these tests should
not be applied to cracks in structures with wide-spread creep damage around the
crack tip.

Figure 5. Schematic of the overall test set-up showing the clip gauge as attached to the
specimen (from ASTM 1992).

The crack growth rates are calculated from the crack length versus time (a vs t) data.
C_t is calculated for a wide range of creep conditions from the load-line deflection rates
in the specimen using the following expression,

$$ C_t = \frac{P \dot{V}_c}{BW} \frac{F'}{F} - C^*(t) \left( \frac{F'/F}{\eta} - 1 \right), \tag{13} $$

where P is the load applied to the specimen having width W and thickness B. F is the
K-calibration factor, F = (K/P) B W^{1/2}, and F' = dF/d(a/W). V̇_c is the contribution to the
measured load-line deflection rate due to creep. It is obtained using (14) (Saxena 1986,
1991; ASTM 1992),

$$ \dot{V}_c = \dot{V} - \frac{\dot{a} B}{P} \left[ \frac{2K^2}{E} + (m + 1) J_p \right] - \dot{P} C, \tag{14} $$

where ȧ is the crack growth rate, J_p is the plastic part of J calculated from expressions
provided in Riedel & Detampel (1987), m is the plasticity constant of the material and C is
the machine compliance. For constant load tests, the load rate Ṗ = 0 and the last term
on the right hand side of (14) drops out.

C*(t) in (13) can also be measured in test specimens (Lee 1987),

$$ C^*(t) = \frac{P \dot{V}_c}{BW} \, \eta(a/W, n, n_1), \tag{15} $$

where η is a function of the crack size to width ratio, a/W, and the secondary and primary
creep constants, n and n_1 respectively. For practical purposes, for CT specimens it is
defined as (Saxena 1991)

$$ \eta = \frac{n}{n + 1} \left[ \frac{2}{1 - a/W} + 0.522 \right]. \tag{16} $$

However, the term n/(n + 1) on the right hand side is strictly valid for secondary creep
only and it has to be replaced by n_1/(n_1 + 1) when most of the time is spent under
primary creep conditions (Saxena 1991). Further details on this issue are provided in
Saxena (1991).
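
The data reduction described by (13), (15) and (16) can be sketched as follows. This is a hedged illustration rather than the authors' procedure. It assumes that (13) has the form C_t = (P V̇_c / BW)(F'/F) - C*(t)[(F'/F)/η - 1]. The K-calibration polynomial for the compact specimen is the widely used one from the fracture-testing standards and is an assumption here, since the text does not list it. All numerical inputs are illustrative.

```python
def f_ct(alpha):
    """K-calibration F = (K/P) B W^(1/2) for a compact specimen, alpha = a/W.
    Assumption: standard polynomial from the testing standards, not given in the text."""
    poly = 0.886 + 4.64 * alpha - 13.32 * alpha**2 + 14.72 * alpha**3 - 5.6 * alpha**4
    return (2.0 + alpha) / (1.0 - alpha) ** 1.5 * poly


def f_prime_over_f(alpha, h=1.0e-6):
    """F'/F with F' = dF/d(a/W), estimated by a central difference."""
    return (f_ct(alpha + h) - f_ct(alpha - h)) / (2.0 * h) / f_ct(alpha)


def eta_ct(alpha, n):
    """Equation (16), secondary creep form."""
    return n / (n + 1.0) * (2.0 / (1.0 - alpha) + 0.522)


def c_star_t(P, v_dot_c, B, W, alpha, n):
    """Equation (15): C*(t) from the creep part of the load-line deflection rate."""
    return P * v_dot_c / (B * W) * eta_ct(alpha, n)


def c_t(P, v_dot_c, B, W, alpha, n, c_star):
    """Equation (13), in the form assumed in the lead-in."""
    ratio = f_prime_over_f(alpha)
    return P * v_dot_c / (B * W) * ratio - c_star * (ratio / eta_ct(alpha, n) - 1.0)


# Illustrative inputs (assumed): load in N, lengths in m, deflection rate in m/h
P, v_dot_c, B, W, alpha, n = 10.0e3, 1.0e-6, 0.025, 0.05, 0.5, 8.0

c_star = c_star_t(P, v_dot_c, B, W, alpha, n)
ct_extensive = c_t(P, v_dot_c, B, W, alpha, n, c_star)  # C*(t) from the same deflection rate
ct_small_scale = c_t(P, v_dot_c, B, W, alpha, n, 0.0)   # C*(t) negligible
```

When C*(t) is evaluated from the same creep deflection rate, the two terms involving F'/F cancel and C_t reduces to C*(t), which is the extensive creep limit stated in the text. When C*(t) is negligible, C_t is governed by the first term alone.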

3.2 Experimental correlations of creep crack growth data 

In this section, experimental evidence is provided in support of the suitability of the C t 
parameter over other crack tip parameters, C(t) and C*, for characterizing CCG. The 
creep crack growth trends of austenitic and ferritic steels used at elevated temperatures 
are also discussed. 

Figures 6 to 8 (Saxena 1989a) show plots of the creep crack growth rate, da/dt, as
a function of C*, C(t) and C_t respectively. The data shown include those from tests where
transient conditions prevailed for significant lengths of time before the steady state
conditions were reached. It can be clearly seen that the data do not correlate well with
either C* or C(t) while they do with C_t. Riedel & Detampel (1987) have also shown C_t to
be the parameter of choice for correlating creep crack growth rate (CCGR) data. For
further discussion, readers are referred to Saxena (1989a).

Figure 6. Creep crack growth rate as a function of C* (from Saxena 1989a).
Horizontal axis: C*, kJ/m²-h.

Figure 7. Creep crack growth rate as a function of the C(t) parameter, in
a Cr-Mo-V steel (from Saxena 1989a).

Creep cracks in most high temperature steels grow by grain boundary cavitation and 
the effect of the environment is minimal. Figure 9 (Saxena 1989a) shows the CCG 
behaviour of ex-service and new Cr-Mo and Cr-Mo-V steels commonly used in 
power plants. Despite the varying material compositions and thermal histories of these 
materials, their CCG behaviours show similarity and almost all the data lie within 
a 95% confidence scatter band. Correlating CCG rates with C t (or C*, for only steady 
state conditions) provides a first-order normalization with respect to changes in 
temperature, provided there is no substantial change in the rupture ductilities due to 
changes in temperature, composition, or thermal history (Saxena 1989a). Thus, CCGR 
data at different temperatures show the same trend when correlated with C t even 
though the creep deformation rates at those temperatures for the material are very 
different. This is illustrated in figure 10, while figure 11 shows the creep deformation 
behaviour of the material (Saxena 1989a). This lack of temperature dependence of the 
da/dt vs C t plots is useful because the life predictions made for a component operating 
at one temperature would be valid for its operation at a different temperature. Further,
if the CCGR data for a certain temperature are not available, design parameters can be
decided and life predictions for the component can be made on the basis of the CCGR
data for a different temperature. However, service conditions may result in a significant
change in the creep ductilities causing the da/dt vs C_t relationship to change with
temperature, and it may be different for two parts of the same component subjected to
different temperatures. This is generally due to the microstructural changes in the
material resulting from exposure to varying service temperatures. An example of this is
the higher CCGR that results in the "hot end" of a steam header as compared to its
"cold end", due to the coarsening of grain boundary carbides under the higher
operating temperatures of the hot end.

Figure 9. Creep crack growth behaviour of Cr-Mo and Cr-Mo-V base material.
The data include all ex-service and new material data (from Saxena 1989a).
Horizontal axis: C_t, in-kips/in²-h.

Figure 11. Creep deformation behaviour of A470 class 8 steel (from Saxena 1989a).


3.3 Creep fatigue crack growth testing and data analysis 

Under creep fatigue crack growth conditions, crack growth occurs due to both creep
and fatigue processes. A trapezoidal waveform simulates creep fatigue loading conditions.
Creep deformation occurs mainly during the sustained loading period, the hold
time (t_h), while fatigue processes contribute to the crack growth during the loading part
(rise time, t_r) and unloading part (decay time, t_d) of the trapezoidal wave, provided
the rise and decay times are small in comparison to the hold time. However, if the
loading rate at high temperatures is slow, creep can occur during the loading part of the
cycle as well.

Unlike the ASTM (1992) standard for CCG testing, a standard test method has not 
been adopted for CFCG testing as yet. However, CFCG test methods and data analysis 
techniques have been discussed in detail by Grover & Saxena (1994). 

Accelerated CFCG tests lasting a few days to a few weeks are conducted to obtain 
data that are used to predict service behaviours up to 30-40 years. 

CT specimens are the preferred geometry for CFCG testing due to reasons discussed 
earlier. Their size is determined on the basis of the amount of material available, the 
desired orientation of the crack, the size/shape of the available furnaces and the 
load-line deflection gauge to be used. The specimens are mounted on clevises placed in 
high temperature chambers/furnaces to heat them to the desired temperature. This is 
generally the same as the service temperature although higher temperatures may be 
used to decrease the test time, provided the creep deformation processes remain the 
same. Servohydraulic machines are used for loading the specimens under the desired 
waveform. 

The trapezoidal waveform, mentioned earlier, is most commonly used because it 
simulates actual service conditions, especially in power plant materials. Figure 12 (Lee 
1987) shows the stress as a function of time for various locations in a steam header. 
Having selected the waveform, the K-level/load levels are decided on the basis of the 
crack growth rates required during the test. These crack growth rates should be 
approximately those encountered by the material during service (Grover & Saxena 
1994). However, the time constraints on the duration of a test often dictate the choice of
the crack growth rates to be achieved during the test. Thus, K-levels have to be chosen
in conjunction with the hold times of the loading waveform (Grover & Saxena 1994).
Other than the K-levels to be used, the sensitivity of the load-line deflection measurement
gauge/amplifier system also affects the choice of the hold times. The reader is
referred to Grover & Saxena (1994) for further details.
Figure 12. Stress versus time histories at different locations in a steam header.

The measurements made during the test are the deflection change during hold time, the
total load-line deflection and the crack length at different stages in the test. It is advisable to
monitor the load during hold time to ensure that the deflection changes are not due to
random variations in the load (Grover & Saxena 1994). Strip chart recorders or computerized
data acquisition systems can be used to record the aforementioned data. Strip
charts are useful because the deflection change traces can be visually inspected for the
quality of the data. As in the case of CCG tests, crack length is measured using electric
potential drop. Details on the DC potential drop method are given in ASTM (1991).

In order to obtain the influence of creep, the crack length data are reduced to time-dependent
crack growth rates, da/dt. The average values of these, (da/dt)_avg, are plotted against
(C_t)_avg, which is the average value of C_t. The average of C_t is used to correlate CFCG data
because of the experimental limitations involved in measuring C_t during hold times.

(C_t)_avg is evaluated using the measured load-line deflection change rate during the
hold time as follows:

$$ (C_t)_{avg} = \frac{\Delta P \, \Delta V_c}{B W t_h} \frac{F'}{F} - C^*(t) \left( \frac{F'/F}{\eta} - 1 \right), \tag{17} $$

where ΔP is the applied load range and ΔV_c is the load-line deflection change due to
creep during hold time t_h. The rest of the terms are as defined earlier.

ΔV_c is obtained from the total measured deflection change, ΔV, using the following
expression,

$$ \Delta V_c = \Delta V - \left( \frac{da}{dt} \right)_{avg} \frac{B t_h}{P} \left[ \frac{2K_{eff}^2}{E} + (m + 1) J_p \right], \tag{18} $$




Parmeet S Grover and Ashok Saxena 


where m is the plasticity exponent in the Ramberg-Osgood equation and P is the
applied load level. K_eff is the value of K corresponding to the 'effective' crack length
corrected for plasticity.

It is noted that the ΔV_c value which is obtained using (18), from the total load-line
deflection change, ΔV, by the deflection partitioning approach (Saxena et al 1983) includes
both primary and secondary creep contributions to the load-line deflection change. J_p
is the fully plastic part of J and is obtained from expressions listed in Kumar et al (1981).
C*(t) can be determined using procedures outlined in Kumar et al (1981) and must
include the contributions of secondary and primary creep. The (da/dt)_avg is calculated
using


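$$ \left( \frac{da}{dt} \right)_{avg} = \frac{1}{t_h} \left[ \frac{da}{dN} - \left( \frac{da}{dN} \right)_{cycle} \right], \tag{19} $$

where (da/dN)_cycle is the cyclic crack growth rate and has to be obtained from an FCGR
test for which the rise and decay times are the same as those of the trapezoidal waveform
used in the CFCG tests.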

The effects of fatigue on the crack growth are studied by obtaining correlations
between the cycle dependent crack growth rate, (da/dN)_cycle, and ΔK, which is
expressed as

$$ \Delta K = \frac{\Delta P}{B W^{1/2}} F(a/W). \tag{20} $$

All terms in the expression have been defined earlier.
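
A short numerical sketch of (20) for a compact specimen follows. The polynomial used for F(a/W) is the standard compact-specimen calibration from the fracture-testing standards and is an assumption here, as the text does not list it. The specimen dimensions and load range are illustrative values only.

```python
import math


def f_ct(alpha):
    """F(a/W) for a compact specimen, alpha = a/W.
    Assumption: standard calibration polynomial, not listed in the text."""
    poly = 0.886 + 4.64 * alpha - 13.32 * alpha**2 + 14.72 * alpha**3 - 5.6 * alpha**4
    return (2.0 + alpha) / (1.0 - alpha) ** 1.5 * poly


def delta_k(delta_p, B, W, alpha):
    """Equation (20): stress intensity factor range for load range delta_p."""
    return delta_p / (B * math.sqrt(W)) * f_ct(alpha)


# Illustrative inputs (assumed): load range in N, B and W in m
dk_pa = delta_k(delta_p=10.0e3, B=0.025, W=0.05, alpha=0.5)  # Pa*m^0.5
dk_mpa = dk_pa / 1.0e6                                       # MPa*m^0.5
```

With these inputs the function returns a ΔK of roughly 17 MPa√m, a level within the range commonly used in elevated temperature crack growth tests.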

The overall crack growth rate during a fatigue cycle, da/dN, is defined as

$$ \frac{da}{dN} = \left( \frac{da}{dN} \right)_{cycle} + t_h \left( \frac{da}{dt} \right)_{avg}, \tag{21} $$

and is measured during a CFCG test.
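
The partitioning of the measured growth per cycle into cycle-dependent and time-dependent parts can be sketched as below, assuming (19) and (21) take the linear-summation forms (da/dt)_avg = [da/dN - (da/dN)_cycle] / t_h and da/dN = (da/dN)_cycle + t_h (da/dt)_avg. The function names and numerical values are hypothetical.

```python
def time_dependent_rate(da_dn_total, da_dn_cycle, t_h):
    """(da/dt)_avg per (19): growth during the hold, averaged over the hold time."""
    return (da_dn_total - da_dn_cycle) / t_h


def total_rate(da_dn_cycle, da_dt_avg, t_h):
    """da/dN per (21): linear sum of cycle- and time-dependent contributions."""
    return da_dn_cycle + t_h * da_dt_avg


# Illustrative values (assumed): growth in mm/cycle, hold time in h
measured = 5.0e-3        # da/dN measured in a CFCG test
cycle_part = 1.0e-3      # (da/dN)_cycle from an FCGR test with the same rise/decay times
hold_time = 0.25

rate = time_dependent_rate(measured, cycle_part, hold_time)   # mm/h
recovered = total_rate(cycle_part, rate, hold_time)           # mm/cycle
```

The second function simply inverts the first, so the measured da/dN is recovered exactly. In practice the cycle-dependent part would come from the regression of FCGR data against ΔK.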


3.4 Experimental correlations and modelling of CFCG data 

This section includes a discussion on the experimental correlations of CFCG data and 
the development of life prediction models for components using this data. 

Figure 13 (Grover 1993) shows plots of da/dN vs ΔK for a 2.25Cr-1.0Mo steel at
594°C (1100°F). The regression line through the FCGR data (t_h = 0) is used to get the
cycle-dependent part in modelling the CFCG data. The lack of correlation between
da/dN and ΔK for the CFCG tests is evident from the scatter in the data in this figure.
Such lack of correlation has also been shown in Cr-Mo-V steel, figure 14 (Harrison
& Sandor 1971). A discussion on the limitations of using ΔK for correlating high
temperature crack growth is presented in Saxena (1988, 1993) and Grover (1993). An
increase in da/dN with increasing hold time for fixed ΔK is observed in figures 13
and 15 (Saxena & Bassani 1984). This is due to the increasing contribution of time-dependent
crack growth (Grover & Saxena 1994). Creep damage at the crack tip,
influence of the environment, or microstructural changes such as formation of cavities
which occur during loading at elevated temperatures could be responsible for this
behaviour (Saxena 1993).

The average time dependent crack growth rates, (da/dt)_avg, are correlated with
(C_t)_avg. Figures 16 (Grover 1993) and 17 (Yoon et al 1993) show plots of (da/dt)_avg vs
(C_t)_avg for 2.25Cr-1.0Mo and 1.25Cr-0.5Mo steels, respectively, for various hold times
at elevated temperatures. All data show a clear trend and fall into a narrow scatter band
despite the range of hold times used. This strongly indicates the usefulness of (C_t)_avg in
characterizing CFCG rates. Further, the CCG data for each of these materials have
also been plotted on these graphs. However, da/dt has been correlated with C_t for the
CCG data. All the CCG and CFCG data show the same trend. This has the important
implication that life prediction procedures for these materials would be considerably
simplified because CCG data could be used to predict the life of components under
CFCG conditions and vice-versa. In comparing a (da/dt)_avg vs (C_t)_avg relation of CFCG
with a da/dt vs C_t relation of CCG, it must be kept in mind that although C_t and (C_t)_avg
are equivalent parameters with the same physical interpretation, their exact values may
differ slightly in the SSC regime by a constant factor for a given material (Yoon et al
1993). The value of this constant ranges from 1 to 1.3 for different materials, and further
details are presented by Yoon et al (1993).

Figure 13. Fatigue crack growth data with various hold times and without hold
time (from Grover 1993).

Figure 14. Elevated temperature fatigue crack growth behaviour of Cr-Mo-V steel
plotted as a function of ΔK (from Harrison & Sandor 1971).

The time dependence of the life-prediction model is obtained by generating a regression
line through all the data. The total fatigue crack growth rate per cycle is
a linear summation of the cycle and time-dependent crack growth rates. Such an
expression obtained for 2.25Cr-1.0Mo steel at 594°C (1100°F) under trapezoidal
loading waveshapes is given in (22), (23) (Grover & Saxena 1994) which are expressed in
SI and British units, respectively.

$$ \frac{da}{dN} = 1.08 \times 10^{-6} \, \Delta K^{1.94} + 1.46 \times 10^{-2} \, [(C_t)_{avg}]^{0.722} \, t_h \tag{22} $$

$$ \frac{da}{dN} = 5.11 \times 10^{-8} \, \Delta K^{1.94} + 2.40 \times 10^{-2} \, [(C_t)_{avg}]^{0.722} \, t_h \tag{23} $$

Figure 15. Effect of hold time on the fatigue crack growth behaviour of a Cr-Mo-V
steel (from Saxena et al 1984). Horizontal axis units: MPa√m.

Figure 16. Correlation of crack growth rates with the measured values of C_t. (The
average values of da/dt and C_t are plotted for CFCG data, from Grover 1993).

The first term in the equations above represents the cycle-dependent crack growth
rate and the other term represents the time-dependent crack growth rate. These
equations can be effectively used to predict the service life of high temperature
components made of 2.25Cr-1.0Mo steel under both CCG and CFCG conditions at
594°C (1100°F). An upper and lower scatter band can also be generated for the data in
figure 16 for design purposes. This model has been established under the assumption
that the crack growth during hold time is only due to creep deformation. Any other
time-dependent effects like oxidation at the crack tip have not been incorporated
(Grover & Saxena 1994). Neither have any synergistic effects due to any complicated
interactions of the creep and fatigue mechanisms of crack growth during unloading/reloading
been incorporated. However, their exclusion in the light of the assumption
that the unloading/reloading times are much smaller compared to the hold times
seems justified (Grover & Saxena 1994). If such effects were to be considered, depending
on the material, an equation of the type presented above would be too simplistic in its
description of the CFCG behaviour of a material.

Figure 17. Correlation of crack growth rates for creep and creep-fatigue crack
growth data in 1.25Cr-0.5Mo steel (from Yoon et al 1993).
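
The two forms of the model can be evaluated and cross-checked numerically. The sketch below reads the coefficients of (22) and (23) as 1.08e-6, 1.46e-2, 5.11e-8 and 2.40e-2 with exponents 1.94 and 0.722. The units are an assumption, since the text does not state them explicitly: mm/cycle, MPa√m, kJ/m²-h and hours for (22), and in/cycle, ksi√in, in-kips/in²-h and hours for (23). The input values are illustrative.

```python
# Unit conversion factors (standard values)
KSI_SQRT_IN_TO_MPA_SQRT_M = 1.09884
IN_KIP_PER_IN2_TO_KJ_PER_M2 = 175.127
MM_PER_INCH = 25.4


def da_dn_si(delta_k, ct_avg, t_h):
    """Equation (22). Assumed units: MPa*m^0.5, kJ/m^2-h, h; result in mm/cycle."""
    return 1.08e-6 * delta_k**1.94 + 1.46e-2 * ct_avg**0.722 * t_h


def da_dn_british(delta_k, ct_avg, t_h):
    """Equation (23). Assumed units: ksi*in^0.5, in-kips/in^2-h, h; result in in/cycle."""
    return 5.11e-8 * delta_k**1.94 + 2.40e-2 * ct_avg**0.722 * t_h


# Illustrative loading (assumed): 20 MPa*m^0.5, 1 kJ/m^2-h, 0.5 h hold
dk, ct, th = 20.0, 1.0, 0.5

growth_mm = da_dn_si(dk, ct, th)
growth_in = da_dn_british(dk / KSI_SQRT_IN_TO_MPA_SQRT_M,
                          ct / IN_KIP_PER_IN2_TO_KJ_PER_M2,
                          th)
mismatch = abs(growth_in * MM_PER_INCH - growth_mm) / growth_mm
```

Under the assumed units the two equations agree to within the rounding of the published coefficients, which supports this reading of the coefficients and exponents. At this loading the time-dependent term dominates the growth per cycle.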

3.5 Calculating CCG and CFCG crack tip parameters in components 

In §§ 3.1-3.4 the discussion concentrated on obtaining and correlating experimental
data with crack tip parameters, C_t and (C_t)_avg, calculated from experimental quantities
measured in the laboratory. Use of directly measured deflection changes to obtain these
parameters has advantages discussed by Grover (1993). However, often there is a need
to calculate these crack-tip parameters for components in service due to limitations or
constraints that might exist in carrying out laboratory tests on the component material.
In such an event, the expressions outlined in this section may be used to calculate C_t and
(C_t)_avg for components, making use of material properties.

C_t is related to the rate of expansion of the creep zone for small-scale creep conditions
and is expressed as

$$ (C_t)_{ssc} = \frac{2(1 - \nu^2) K^2}{E W} \frac{F'}{F} \beta \dot{r}_c, \tag{24} $$

where β = 1/3 is a factor obtained from finite element analysis, ν is the Poisson's ratio
and ṙ_c is the rate of creep zone expansion which is given by (25) and (27) for
materials deforming by elastic-secondary creep (EL-SC) and elastic-primary creep
(EL-PC) respectively. Other terms have been defined earlier.

$$ \dot{r}_c = \frac{2\alpha}{n - 1} K^2 (EA)^{2/(n-1)} \, t^{-(n-3)/(n-1)} \, \tilde{r}_c(\theta, n), \tag{25} $$

where t is time and r̃_c(θ, n) is a nondimensional constant which varies with the secondary
creep constant n. α is a scaling factor also dependent on n and is defined below.

$$ \alpha = \frac{1}{2\pi} \left[ \frac{(n + 1)^2}{2n \, \alpha_n^{n+1}} \right]^{2/(n-1)}, \tag{26} $$

where α_n^{n+1} ≈ 0.69 for 3 ≤ n ≤ 13 (Riedel & Rice 1980).
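
Equations (24) to (26) can be evaluated with a few lines of code. The sketch below is illustrative only. It assumes the forms (C_t)_ssc = [2(1 - ν²)K²/(EW)](F'/F)β ṙ_c, ṙ_c = [2α/(n - 1)] K² (EA)^{2/(n-1)} t^{-(n-3)/(n-1)} r̃_c and α = (1/2π)[(n + 1)²/(2n × 0.69)]^{2/(n-1)}. The creep zone size function is obtained here by integrating (25) in time and is not quoted from the text. The material constants, the value of r̃_c and the value of F'/F are hypothetical.

```python
import math

ALPHA_N_POW = 0.69  # alpha_n^(n+1), approximately 0.69 for 3 <= n <= 13 per the text


def alpha_scale(n):
    """Scaling factor alpha, equation (26)."""
    return (1.0 / (2.0 * math.pi)) * ((n + 1.0) ** 2 / (2.0 * n * ALPHA_N_POW)) ** (2.0 / (n - 1.0))


def creep_zone_size(K, E, A, n, t, r_tilde):
    """r_c = alpha K^2 (E A t)^(2/(n-1)) r_tilde, the time integral of (25)."""
    return alpha_scale(n) * K**2 * (E * A * t) ** (2.0 / (n - 1.0)) * r_tilde


def creep_zone_rate(K, E, A, n, t, r_tilde):
    """Creep zone expansion rate, equation (25)."""
    prefactor = 2.0 * alpha_scale(n) / (n - 1.0)
    return (prefactor * K**2 * (E * A) ** (2.0 / (n - 1.0))
            * t ** (-(n - 3.0) / (n - 1.0)) * r_tilde)


def ct_ssc(K, E, W, nu, fp_over_f, r_dot_c, beta=1.0 / 3.0):
    """Small-scale creep value of C_t, equation (24)."""
    return 2.0 * (1.0 - nu**2) * K**2 / (E * W) * fp_over_f * beta * r_dot_c


# Hypothetical inputs: K in MPa*m^0.5, E in MPa, A in MPa^-n/h, t in h, W in m
K, E, A, n, t, r_tilde = 20.0, 160.0e3, 1.0e-20, 8.0, 100.0, 0.4

r_dot = creep_zone_rate(K, E, A, n, t, r_tilde)
dt = 1.0e-3 * t
r_dot_numeric = (creep_zone_size(K, E, A, n, t + dt, r_tilde)
                 - creep_zone_size(K, E, A, n, t - dt, r_tilde)) / (2.0 * dt)
ct_small_scale = ct_ssc(K, E, W=0.05, nu=0.3, fp_over_f=3.08, r_dot_c=r_dot)
```

The central-difference estimate of the creep zone growth rate reproduces (25), which confirms that the time exponent -(n - 3)/(n - 1) is consistent with a creep zone that grows as t^{2/(n-1)}. For n greater than 3 the expansion rate, and therefore (C_t)_ssc, decreases with time.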

For EL-PC materials

(27)

where I_{n1} is a nondimensional constant dependent on the primary creep constant n_1.
A_1 and p are also primary creep constants (see footnote 2).

For a wide range of creep conditions C_t is expressed as

$$ C_t = (C_t)_{ssc} + C^*(t). \tag{28} $$

Expressions for (C_t)_avg can be obtained using (10) and (11) for materials exhibiting
complete creep reversal and those that do not, respectively. Corrections can be made
for the effects of crack-tip cyclic plasticity (Yoon et al 1992; Grover & Saxena 1994).

For elastic-cyclic plastic-secondary creep (EL-CPL-SC) materials showing complete 
creep reversal, 


n/~> \ i 2<xpr c (6,n) /i 
where $C_s^*$ is the steady-state part of $C^*$. $t_{pl}$ is the time for the crack-tip creep zone to
grow out of the cyclic plastic zone (Yoon et al 1992); thus, it accounts for retardation in
the creep zone expansion rate due to the presence of cyclic plasticity. It can be
calculated analytically using (30) or (31) (Yoon et al 1992; Grover & Saxena 1994) for
elastic-cyclic plastic-secondary creep (EL-CPL-SC) and elastic-cyclic plastic-primary-secondary
creep (EL-CPL-PC-SC) materials. It can also be obtained experimentally
(Yoon et al 1992; Grover & Saxena 1994).
² $n_1$, $A_1$ and $p$ are regression constants in the following constitutive equation used to describe the
creep behaviour of a material:

$$\dot\varepsilon = \frac{\dot\sigma}{E} + A_1\,\varepsilon^{-p}\,\sigma^{n_1(1+p)} + A\,\sigma^{n}$$

for EL-CPL-SC materials.

where $m$ is the plasticity exponent and $\sigma_{cy}$ is the cyclic yield strength of the material. This
equation accounts for primary creep effects and is thus suitable for EL-CPL-PC-SC
materials.

When primary creep dominates the small-scale creep conditions and the
secondary creep zone then grows out from within the primary creep zone to dominate the
steady-state creep condition, $(C_t)_{avg}$ for EL-CPL-PC-SC materials is expressed as

where $t_2$ is the transition time for extensive secondary creep conditions to develop from
a small-scale secondary creep zone contained within the primary creep field (Yoon 1990;
Saxena 1991).

The crack-growth rate vs crack-tip parameter correlations are then used for life 
assessment of components as discussed in the next section. 


4. Applications to life prediction 

The primary objective of this section is to briefly describe life prediction procedures, for 
power plant components, based on the crack-growth concepts discussed earlier. Life 
assessment of a steam-turbine casing has been discussed as a specific illustration. 

4.1 Remaining life assessment using crack growth models 

The remaining crack-growth life assessment methodology covers time-independent
fatigue crack growth, time-dependent creep crack growth and creep-fatigue interactions
affecting crack growth. The crack growth rate vs crack-tip parameter ($\Delta K$, $C_t$)
correlations, obtained as discussed in §3, are used in the life prediction steps
illustrated in figure 18 (Chell et al 1993), a schematic of the steps involved in
using time-dependent fracture mechanics concepts for remaining crack-growth life
prediction. Material data are required, including tensile properties, elastic and creep
deformation constants, fracture toughness, and the CCG and CFCG constants of the
material. Stress analysis is conducted on the component based on the
operating conditions and the geometry of the cracked component, and the appropriate
crack growth model (fatigue/creep/creep-fatigue) is selected for the material. Parameters
such as $K$ and $C^*$, needed to obtain $C_t$, are estimated using expressions available in
handbooks (Kumar et al 1981; Tada et al 1986; Zahoor 1991).

Either static or cyclic analysis can be performed for calculating the remaining life. In
a static analysis, a component such as a steam pipe or a header is assumed to operate
continuously, uninterrupted by start-up and shut-down (Saxena 1993). The life is then
estimated as the time for the defect/crack to grow from its initial size, $a_0$, to its final size,
$a_f$. Here $a_0$ is the crack size at inspection and $a_f$ is the critical size at which fracture can occur,
determined from fracture conditions. In a cyclic analysis, explicit account is
taken of the start-ups and shut-downs and the life is expressed as the number of such
cycles required to grow the crack from $a_0$ to $a_f$ (Saxena 1993).

Under static conditions, the remaining life, $t_R$, is obtained by integrating the CCG
rate law (Saxena 1993),

$$t_R = \int_{a_0}^{a_f} \frac{da}{(da/dt)}. \qquad (33)$$


The creep crack growth rate law (Saxena 1986; Saxena et al 1988) is of the form

$$da/dt = C_1\,[C_t(a,t)]^{q}, \qquad (34)$$

where $C_1$ and $q$ are regression parameters.
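Since $C_t$ in (34) depends on both crack size and time, the integral in (33) is normally evaluated numerically. The sketch below marches the crack forward with a simple explicit time step until the critical size is reached. It is an illustration under stated assumptions only: `ct_func(a, t)` is a hypothetical user-supplied function returning $C_t$ for the component, and the step size, units and stopping time are placeholders.

```python
def remaining_life_static(a0, af, C1, q, ct_func, dt=10.0, t_max=1.0e7):
    """Integrate da/dt = C1 * C_t(a, t)**q from a0 until the crack reaches af.

    ct_func is a hypothetical callable (a, t) -> C_t. Returns the elapsed time,
    or None if af is not reached within t_max.
    """
    a, t = a0, 0.0
    while a < af:
        if t >= t_max:
            return None
        # Evaluate at the end of the step: small-scale-creep C_t can be singular at t = 0.
        rate = C1 * ct_func(a, t + dt) ** q
        a += rate * dt
        t += dt
    return t
```

A smaller `dt` gives a more accurate estimate at the cost of more steps; in practice an adaptive integrator would usually replace this fixed-step loop.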

Under cyclic loading, the cyclic crack growth rate is first obtained for a given crack
size using empirical equations that are in the form of (22), and then a numerical
integration is carried out to obtain the remaining life $t_R$, expressed in cycles, using (Saxena 1993)

$$t_R = \int_{a_0}^{a_f} \frac{da}{(da/dN)}. \qquad (35)$$
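The numerical integration in (35) can be carried out with any standard quadrature rule. The following sketch uses the trapezoidal rule; `dadN` is a hypothetical user-supplied function returning the cyclic crack growth rate at crack size `a` (for example, an empirical law of the form of (22)), and the number of steps is an arbitrary choice.

```python
def remaining_cycles(a0, af, dadN, steps=1000):
    """Trapezoidal estimate of the integral of da / (da/dN) from a0 to af.

    dadN is a hypothetical callable a -> crack growth per cycle.
    """
    h = (af - a0) / steps
    total = 0.0
    for i in range(steps):
        a_left = a0 + i * h
        a_right = a_left + h
        total += 0.5 * h * (1.0 / dadN(a_left) + 1.0 / dadN(a_right))
    return total
```

For a growth law proportional to crack size, the result can be checked against the closed-form logarithmic integral.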


4.2 Life assessment of turbine casings 


Steam-turbine casings are large castings that encase the stationary and rotating
components of the turbine. Their main function is to contain the steam pressure and
maintain the alignment of internal components (Viswanathan 1989). They are made
from 1.25Cr-0.5Mo, 2.25Cr-1Mo and 1Cr-1Mo-0.25V steels. Many steam turbines
have been used beyond their design lives and, due to economic considerations, the
trend has been to use them for cyclic operation. Such factors have made cracking
a major concern with regard to the safe operation and operating life of these
components. This section briefly provides an insight into the nature of
cracking in turbine casings and illustrates the use of the remaining crack-growth life
assessment methodology discussed above.

4.2a Damage mechanisms and failure sites: Cracks in casings are typically located
at the steam inlets of HP and IP turbine sections because the local thermal stresses
are higher there. Figure 19 (Logsdon 1986) is a schematic of a casing showing the inlet
steam passages. Cracking in the HP and IP sections is typically found on the interior
surfaces of steam chests, valve bodies, bolt holes, seal casings and nozzle chambers
(Viswanathan 1989). Figure 20 is a schematic of the crack locations in the nozzle
area of a steam-turbine casing (Rasmussen 1986). The primary causes of crack initiation
and propagation in turbine casings are fatigue, creep-fatigue and brittle fracture
(Logsdon 1986). Thermal stresses are responsible for fatigue and creep-fatigue cracking,
while creep contributes to crack growth in regions where the temperature exceeds 427°C
(800°F) (Saxena 1993). The reader is referred to Viswanathan (1989) and Saxena (1993) for
further details on the damage mechanisms and failure sites in turbine casings.

Figure 19. Schematic of steam turbine casing showing inlet steam passages (from Logsdon
et al 1986).

4.2b Remaining life estimation: Casings are made of relatively ductile materials and have
large section areas which are not highly stressed. Thus, remaining-life assessments are
generally based on crack-growth considerations rather than crack-initiation considerations,
which are more useful at the design stage (Viswanathan 1989). The basic information
needed for remaining life estimation of turbine casings is given below (Saxena 1993).

* Operating history, including the number and type of starts (cold/warm/hot) per
year, the thermal ramp rates during heating and cooling, the amount of prewarming,
and the metal temperature at each stage, in order to determine thermal stresses.

* Material properties such as monotonic and cyclic stress-strain behaviour, creep stress
relaxation rates, fracture toughness, and the fatigue and creep-fatigue crack
initiation behaviour as a function of temperature. This information is also
needed for weldments, if they are involved.

Determination of various material properties is described by Saxena et al (1986). Using 
the above information, remaining life estimation can be performed following the steps 
listed below. 


Figure 20. Crack locations in first-stage nozzle area of a steam-turbine casing (from
Rasmussen 1986). Cracks at various locations are indicated by wiggly lines.

Figure 21. Schematic of the methodology for remaining life prediction (from
Saxena et al 1986).


(1) Determine the transient cyclic stresses for the turbine casing for the various 
operating histories by conducting a finite element analysis for the specific casing 
design. The first step in this analysis is the generation of transient temperature 
distribution using heat transfer analysis. The temperatures thus obtained are input 
into the stress analysis portion of the program along with the pressure and external 
loads. 

(2) The stress-strain and time histories at critical locations are then used to predict the 
fatigue and creep-fatigue crack initiation and propagation behaviour using the 
models described earlier. The total remaining life is the sum of the crack initiation 
and the crack propagation lives. However, if cracks have already initiated, only 
crack propagation life must be determined. 












Creep crack growth in power plant materials 


Figure 22. Remaining life as a function of crack depth for cracks in a steam turbine
casing (from Saxena et al 1986). Horizontal axis: inspection crack depth, a (cm).


Figure 21 (Saxena et al 1986) is a schematic of the various steps involved in conducting
fracture mechanics analysis for remaining life prediction of a cracked turbine casing.
Further, figure 22 (Saxena et al 1986) shows the results of calculation of remaining life as
a function of initial crack size for a ship's service turbine generator casing (SSTG
casing).


5. Special considerations for high-temperature crack growth in weldments 

Most of the problems encountered in heavy-section piping occur at welded joints. 
Thus, it is desirable to focus damage-assessment techniques on these areas rather than 
on the base metal. However, many problems that are unique to weldments make both 
stress analysis and material characterization extremely challenging (Viswanathan 
1989). Some of these problems are discussed below. 

(1) The weldment can be considered a composite material consisting of the base
metal, three or more heat-affected zones (HAZ) and the weld metal. Thus, there is
an unpredictability in the modelling of its mechanical behaviour.

Figure 23. Distribution of steady-state hoop stress across a girth weld in a steam
pipe. The initially uniform elastic stress distribution redistributes such that
weak weld metal off-loads stress to the stronger parent metal (from Viswanathan
et al 1982).


(2) Collection of impurities and inclusions occurs at the fusion line between the weld 
and the HAZ because it is the last region to solidify. This creates complications in 
stress analysis. 

(3) Uniaxial creep rupture data for a material cannot be used to predict the rupture
properties of the weld because a multiaxial stress state exists at a weld. This is due
to the complex geometry and the influence of complex loading. In addition, the
creep resistances of the base metal, the weld metal and the HAZ are quite
different and cause the initial elastic stresses to redistribute in complex ways.
These distributions can be different for girth welds and for seam welds. In girth
welds the pressure stresses in the axial and hoop directions peak at the most
creep-resistant (strongest) location, i.e. the HAZ, and are lowest at the weld
metal, which is the least creep-resistant location (Cane 1982). This is shown in
figure 23 (Viswanathan et al 1982). However, in seam welds, the peak stresses
after stress redistribution occur in the weld metal (weakest location) and the
lowest stresses are encountered in the HAZ (Wells 1987).

(4) A good material-property database does not exist for weldments. The limited
amount of available data pertains only to pure base metal, weld metal, or just
to the HAZ material. Thus, prediction of crack growth in weldments is not
a trivial task. Generation of such data in the laboratory is not easy either.
In the absence of a reliable database, the convenient practice has been to
assume that the weld is always weaker than the base metal and then to use base
metal properties to model the weld behaviour by applying arbitrary safety
factors.

(5) Welds also serve as sites for fabrication defects which can act as stress raisers and 
preferred sites for cavity growth and crack initiation. The presence of impurities 
like sulphur further worsens the situation. 

As mentioned above, both the crack growth database and the analytical procedures 
are inadequate. Significant developments are thus needed in addressing cracking 
problems in elevated temperature weldments. 
Remaining life assessment of high temperature components
using threshold stress concept

RAGHUBIR SINGH 

Materials Evaluation Division, National Metallurgical Laboratory,
Jamshedpur 831007, India

Abstract. The paper first presents global experience, including that of our
laboratory, mainly with power plant components to show that it is possible
to extend the life of some critical components 2-3 fold over the design life.
Different approaches for remaining life assessment are also briefly discussed.

It appears that the main creep damage, in so far as Cr-Mo and Cr-Mo-V
steels are concerned, is an accumulation of creep strain. Hence, this paper
suggests a new approach for creep life prediction based on the threshold stress
concept and the stress-strain rate relationship. The approach is based on the
analysis of creep and hot tensile data generated on a popular 1Cr-0.3Mo-0.25V
steel in the service-exposed and the virgin states, and also on
samples with a wide range of microstructures prepared from the virgin
state. A summary of the main findings leading to the development of the new
approach is given.

Keywords. Creep resistant steels; creep strain damage; threshold stress; 
remaining life. 


1. Introduction 

Low alloy ferritic steels such as 1Cr-0.5Mo, 2.25Cr-1Mo and 0.5Cr-Mo-V are extensively
used for high temperature applications in the power, chemical and oil industries. This is
because they have very good elevated temperature creep strength together with good
resistance to oxidation and hydrogen embrittlement. In the power industry these steels
have been used for components such as turbine rotors, cylinders, steam chests, steam
pipes and headers. The typical service temperature and stress in the power industry are
about 550°C and 50 MPa respectively. Thus the components operate in the temperature
range where creep is the main life-limiting factor. Prolonged service exposure of
boiler and turbine components could lead to degradation of their mechanical properties,
and hence these are designed for a definite life (10-20 years). In many cases
component replacement after expiry of the design life is not necessary because of
‘conservatism’ at the design stage, which calls for methods for full life exploitation.
In other cases, however, operating conditions can be more burdensome than envisaged by the
designer, and plant integrity assessment may be required before the design life is
reached to ensure safety. Thus, remaining life assessment is needed for reasons of both
economy and safety.

The available approaches for creep life prediction can be broadly classified into the
following (Cane & Townsend 1983; Viswanath 1989):

■ operational parameter approach;

■ post-service examination approach.

The first approach depends on the evaluation of the service temperature/stress history 
for a given component. Once the history is known then combining it with the lower 
bound value of the standard material data and the life fraction rule an estimate of 
remaining life can be obtained as illustrated in figure 1. However, this approach has not 
been fully successful because of several limitations: 

■ the operational data with requisite precision are not always available;

■ standard material data have a wide scatter and often cannot be reliably extrapolated
to the rather low operating stresses;

■ the life fraction rule used in this approach is not always applicable.

Thus the approach at best can serve the purpose of obtaining a very rough estimate of 
creep life. 
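The life fraction rule referred to above sums, over each period of service, the ratio of the time spent under a given stress/temperature condition to the rupture life under that condition; the life is taken as exhausted when the sum reaches unity. A minimal sketch follows. The function names and the numbers in the usage note are illustrative assumptions, not data from the references.

```python
def life_fraction_consumed(history):
    """Sum of t_i / t_r_i over the service history.

    history: iterable of (time_spent_h, rupture_life_h) pairs, where rupture_life_h
    is the (lower-bound) rupture life at the stress and temperature of that period.
    """
    return sum(t / t_r for t, t_r in history)

def remaining_life(history, rupture_life_future_h):
    """Remaining life (h) if future service continues at a condition whose
    rupture life is rupture_life_future_h."""
    consumed = life_fraction_consumed(history)
    return max(0.0, 1.0 - consumed) * rupture_life_future_h
```

For instance, a component that has spent 50,000 h at a condition with a 200,000 h rupture life and 25,000 h at a condition with a 100,000 h rupture life has consumed half of its life under this rule.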

The second approach, based on direct evaluation of the component after prolonged
service exposure, is undoubtedly more accurate and does not require detailed
knowledge of the service history. Hence it is widely used. Among the numerous
techniques available for post-service evaluation, listed in table 1, accelerated stress
rupture testing is the most common. In order to obtain results within a reasonable
period of time, these tests have to be accelerated by testing at higher stresses, at
higher temperatures, or both. The short-term rupture data so obtained are extrapolated
to ascertain long-term life at lower stresses/temperatures, as illustrated in figure 2(a, b).
This approach has the following limitations:

■ being empirical, its validity is limited to the domain of the available database, i.e. the
short-term data have only limited extrapolation capacity;

■ the procedure is time consuming and expensive.

On the whole, the method gives a first estimate of the life.
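The extrapolation of accelerated rupture data is commonly done through a time-temperature parameter such as the Larson-Miller parameter, $P = T(C + \log_{10} t_r)$, which is the abscissa used in figure 2(a). The sketch below fits $P$ as a straight line in $\log_{10}$(stress) and inverts it at the service condition. It rests on assumptions not taken from the paper: the constant $C = 20$, the linear fit, and all function names are illustrative choices.

```python
import math

def larson_miller(T_kelvin, t_r_hours, C=20.0):
    """Larson-Miller parameter P = T * (C + log10(t_r)); C = 20 is an assumed constant."""
    return T_kelvin * (C + math.log10(t_r_hours))

def fit_line(xs, ys):
    """Ordinary least-squares straight line y = slope * x + intercept."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, my - slope * mx

def extrapolate_rupture_life(tests, stress_service, T_service, C=20.0):
    """Estimate rupture life (h) at service stress and temperature.

    tests: list of (stress, T_kelvin, t_r_hours) from accelerated rupture tests.
    """
    xs = [math.log10(s) for s, _, _ in tests]
    ys = [larson_miller(T, t, C) for _, T, t in tests]
    slope, intercept = fit_line(xs, ys)
    p_service = slope * math.log10(stress_service) + intercept
    return 10.0 ** (p_service / T_service - C)
```

The estimate is only as good as the fitted relation outside the tested range, which is the limitation on extrapolation capacity noted above.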

It is important to recall here that the critical components are allowed to undergo
only a limited creep strain (1-2%) and there is evidence (Wufee 1983; Weber 1993)
to suggest that the Cr-Mo and Cr-Mo-V types of steel can sustain this strain without
entering the tertiary stage.

Figure 1. Schematic representation of operational parameter approach (Cane & Townsend
1983).

Table 1. Techniques available for post-service evaluation.

| Technique | Comments |
|---|---|
| Physical: density, resistivity, magnetic permeability, small-angle neutron scatter | May be suitable for cavity assessment* |
| Mechanical: tensile (room temperature), tensile (hot), hardness, Charpy | May be useful for rough assessment |
| Low-cycle fatigue | Limited applicability |
| Metallographic examination using acetate replica; micro-sample removal | Useful provided coupled with creep strength |
| NDE method: oxide scale thickness measurement | Useful if damage mechanism is oxidation |
| Accelerated stress-rupture testing | Useful provided extrapolation is done judiciously |

In view of this it appears more appropriate to develop a life prediction
methodology based essentially on creep deformation data, such as the creep rate vs
stress relationship, particularly when the service stress is lower than the threshold stress,
as has been proposed by the author (Singh & Banerjee 1992).

What follows is a brief presentation of global experience and that of NML on
the vital issue of “in-service degradation”, mainly to bring into focus the need for a creep
deformation database in the low-stress region to:

■ determine the threshold stress;

■ develop a reliable relationship between creep strain rate and stress for life prediction.


2. Global experience 

The assessment of accumulated damage and the estimation of remaining life of components 
particularly of boiler pressure parts has been the subject of intense study for the past several 
years. A large volume of information has been generated as can be seen in the proceedings 
of recent conferences (Wufee 1983; Dooley 1987). This information must be critically 
analysed to evolve a reliable method for predicting remaining life of components. Indeed, 
this will vary from component to component depending on the operating condition. 

In steam power plants, the following components have received special attention for 
life extension: 

■ superheater (SH) and reheater (RH); 

■ headers; 

■ main steam pipes; 

a superheater and reheater. 
Figure 2. Post-service examination approach (Cane & Townsend 1983): (a) stress vs
Larson-Miller parameter plots; (b) temperature vs rupture life plots.


SH and RH fail due to one or a combination of the following causes:

■ overheating:
  - short-term overheat;
  - long-term overheat;

■ corrosion:
  - fire-side corrosion;
  - steam-side corrosion;
  - stress-corrosion cracking;

■ fly-ash erosion;

■ exfoliation.

This clearly reveals that life assessment of SH and RH requires a comprehensive
approach which takes into account all the factors responsible for the failure. This is
outside the scope of this paper. However, some approaches have been discussed in
another article of this issue (Ghosh et al 1995).

3. Header and main steam pipe 

Unlike SH and RH, the headers and steam piping operate under fairly steady-state
stress and temperature. These are very expensive components and their failure, besides
causing long-term outages, can be catastrophic. Hence, they deserve special attention.
Headers and main steam pipes essentially undergo two processes during service
exposure:

■ creep deformation under the long-term influence of temperature/stress;

■ change(s) in microstructure and properties under the long-term influence of
temperature/stress.

Information on these two aspects is of crucial importance for life assessment and
extension.


4. Creep deformation in service 

In the coded design, the limit on creep strain rate is $10^{-7}$/h and the accumulated creep
strain should be less than 1%. That the actual creep rate is much lower than that envisaged
at the design stage is borne out by table 2 (Wufee 1983) and table 3 (Auerkan 1983). It is
significant that hardly 0.2% creep had occurred after a service life of 100,000 h. Though
the data given in table 3 do not report the value of accumulated creep strain, operational
times as high as 250,000 h indirectly indicate that the creep rate was much less than
$10^{-7}$/h. Similar service experience has been widely reported recently (Dooley 1987).
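The arithmetic behind these statements is simple and can be made explicit. The sketch below assumes a constant (steady-state) creep rate throughout service, which is an idealization, and uses hypothetical function names.

```python
def average_creep_rate(strain_percent, hours):
    """Average creep rate (per hour) for a given accumulated strain in percent."""
    return (strain_percent / 100.0) / hours

def time_to_strain_limit(rate_per_hour, limit_percent=1.0):
    """Hours needed to accumulate limit_percent strain at a constant creep rate."""
    return (limit_percent / 100.0) / rate_per_hour

# At the design-limit rate of 1e-7 /h, the 1% strain limit is reached in about 1e5 h.
print(time_to_strain_limit(1.0e-7))
# 0.2% strain after 100,000 h corresponds to an average rate of about 2e-8 /h.
print(average_creep_rate(0.2, 100_000))
# A pipe still within the 1% limit after 250,000 h has averaged at most about 4e-8 /h.
print(average_creep_rate(1.0, 250_000))
```

On this basis the observed rates are lower than the design-limit rate by a factor of roughly 2.5 to 5.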

In some plants the diametrical expansion of the steam pipe is also measured at
regular intervals. The expansions are measured at certain pre-fixed locations, along
both horizontal and vertical axes. Some such data are
shown in figures 3 and 4 (Weber 1983). These data, besides showing that the actual creep
rate is generally less than $10^{-7}$/h, bring out another important fact: the mode of creep
deformation remained essentially steady state, and tertiary creep had not set in
even when the creep strain had nearly reached the prescribed limit of 1%. This is very
useful information from the viewpoint of life prediction based on knowledge of
the creep rate at the operating stress and temperature. This aspect is dealt with
subsequently. Another interesting point that emerges from figures 3 and 4 is that the
expansion along the horizontal axis is often larger than that along the vertical axis.


Table 2. Practical examples of life prediction by the comprehensive analysis method and
assessment of results (Wufee 1983).

| Name of pipe work* | Material | Total operation duration of sampling (h) | Stress evaluation: σ(10^5) (MPa) | Stress evaluation: σ_zs (MPa) | Stress evaluation: K | Creep strain (%) | Conclusion (continue operation, hours) | Operation duration up to now (×10^5 h) |
|---|---|---|---|---|---|---|---|---|
| MSP | 12CrMo | 107675 | 133.3 | 46.4 | 2.9 | <0.2 | 10^5 | |
| MSHS | 15CrMo | ~137000 | 109.8 | 46.6 | 2.4 | | 10^5 | |
| MSP | 12Cr1MoV | 90000 | 107.8 | 57.2 | 1.9 | 0.15 | 10^5 | |
| MSP | 12Cr1MoV | 110660 | 107.8 | 57.2 | 1.9 | 0.15 | 10^5 | |
| MSP | 12Cr1MoV | 101794 | 110.8 | 61.8 | 1.8 | 0.13 | 10^5 | |

*MSP - main steam pipe; MSHS - main steam header system; σ(10^5): stress for rupture in 10^5 h;
σ_zs: service hoop stress; K: σ(10^5)/σ_zs.

Table 3. Inspected steam pipings (Auerkan 1983).

| Plant | Nominal steam values (°C/MPa) | Type of material | Operational time, h (design life 100,000 h) |
|---|---|---|---|
| A | 510/6 | 1Cr-0.5Mo | 238,000 |
| B | 480/6 | 1Cr-0.5Mo | 250,000 |
| C | 530/11.5 | 1Cr-0.5Mo, 2.25Cr-1Mo | 148,000 |
| D | 520/6 | 1Cr-0.5Mo | 156,000 |
| E | 525/11.5 | 1Cr-0.5Mo | 135,000 |


Some reasons suggested for the gap between anticipated and actual creep rate are: 

■ the mean diameter formula used for calculating hoop stress yields 15% higher 
values (Cane 1983). 

■ the steels used for such critical components have the creep properties better than the 
‘minimum’ property data used for design. 

■ stress exponent of ~ 4 generally used by designers could be higher. 
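The mean diameter formula mentioned in the first point is commonly written as $\sigma = p(D_o - t)/(2t)$, with $p$ the internal pressure, $D_o$ the outside diameter and $t$ the wall thickness. The sketch below evaluates it; the formula is assumed to be the form intended here, and the pipe dimensions and pressure in the usage note are illustrative values only.

```python
def hoop_stress_mean_diameter(p_mpa, od_mm, wall_mm):
    """Hoop stress (MPa) from the mean diameter formula: p * (OD - t) / (2 * t)."""
    mean_diameter = od_mm - wall_mm
    return p_mpa * mean_diameter / (2.0 * wall_mm)
```

For an illustrative pipe of 273 mm OD and 22 mm wall under 9 MPa, this gives a hoop stress of a little over 51 MPa, of the same order as the typical service stress of about 50 MPa quoted in the Introduction.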

For steam pipes which work under the stipulated design conditions, life extension to
a period as high as 40 years has been suggested for the straight portions of the pipes
(Neubauer & Wadel 1983).

5. Microstructure and property changes 

Microstructural changes that develop after prolonged service at high temperature
have attracted the attention of materials scientists worldwide, because their study and
characterization can be a very attractive tool for remaining life assessment. Though the
information available (Foldina et al 1973; William & Wilshire 1981; Senior 1988; Singh
& Banerjee 1991) is sizeable, the goal has so far been elusive.

Figure 3. Expansion of intermediate superheater line of 1Cr-0.5Mo, 398 (OD) ×
18 mm wall, temperature = 530°C (Weber 1983).

Figure 4. Expansion of live steam line of 1Cr-0.5Mo, 318 (OD) × 40 mm wall,
temperature = 530°C (Weber 1983).

The changes in microstructure mainly consist of coarsening of carbides, compositional
and morphological changes in the carbides, increase in the inter-particle spacing,
formation of new carbides, grain coarsening etc. Here, it is noteworthy that creep
damage after prolonged service exposure in the form of grain boundary cavities has
rarely been reported in so far as Cr-Mo and Cr-Mo-V steels are concerned. It appears
that there exists a critical strain limit for initiation of grain boundary cavities. According to
one report (Weber & Wellnitz 1993), the limits are: 3% for 2.25Cr-1Mo, 2% for
1.25Cr-0.5Mo and 1% for 0.5Cr-0.5Mo-0.25V as well as 12Cr-Mo-V steels.

Post-exposure samples have been tested by a number of workers (Cane & Townsend
1983; Weber 1983; Nakashiro et al 1990; Singh & Banerjee 1991) and some typical
results are reported in figures 5 and 6. One interesting fact emerges: there is indeed
degradation in the rupture strength of the service-exposed samples, but only above
a certain stress level. At low stresses (long-term rupture strengths) virgin and
service-exposed materials are hardly distinguishable. Even milder degradation has recently
been reported (Cheruvu 1989) in the case of turbine rotors, where service temperatures are
considerably lower.

NML experience is also by and large in conformity with the above observation, as
will be seen later.

6. NML’s experience with boiler and steam pipe 

The author, jointly with other team members, has investigated over the last two
decades a large number of tube failures received from various steam power plants in the
country. Also, a sizeable number of samples have been investigated for health assessment
and life extension.

Figure 5. Stress vs Larson-Miller parameter curve of superheater tube (2.25Cr-1Mo
steel) after 111,940 and 175,562 hour service at plant (Nakashiro et al 1990).


Premature failures of tubes in the different zones of boilers, such as steam
generators, superheaters and reheaters, were found to have occurred due to a variety
of reasons; a summary has been reported (Singh 1993). Overheating is indeed an
important factor leading to premature failures, which could be grouped under two
categories:

■ short-term excursion of runaway nature, and
■ long-term excursion of temperature.

The majority of failures belonged to the first category and only a few to the second.
This indirectly suggests that creep has been adequately taken care of in the design and
also that the situation calls for the full exploitation of ‘safe life’.

Samples for remaining life assessment and life extension recently received at NML
are summarised in table 4. Accelerated stress-rupture tests, metallographic examination
and other tests did not show any major sign of degradation after ten years of
service.

Although a lot of information on life assessment and extension studies is
available in the literature, it is often too fragmented to give a complete picture. That
is, comprehensive information on microstructure, mechanical properties, creep-rupture
properties etc., for a specific type of steel in both virgin and service-exposed states, is
hardly available. Hence, a strong need was felt by the author to carry out a systematic
study covering all these aspects: post-exposure analyses of microstructural changes as
well as of degradation in creep strengths and ductilities over wide-ranging
stresses/temperatures, changes in hardness, and room- and elevated-temperature
tensile properties. A brief account of the work which led to the proposal of a new life
prediction methodology based on the threshold stress concept is presented.

Figure 6. Creep isotherms with (O) and without (A) service exposure, main steam pipe of
1Cr-0.3Mo-0.25V steel (Weber 1983).

Table 4. Samples for life extension recently investigated at NML.

| Sample | Source | Steam value/exposure (°C/atm/years) | Recommended remaining life |
|---|---|---|---|
| Main steam pipe, 1Cr-0.3Mo-0.25V: 8 Nos., 273 (OD) × 22 mm thick | NLC | 540/90/10 | 10^5 |
| Reheater, 2.25Cr-1Mo | Korba | 550/35/10 | 10^5 |
| Superheater, 1Cr-0.3Mo-0.25V | SAIL | 450/40/10 | 10^5 |
| Heater tube, 1.25Cr-0.5Mo | IPCL | 600/7/11 | 10^5 |
| CCU reactor, C-Mn (ASTM 201), 300 (OD) × 23 mm thick | BPCL | 480/2.1/35 | 10^5 |
| Catalytic reforming tube, 5Cr-0.5Mo, 219 (OD) × 16 mm thick | IOC | 500/5/20 | 10^5 |

Abbreviations: NLC - Neyveli Lignite Corporation; SAIL - Steel Authority of
India Ltd; IPCL - Indian Petrochemicals Corporation Ltd; BPCL - Bharat Petrochemicals
Corporation Ltd; IOC - Indian Oil Corporation.

7. Experimental 

The new method for creep life prediction being proposed in this paper is based on 
a systematic study carried out on a sample of the main steam pipe of 1 Cr-03 Mo- 
025 V steel. Experimental details and findings have been reported in other papers 
(Singh & Banerjee 1991, 1992). However, a brief summary may be in order here. 

Microstructural changes encountered during long-term creep exposure and their 
adverse impact on creep properties have been central to the most of the R & D work in 
the field. But useful conclusions can be derived only when both the creep and the 
microstructural aspects are simultaneously and extensively studied using a proper mix 
of samples. The study is a unique attempt to meet this objective since it is extensively 
devoted to; 

■ detailed study of the morphological and compositional changes of the carbides in 
both virgin and service-exposed samples (Singh & Banerjee 1990, 1991); 

Table 5. Details of the 1 Cr-0.3 Mo-0.25 V steel samples investigated. 

Sample   State/dimension                        Heat treatment*                       Microstructure                       Hardness 
VN       Virgin pipe, 273 mm OD x 22 mm thick   Commercially N at 980°C + T at 700°C  90% ferrite, 10% bainite             156 
SE       273 mm OD x 22 mm thick                Same as VN                            90% ferrite                          152 
RHV1     15 mm dia                              Austenitized                          100% bainite                         210 
RHV2     15 mm dia                              N at 980°C (1 h) + T at 700°C (2 h)   80% ferrite, 20% bainite             190 
RHV3     15 mm dia                              N at 980°C (1 h) + T at 700°C (72 h)  80% ferrite, 20% tempered bainite    160 

*N - normalized; T - tempered 


■ extensive creep-rupture and hot tensile properties of samples having widely varying 
microstructures, including the one produced in the service-exposed material. 

The details of the five samples that have been studied are given in table 5. The chemical 
composition is reported in table 6. 

The SE samples were removed from the region close to the boiler end, which had seen 
the highest service temperature. It is notable that the VN and the SE samples 
belonged to the same manufacturing batch. 

Our microstructural findings (Singh & Banerjee 1990, 1991) have brought 
out a significant point: morphological and compositional changes of the carbides 
impair the creep strength only at high stresses, where Orowan bowing is the dominant 
creep deformation mechanism. On the other hand, creep strength at the low stresses 
customarily experienced by engineering components is not affected appreciably. 

The main findings concerning the creep and hot tensile studies which have provided 
the basis for the new approach are discussed subsequently. 


8. The threshold stress 

The threshold stresses for the five different microstructures were obtained using the 
Lagneborg-Bergman (1976) approach. This is schematically shown in figure 7, wherein 
the creep data are plotted in the form of the fourth roots of the minimum creep rates, 
$(\dot{\varepsilon})^{0.25}$, vs the applied stresses, $\sigma$. Line 1 corresponds to the low stress sensitivity ($n = 4$) 
obtained at lower stresses and line 2 corresponds to the high stress sensitivity ($n \gg 4$) 
obtained at higher stresses. The extension of line 2 to a strain rate equal to zero gives 




Table 6. Chemical composition (wt%) of the pipe investigated. 

C      Mn     P      S       Si     Cr     Mo     V      N 
0.11   0.30   0.01   0.007   0.35   1.01   0.28   0.23   0.023 

Figure 7. Schematic representation: determination of threshold stress from creep data 
for a given microstructure. 


the value of the threshold stress $\sigma_H$. The threshold stress is thus 
the highest value of the applied stress which, if exceeded, produces runaway creep (high 
value of $n$). 

Figure 8 shows the $(\dot{\varepsilon})^{0.25}$ vs $\sigma$ plot in the high and in the low stress sensitivity regions for 
each of the five microstructural states. The intercepts of the lines A, B, C, D and E with 
the abscissa give the values of $\sigma_H$. The transition stress, $\sigma_T$, represents the stress level 
above which a change in creep deformation mechanism is believed to occur (Lagneborg 
& Bergman 1976; Blum & Reppich 1985). 
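
The extrapolation described above can be sketched numerically. The following is a minimal illustration, not the authors' procedure: it assumes that only the high-stress (line 2) data points are supplied, fits a straight line to $(\dot{\varepsilon})^{0.25}$ vs $\sigma$ by least squares, and takes the intercept with the abscissa as $\sigma_H$. The function names and the data are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def threshold_stress(stresses_mpa, min_creep_rates_per_h):
    """Extrapolate the high-stress line of rate**0.25 vs stress to zero rate."""
    fourth_roots = [rate ** 0.25 for rate in min_creep_rates_per_h]
    slope, intercept = fit_line(stresses_mpa, fourth_roots)
    # Stress at which the fitted line meets the abscissa (zero creep rate).
    return -intercept / slope


# Hypothetical high-stress data generated from rate**0.25 = 1e-3 * (sigma - 100),
# i.e. a made-up microstructure whose threshold stress is 100 MPa.
stresses = [150.0, 175.0, 200.0, 225.0]
rates = [(1e-3 * (s - 100.0)) ** 4 for s in stresses]
sigma_h = threshold_stress(stresses, rates)
print(round(sigma_h, 1))
```

With real data the choice of which points belong to the high stress sensitivity region (above $\sigma_T$) has to be made before fitting; the sketch does not attempt that classification.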

The figure shows the following important points: 

■ The values of $\sigma_H$ for the VN and the SE samples differ only by 12 MPa, which means 
that a service exposure of $10^5$ h produces only a mild degradation. This observation 
is also in keeping with the insignificant microstructural changes observed after 
an exposure of $10^5$ h in service (Singh & Banerjee 1991). 





Figure 8. Threshold stress for different microstructures in 1 Cr-0.3 Mo-0.25 V steel 
(Singh & Banerjee 1992). Abscissa: applied stress, $\sigma$ (MPa). 

Figure 9. Minimum creep rate in the low stress sensitivity region (Singh & Banerjee 1992). 


■ The creep strength varies significantly with change in original microstructure, but 
only at higher stresses. When $\sigma < \sigma_T$, the data points for all microstructures fall on 
the same line OM. 

■ The best-fit line OM (see figure 9) passes through the origin. The equation of the line 
OM is: 

$(\dot{\varepsilon})^{0.25} = M\sigma$, (1) 

where 

$M = 3 \times 10^{-4}\ \mathrm{h}^{-0.25}/\mathrm{MPa}$. 

It may be noted that the line OM is microstructure-independent and hence can be used 
for any given microstructure for life prediction. It is also noteworthy that the 2.25 Cr-1 Mo 
steel (Wolf 1990) exhibited a similar trend (figure 10). However, the lower values of 
$\sigma_H$ and $M$ may be noted. This is understandable in view of the 
lower creep strength of 2.25 Cr-1 Mo. 



Figure 10. Threshold stress of 2.25 Cr-1 Mo steel based on published data (Wolf 1990). 

Figure 11. Threshold stress vs hot yield stress at 550°C (Singh & Banerjee 1992). 


9. Correlation of $\sigma_H$ with hot yield strength 

$\sigma_H$ can indicate the extent of microstructural change during creep exposure. It has also 
been contended that $\sigma_H$ “can be used as a design-limiting strength property” (Russel 
et al 1985). Notwithstanding this contention, it should be noted that to achieve a 
significantly long life, the stress level in a power plant component used in high 
temperature service must be considerably lower than $\sigma_H$. On the other hand, $\sigma_H$ 
represents an upper-bound value of the permissible stress. Indeed, the service stress at 
any point of time must always be less than $\sigma_H$ to avoid runaway creep deformation. 
Therefore, a simple experimental method to determine $\sigma_H$ should be useful. The 
metallographic methods to determine $\sigma_H$ are difficult to carry out and liable to yield 
uncertain results (Cane & Townsend 1983). 

Hence, the correlation of $\sigma_H$ with the hot tensile yield strength, $\sigma_Y$, as observed by us is 
significant. This is plotted in figure 11, which indicates that 

$\sigma_H = 0.48\,\sigma_Y$. (2) 

A similar observation has been made on Ni-base superalloys (Russel et al 1985), where 
$\sigma_H = 0.7\,\sigma_Y$. Purushothaman & Tien (1978) have also theoretically predicted the tensile 
Orowan stress and found that it is nearly equal to $\sigma_H$. This indicates that the threshold 
stress is nearly equal to the Orowan stress and is proportional to the hot yield strength. 
Also, it can decrease to a lower limit of about one-half of the hot yield strength as 
temperature increases and/or strain rate decreases (Purushothaman & Tien 1978). 

10. Conclusions 

Procedure for creep life prediction 

Based on the main findings as represented by (1) and (2), the following steps can be 
suggested for a rapid assessment of the remaining creep life of a component made of 
1 Cr-0.3 Mo-0.25 V steel and subjected to the usual service temperature of 550°C 
(barring accidental short-term excursions to a higher temperature). 

(1) Determine the hot yield strength, $\sigma_Y$, at the service temperature and estimate $\sigma_H$ 
from the relationship $\sigma_H = 0.48\,\sigma_Y$. 

(2) Determine $\dot{\varepsilon}_m$ at six stress levels, three of them between 0.4 and 0.5 times and the 
other three at 0.6, 0.7 and 0.8 times the hot yield stress. The maximum duration of one 
such test is only 1500 hours. 

(3) Plot the $(\dot{\varepsilon}_m)^{0.25}$ vs $\sigma$ curve in a manner similar to that in figure 5 and determine $\sigma_H$. 

(4) If $\sigma_H$ is greater than the service stress, $\sigma$, calculate the residual life in terms of the 
time required to produce the balance amount of strain to make up the maximum 
permissible strain of 1%. The calculation uses the simple relationship: 
$(\dot{\varepsilon}_m)^{0.25} = M\sigma$. 

(5) If $\sigma_H$ is less than the service stress, $\sigma$, retire the component and replace it at the earliest. 
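
Steps (1), (4) and (5) can be illustrated with a short calculation. This is only a sketch under stated assumptions: the value of $M$ is taken from equation (1) in units of h^-0.25 per MPa, strain is assumed to accumulate at the constant minimum creep rate, and the hot yield strength, service stress and strain already accumulated are hypothetical inputs. The case $\sigma_H$ equal to the service stress is not addressed in the text and is treated here as "retire".

```python
M = 3e-4             # h**-0.25 per MPa, from equation (1)
YIELD_FACTOR = 0.48  # sigma_H / sigma_Y, from equation (2)
MAX_STRAIN = 0.01    # maximum permissible strain of 1%


def estimate_threshold_stress(hot_yield_mpa):
    """Step (1): threshold stress estimated from the hot yield strength."""
    return YIELD_FACTOR * hot_yield_mpa


def min_creep_rate(stress_mpa):
    """Minimum creep rate (per hour) in the low-stress region, from equation (1)."""
    return (M * stress_mpa) ** 4


def residual_life_hours(service_stress_mpa, hot_yield_mpa, accumulated_strain):
    """Steps (4) and (5): time to reach 1% strain, or 0.0 if the part should be retired."""
    sigma_h = estimate_threshold_stress(hot_yield_mpa)
    if sigma_h <= service_stress_mpa:
        return 0.0  # step (5); equality treated as retire (assumption)
    balance = MAX_STRAIN - accumulated_strain
    if balance <= 0.0:
        return 0.0  # permissible strain already used up
    return balance / min_creep_rate(service_stress_mpa)


# Hypothetical component: 50 MPa service stress, 200 MPa hot yield strength,
# 0.2% strain already accumulated in service.
print(residual_life_hours(50.0, 200.0, 0.002))
```

For these made-up inputs the estimate is of the order of $10^5$ h; the result scales with the inverse fourth power of the service stress, so small errors in stress have a large effect on the predicted life.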

At present, the total duration of creep testing using the stress-rupture approach for 
creep-life prediction is about 15,000 h. This can be reduced five-fold if the new approach is 
used. Furthermore, if (1) and (2) are fully established, creep testing can be completely 
dispensed with. 

Further study 

There is a need to generate a strong database at stresses approaching the service stress of 
30-60 MPa and at several temperatures between 500 and 650°C. This should be done 
for various grades of both virgin and service-exposed steels so that (1) and (2) can be 
fully established. 

It would also be interesting to assess the lowest possible value of the threshold stress 
after, say, 20-30 years of exposure. This will reveal if $\sigma_H$ will ever go as low as the service 
stress. 

Generation of data at low stresses is rather tedious and would require very close control 
of the temperature (± 1°C) and LVDTs with improved stability and sensitivity. 

Multiaxial fatigue - An overview 

V M RADHAKRISHNAN 

Metallurgical Engineering Department, Indian Institute of Technology, 
Madras 600036, India 

Abstract. Many engineering components are subjected to multiaxial 
fatigue loading, and the fatigue life relation based on uniaxial testing 
needs modification before it can be used for multiaxial conditions. In this 
paper a brief review of the different approaches based on stress and strain is 
presented for the correlation of bulk parameters with the fatigue life, and their 
merits and limitations are discussed. 

Keywords. Multiaxial fatigue; stress-strain parameters; in-phase out-of-phase 
fatigue; high temperature fatigue; life assessment. 


1. Introduction 

Multiaxial fatigue is a major consideration in the design of many structural 
components such as automobile parts, gas turbine components, power reactors etc. 
The parts are subjected to varying loads of different amplitudes and frequencies in 
two or three directions depending on the thickness. This gives rise to biaxial or 
multiaxial fatigue, and the assessment of fatigue life is an important design requirement 
for reliable operation and safety. 

Most fatigue data are obtained from tests performed under uniaxial 
loading conditions. However, engineering components, in general, are subjected to 
complex loading involving bending, torsion and axial forces. The design of these 
components is carried out based on fatigue properties under uniaxial loading conditions, 
in the form of the endurance limit or the low cycle strain range-fatigue life 
relation. However, these data can serve only as input information for the life 
prediction of components subjected to biaxial or multiaxial fatigue. The safe life 
assessment has to be carried out through suitable methods based on the deformation 
mechanisms that will be operative during the fatigue process. An excellent review of the 
state-of-the-art up to 1980 has been presented by Garud (1981). Many symposium 
proceedings have also been brought out in the last decade highlighting the developments 
made in the area of multiaxial fatigue (Miller & Brown 1985; Brown & Miller 
1989; Kussmaul et al 1991). 


A list of symbols is given at the end of the paper 

Figure 1. An element subjected to triaxial fatigue stressing. 


2. Life prediction models 

2.1 State of stress and strain 

Normally components are designed based on high cycle fatigue data, where the stress 
levels are below the yield strength of the material and general yielding of the material is 
not present. The design approach is based on the state of stress acting on the 
component. A typical element subjected to triaxial fatigue loading is shown in figure 1 
and Mohr’s circle for the state of stress is shown in figure 2. Three principal stresses in 
general, $\sigma_1$, $\sigma_2$ and $\sigma_3$, are considered for evaluating the equivalent stress that is taken 
for life calculations. The stress that governs the deformation could be the maximum 
principal stress, $\sigma_1$, or the maximum shear stress, $\tau_{max}$, given by the relation 

$\tau_{max} = (\sigma_1 - \sigma_3)/2$. (1) 



Figure 2. Mohr’s circle for triaxial state of stress. 

Octahedral shear stress, $\tau_{oct}$, is also considered as a controlling parameter for deformation 
and is given by 

$\tau_{oct} = (1/3)[(\sigma_1 - \sigma_2)^2 + (\sigma_2 - \sigma_3)^2 + (\sigma_3 - \sigma_1)^2]^{1/2}$. (2) 

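The corresponding strains, i.e., the maximum shear strain and the octahedral shear strain, 
are given by 

$\gamma_{max} = \varepsilon_1 - \varepsilon_3$, (3) 

and 

$\gamma_{oct} = (2/3)[(\varepsilon_1 - \varepsilon_2)^2 + (\varepsilon_2 - \varepsilon_3)^2 + (\varepsilon_3 - \varepsilon_1)^2]^{1/2}$. (4) 

In terms of equivalent normal stress and strain we have 

$\tau_{oct} = (2^{1/2}/3)\,\bar{\sigma}$, (5) 

and 

$\gamma_{oct} = 2^{1/2}\,\bar{\varepsilon}$. (6) 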


In the following, stress- and strain-based approaches for multiaxial fatigue are 
considered for the calculation of the equivalent parameters. 
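
As a small illustration of the stress measures defined in equations (1), (2) and (5), the sketch below evaluates them for a given set of principal stresses. The function names are hypothetical; the uniaxial check at the end relies only on the fact that the equivalent stress must reduce to the applied stress in simple tension.

```python
import math


def max_shear_stress(s1, s2, s3):
    """Equation (1): half the difference between the largest and smallest principal stress."""
    ordered = sorted((s1, s2, s3), reverse=True)
    return (ordered[0] - ordered[2]) / 2.0


def octahedral_shear_stress(s1, s2, s3):
    """Equation (2): octahedral shear stress from the three principal stresses."""
    return math.sqrt((s1 - s2) ** 2 + (s2 - s3) ** 2 + (s3 - s1) ** 2) / 3.0


def equivalent_stress(s1, s2, s3):
    """Equation (5) inverted: equivalent normal stress = 3 * tau_oct / sqrt(2)."""
    return 3.0 * octahedral_shear_stress(s1, s2, s3) / math.sqrt(2.0)


# Uniaxial tension of 300 MPa: the equivalent stress should equal the applied stress.
print(max_shear_stress(300.0, 0.0, 0.0))
print(equivalent_stress(300.0, 0.0, 0.0))
```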


2.2 Stress based approach 

Consider a machine member subjected to a triaxial pulsating load along the three 
principal axes. The stresses vary from a minimum to a maximum with a mean value. 
For simplicity it is assumed that the stresses are in-phase and the principal axes are 
fixed. Under such a condition, Sines (1955) proposed a relation in the form 

$[(\sigma_{a1} - \sigma_{a2})^2 + (\sigma_{a2} - \sigma_{a3})^2 + (\sigma_{a3} - \sigma_{a1})^2]^{1/2} + A[\sigma_{m1} + \sigma_{m2} + \sigma_{m3}] = 2^{1/2}\sigma_n$, (7) 

where $\sigma_{ai}$ is the alternating principal stress, $\sigma_{mi}$ the mean stress, $A$ a constant, and $\sigma_n$ the 
uniaxial fully reversed fatigue stress that is expected to give the same life on smooth 
specimens as the multiaxial state. 

For biaxial loading the above relation reduces to 

$[\sigma_{a1}^2 - \sigma_{a1}\sigma_{a2} + \sigma_{a2}^2]^{1/2} + A(\sigma_{m1} + \sigma_{m2})/2^{1/2} = \sigma_n$. (8) 

This relation gives a family of ellipses with their centres decided by the mean stresses 
$\sigma_{m1}$ and $\sigma_{m2}$, as shown in figure 3. Thus, lines of constant octahedral shear stress are 
ellipses in such a plane. The different ellipses give the areas of safe alternating stresses 
on a stress plane for biaxial stresses. For any biaxial in-phase alternating stress that 
corresponds to the fatigue criterion, successive instantaneous states of stress are 
represented by points on a diameter of such an ellipse. The centre of the ellipse is the 
mean stress. Ellipses that are located on a line of constant octahedral normal mean 
stress, 45° downward to the right, are of equal size. Increasing the mean stress decreases 
the size of the ellipse. The approach assumes that (i) there is no gross yielding, and (ii) 
the alternating stresses can be represented by principal alternating stresses along fixed 
principal axes. 
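
The Sines relation can be evaluated directly once the constant $A$ is known. The sketch below is illustrative only: it solves equation (7) for $\sigma_n$ and checks that the biaxial form (8) gives the same value when the third principal stress is zero. The value used for $A$ and the stress values are hypothetical.

```python
import math


def sines_equivalent_amplitude(alternating, mean, mean_stress_constant):
    """Equation (7) solved for sigma_n, the equivalent fully reversed uniaxial amplitude."""
    a1, a2, a3 = alternating
    shear_term = math.sqrt((a1 - a2) ** 2 + (a2 - a3) ** 2 + (a3 - a1) ** 2)
    return (shear_term + mean_stress_constant * sum(mean)) / math.sqrt(2.0)


def sines_biaxial_amplitude(a1, a2, m1, m2, mean_stress_constant):
    """Equation (8): the biaxial special case of equation (7)."""
    shear_term = math.sqrt(a1 ** 2 - a1 * a2 + a2 ** 2)
    return shear_term + mean_stress_constant * (m1 + m2) / math.sqrt(2.0)


A = 0.5  # hypothetical value of the material constant A
triaxial_form = sines_equivalent_amplitude((150.0, 60.0, 0.0), (40.0, 20.0, 0.0), A)
biaxial_form = sines_biaxial_amplitude(150.0, 60.0, 40.0, 20.0, A)
print(triaxial_form, biaxial_form)
```

Comparing the returned amplitude with the uniaxial fully reversed fatigue strength for the target life then indicates whether the stress state lies inside the corresponding ellipse.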

Sines (1955) showed a good correlation with experimental results ir 
fatigue. Static torsion has been found to have less effect than static 
alternating fatigue strength when gross yielding is avoided. 

Figure 3. Areas of safe alternating stresses on a biaxial stress-plane. 


Findley (1957, 1959) introduced the influence of the normal stress acting on the 
maximum shear stress plane and gave the fatigue life relation as 

$\tau_a + k\,\sigma_{n,max} = f(N_f)$, (9) 

where $k$ is a constant. Thus the allowable alternating shear stress, $\tau_a$, for a given fatigue 
life decreases with an increase in the maximum normal stress $\sigma_{n,max}$ on the plane of the 
critical alternating shear stress. 

Combining the approaches due to Sines and Findley, an equation for the fatigue 
equivalent stress has been proposed in the form 

$[(\sigma_{a1} - \sigma_{a2})^2 + (\sigma_{a2} - \sigma_{a3})^2 + (\sigma_{a3} - \sigma_{a1})^2]^{0.5} + A[\sigma_{m1} + \sigma_{m2} + \sigma_{m3}] + B[\sigma_{a1} + \sigma_{a2} + \sigma_{a3}] = \ldots$ (10) 

where $\sigma_{ai}$ are the principal alternating stress ranges, and $\sigma_{mi}$ the principal static stresses. 

Kakuno & Kawada (1979) were able to show good correlation with experimental 
results obtained from bending and torsion fatigue testing. The above criterion implies 
that there will be fatigue failure under hydrostatic alternating stresses, i.e., 
$\sigma_{a1} = \sigma_{a2} = \sigma_{a3}$. The constant $B$, which determines the contribution of the hydrostatic 
alternating component, could be positive or negative, indicating detrimental or 
beneficial effects. However, for cases where there is phase lag, the effect is not clearly 
brought out in the equation. 

McDiarmid (1985, 1987, 1991, 1993) introduced a modification in the stress approach 
and gave the modified equation in the form 

$\tau_a\,(1 + \sigma_{n,max}/(2\sigma_{ult})) = F(N_f)$, (11) 

where $\sigma_{ult}$ is the tensile strength. The function $F$ depends on whether the cracking grows 
along the surface (mode III) or grows into the surface (mode II). The first case occurs in 
torsion loading and the second one under biaxial tension loading. The theory has been 
extended to the case of out-of-phase biaxial stresses, and it has been shown that 
out-of-phase stresses produce shorter fatigue lives than equal in-phase stresses. Typical 

Figure 4. Effect of phase lag and different frequency on S-N curve of EN 24 steel. 


experimental results for both out-of-phase and different frequencies are shown in figure 4 
for the material EN 24 steel. The S-N curve for case 1 is for the uniaxial condition. In the biaxial 
condition, when $\sigma_1$ and $\sigma_2$ are both in-phase (case 2), the fatigue strength is reduced 
as compared to uniaxial loading. When both of them are completely out-of-phase 
(case 4), the strength is still lower. Similarly, it can be seen that different frequencies 
for $\sigma_1$ and $\sigma_2$ (cases 6 and 8) also reduce the fatigue strength as compared to the uniaxial 
condition. 

A relation of the type 

$\tau_{max} + k\,\sigma_{n,max} = C_1 N_f^{\,b}$ (12) 

has been used to compute the life. 

For an out-of-phase angle of 90°, 

$\tau + k\sigma_n = \tau_a\cos\omega t + k\sigma_a\sin\omega t$. (13) 

For the maximum value of $(\tau + k\sigma_n)$ we have 

$\tan\omega t = k\sigma_a/\tau_a = k/\lambda_a$, (14) 

where $\lambda_a = \tau_a/\sigma_a$. 

Thus the maximum value of $(\tau + k\sigma_n)$ can be found for different values of $\lambda_a$ ($= \tau_a/\sigma_a$). 

The analysis can also be extended to cases of different frequencies. The experimental 
results on 1045 steel and the applicability of the model are shown in figure 5. In 
out-of-phase loading the damage is likely to be more severe. Though, in the 
out-of-phase cases, the maximum normal stresses do not occur simultaneously with the maximum 
shear stresses in any plane, the analysis used here maximises the critical parameter 
$(\tau + k\sigma_n)$ for each particular loading condition. 
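
The maximisation expressed by equations (13) and (14) can be checked numerically. The sketch below is an illustration with hypothetical amplitudes and a hypothetical value of $k$: it evaluates $(\tau + k\sigma_n)$ at the angle given by equation (14) and compares the result with a brute-force scan over one cycle.

```python
import math


def combined_parameter(tau_a, sigma_a, k, omega_t):
    """Equation (13): tau + k*sigma_n for 90 degree out-of-phase loading."""
    return tau_a * math.cos(omega_t) + k * sigma_a * math.sin(omega_t)


def peak_combined_parameter(tau_a, sigma_a, k):
    """Angle from equation (14), tan(omega_t) = k*sigma_a/tau_a, and the peak value there."""
    omega_t = math.atan2(k * sigma_a, tau_a)
    return omega_t, combined_parameter(tau_a, sigma_a, k, omega_t)


# Hypothetical amplitudes (MPa) and constant k.
tau_a, sigma_a, k = 100.0, 200.0, 0.3
angle, peak = peak_combined_parameter(tau_a, sigma_a, k)

# Brute-force scan over one cycle as an independent check of the analytic peak.
scan_peak = max(
    combined_parameter(tau_a, sigma_a, k, 2.0 * math.pi * i / 3600) for i in range(3600)
)
print(math.degrees(angle), peak, scan_peak)
```

The analytic peak equals $(\tau_a^2 + k^2\sigma_a^2)^{1/2}$, which follows from substituting equation (14) into equation (13).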





Figure 5. Relation between the equivalent stress due to McDiarmid and fatigue 
life of 1045 steel. 


In the stress based approach two criteria are normally used: one based on the maximum 
shear stress amplitude (Tresca), incorporated in the ASME 
Boiler and Pressure Vessel Code Section III, and the other based on the octahedral shear 
stress amplitude (von Mises), incorporated in Code Case N-47-12 of Section III. 

Both these methods compute an equivalent fully reversed uniaxial stress amplitude 
from an applied multiaxial state of stress. Life estimation is made by using the 
relation 


= (15) 

Notches are quite common in components subjected to multiaxial fatigue, such as 
crank shafts, connecting rods, transmission shafts etc., and hence stress concentration 
factors should also be taken into consideration. 

In cases of shafts or machine members subjected to both bending and torsion, the 
amplitude of the maximum shearing stress used as the equivalent stress amplitude in the 
above equation is calculated using the Tresca criterion as 

$\tau_{max} = (\sigma/2)[1 + K^2 + (1 + 2K^2\cos(2\phi) + K^4)^{0.5}]^{1/2}$, (16) 

where $\sigma$ is the notch bend stress amplitude including the stress concentration factor $K_b$ 
in bending, $\tau$ is the notch torsional shearing stress amplitude including the stress 
concentration factor $K_t$ in torsion, $K = 2\tau/\sigma$ and $\phi$ is the phase angle between the applied 
bending and torsional loading. 

In the second approach, based on the octahedral shear stress amplitude, the equivalent 
stress amplitude, $\tau_{eq}$, is given by 

$\tau_{eq} = (\sigma/2)[1 + (3/4)K^2 + (1 + (3/2)K^2\cos(2\phi) + (9/16)K^4)^{0.5}]^{0.5}$. (17) 

The effects of stress concentration in bending and torsion are to be included in both 
the approaches. In some cases the fatigue notch factor $K_f$ could be assumed to be unity. 
However, this assumption is likely to lead to overestimation of fatigue life, especially in 
high strength materials. In high cycle fatigue, out-of-phase loading appears to be more 
damaging than in-phase loading for the same applied load levels. 

2.3 Strain based approach 

Fatigue lives under multiaxial loading may differ significantly from those observed 
under equivalent uniaxial loading conditions due to additional damage and deformation 
mechanisms that are activated under the former condition. Many investigations have 
been conducted over the last two decades on thin-walled tubular specimens to 
characterize the fatigue behaviour of several engineering alloys under combined axial 
and torsional loading conditions. These investigations involved both in-phase and 
out-of-phase strain-controlled axial-torsional fatigue tests. The materials investigated 
included aluminium alloys, low alloy steels, stainless steels, and nickel base superalloys 
(Socie & Shield 1984; Fatemi & Stephens 1989; Jayaraman & Ditmars 1989; Nitta et al 
1989; Sreeramesh & Bonacuse 1993). 

Many life prediction models have been proposed to estimate fatigue life under 
combined cyclic axial-torsional loading conditions (Haverd & Topper 1971; Miller 
& Brown 1984; Fatemi & Kurath 1988; Fatemi & Socie 1988; Socie 1989). Some of 
these models include parameters based on the von Mises equivalent strain range, the 
principal stresses or strains and on a combination of maximum shear strain and the 
normal stress and/or normal strain acting on the maximum shear plane. Most of these 
life prediction models have been verified with room temperature axial-torsional data. 

The local strain based approach has become an important tool for prediction of 
fatigue life. Many attempts have been made to identify the damage parameters capable 
of predicting multiaxial fatigue. The important among them are 

(a) maximum principal strain amplitude; 

(b) maximum shear strain amplitude; 

(c) von Mises effective strain amplitude; 

(d) Brown-Miller approach; 

(e) Lohr-Ellison approach; 

(f) other strain-based approaches; 

(g) energy-based approach. 

2.3a Maximum principal strain approach: In the maximum principal strain approach 
the principal strains $\varepsilon_1$, $\varepsilon_2$, $\varepsilon_3$ are determined by an appropriate transformation 
of the measured or applied strains $\varepsilon_{ij}$. For correlating multiaxial fatigue tests, the range 
of maximum principal strain on the plane that experiences the maximum principal 
strain range is considered the dominant parameter to describe damage. The strain life 
equation is written in the form 

$\Delta\varepsilon_1/2 = A(N_f)^{-\alpha} + B(N_f)^{-\beta}$, (18) 

where $A = \sigma'_f/E$ and $B = \varepsilon'_f$. 

2.3b Maximum shear strain approach: In the maximum shear strain range 
approach the shear strain range $\Delta\gamma_{max}$ is given by 

$\Delta\gamma_{max} = (\varepsilon_1 - \varepsilon_3)_{max}$. (19) 

This can be related to the fatigue life $N_f$ as 

(20) 

In the case of a shaft with a notch subjected to bending and torsion the relation for 
the shear strain range, $\Delta\gamma_{max}$, can be written as 

$\Delta\gamma_{max} = [\Delta\varepsilon_x^2(1 + \nu)^2 + \Delta\gamma_{xy}^2]^{0.5}$ for in-phase loading, (21) 

and 

$\Delta\gamma_{max}$ = greater of $\Delta\gamma_{xy}$ and $\Delta\varepsilon_x(1 + \nu)$ for 90° out-of-phase loading. (22) 

$\Delta\gamma_{max}$ is the range of notch maximum shear strain, $\Delta\varepsilon_x$ is the stabilized notch bending strain 
range, and $\Delta\gamma_{xy}$ is the stabilized notch torsional shear strain range. 

The equivalent strain range is found from 

$\Delta\varepsilon = \Delta\gamma_{max}/(1 + \nu)$. 

The elastic and plastic values of Poisson’s ratio have been taken as 0.29 and 0.5. 
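
The in-phase and 90° out-of-phase cases of equations (21) and (22), together with the conversion to an equivalent strain range, can be sketched as follows. The strain ranges are hypothetical inputs and the function names are illustrative; a single value of Poisson's ratio is assumed to apply for the whole calculation.

```python
import math


def max_shear_strain_range(d_eps_x, d_gamma_xy, nu, out_of_phase_90=False):
    """Equation (21) for in-phase loading, equation (22) for 90 degree out-of-phase loading."""
    axial_term = d_eps_x * (1.0 + nu)
    if out_of_phase_90:
        return max(abs(d_gamma_xy), abs(axial_term))
    return math.hypot(axial_term, d_gamma_xy)


def equivalent_strain_range(d_eps_x, d_gamma_xy, nu, out_of_phase_90=False):
    """Equivalent strain range: maximum shear strain range divided by (1 + nu)."""
    return max_shear_strain_range(d_eps_x, d_gamma_xy, nu, out_of_phase_90) / (1.0 + nu)


# Hypothetical notch strain ranges with the fully plastic Poisson's ratio of 0.5.
print(equivalent_strain_range(0.004, 0.003, 0.5))
print(equivalent_strain_range(0.004, 0.003, 0.5, out_of_phase_90=True))
```

For pure bending the equivalent strain range reduces to the bending strain range itself, which is a convenient sanity check on the normalisation by $(1 + \nu)$.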


2.3c The von Mises equivalent strain range model: The von Mises effective strain 
may be thought of as the root mean square of the maximum principal shear strain 
normalised to axial loading. It is also called the octahedral shear strain. 

This model gives the multiaxial equivalent strain range in the form 

$\Delta\bar{\varepsilon} = [1/(2^{1/2}(1 + \nu))]\,[(\Delta\varepsilon_1 - \Delta\varepsilon_2)^2 + (\Delta\varepsilon_2 - \Delta\varepsilon_3)^2 + (\Delta\varepsilon_3 - \Delta\varepsilon_1)^2]^{1/2}$, (23) 

where 

$\Delta\bar{\varepsilon}$ = von Mises effective strain amplitude. 

In the case of combined bending and torsion the relation can be written as 

$\Delta\varepsilon_1$ or $\Delta\varepsilon_2 = (1/2)\{(\Delta\varepsilon_x + \Delta\varepsilon_y) \pm [(\Delta\varepsilon_x - \Delta\varepsilon_y)^2 + \Delta\gamma_{xy}^2]^{0.5}\}$, (24) 

$\Delta\varepsilon_3 = [-\nu/(1 - \nu)](\Delta\varepsilon_1 + \Delta\varepsilon_2)$, (25) 

where $\Delta\varepsilon_y$ is the circumferential strain range. 

The effective strain is used as an equivalent uniaxial strain amplitude to predict 
fatigue life. The variable Poisson’s ratio, $\nu$, is defined as 

$\nu = (\nu_{el}\varepsilon_{el} + \nu_p\varepsilon_p)/\varepsilon_t$, (26) 

where $\varepsilon_{el}$, $\varepsilon_p$, $\varepsilon_t$ are the elastic, plastic and total strains respectively, and is used in cases of 
proportional loading. 
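
Equations (23) to (26) chain together naturally, as the following sketch shows. It is an illustration rather than a validated implementation: the total strain is assumed to be the sum of the elastic and plastic parts, the elastic and plastic Poisson's ratios are the 0.29 and 0.5 quoted earlier, and the strain values are hypothetical.

```python
import math

NU_ELASTIC = 0.29
NU_PLASTIC = 0.5


def effective_poisson_ratio(elastic_strain, plastic_strain):
    """Equation (26), assuming total strain = elastic strain + plastic strain."""
    total = elastic_strain + plastic_strain
    return (NU_ELASTIC * elastic_strain + NU_PLASTIC * plastic_strain) / total


def principal_strain_ranges(d_eps_x, d_eps_y, d_gamma_xy, nu):
    """Equations (24) and (25): in-plane principal ranges and the through-thickness range."""
    centre = 0.5 * (d_eps_x + d_eps_y)
    radius = 0.5 * math.sqrt((d_eps_x - d_eps_y) ** 2 + d_gamma_xy ** 2)
    e1, e2 = centre + radius, centre - radius
    e3 = -nu / (1.0 - nu) * (e1 + e2)
    return e1, e2, e3


def von_mises_strain_range(e1, e2, e3, nu):
    """Equation (23): von Mises equivalent strain range."""
    root = math.sqrt((e1 - e2) ** 2 + (e2 - e3) ** 2 + (e3 - e1) ** 2)
    return root / (math.sqrt(2.0) * (1.0 + nu))


# Hypothetical uniaxial check: axial range 0.004, half elastic and half plastic.
nu = effective_poisson_ratio(0.002, 0.002)
e1, e2, e3 = principal_strain_ranges(0.004, -nu * 0.004, 0.0, nu)
print(nu, von_mises_strain_range(e1, e2, e3, nu))
```

In the uniaxial case the equivalent strain range returns the applied axial range, which confirms that the $(1 + \nu)$ normalisation in equation (23) is consistent with equation (25).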


2.3d The Brown-Miller approach: Figure 6 shows the Mohr’s circle based on 
maximum principal strain in a triaxial loading situation. In fatigue fracture two strain 
parameters are important, that is, (i) the size of the largest circle created in a cycle, 
$(\varepsilon_1 - \varepsilon_3)$, and (ii) the location of this circle in strain space, i.e., $(\varepsilon_1 + \varepsilon_3)/2$. Thus it can be 
seen that equivalent stress or strain calculation on the basis of the Tresca or von Mises 
criterion may not be sufficient to calculate the fatigue life. 

Brown & Miller (1973) (B-M) postulated that failure under multiaxial fatigue is 
governed by a functional relation between the maximum range of shear strain and the 
range of normal strain acting on the plane where the former occurs. Thus 

$(\varepsilon_1 - \varepsilon_3)/2 = f[(\varepsilon_1 + \varepsilon_3)/2]$. (27) 

Figure 6. Mohr’s circle based on maximum principal strains. 

The approach proposed is for fixed principal strain axes and in-phase straining. The 
function $f$ is not defined and varies with life. 

The B-M approach considers, in addition to the amplitude of the maximum 
shear strain $\gamma_{max}$, the amplitude of the normal strain $\varepsilon_n$ acting perpendicular to the plane 
of maximum shear strain amplitude. Thus we have 

$(\gamma_{max}/g)^i + (\varepsilon_n/h)^i = 1$. (28) 

Contours of constant life on plots of $\gamma_{max}$ versus $\varepsilon_n$, called Lamda planes, are obtained. The 
constants $g$, $h$ and $i$ were found to vary with life. However, it was suggested that a safe 
and simple design criterion could be established with $i = 2$, so that 

$(\gamma_{max}/g)^2 + (\varepsilon_n/h)^2 = 1$. (29) 

A typical Lamda plane and the constant life contours on the $\gamma_{max}$ and $(\varepsilon_1 + \varepsilon_3)/2$ 
axes are shown in figure 7. The figure also shows how stage I and stage II cracks 
grow in relation to the surface. There are two cases, (A) and (B). The stage I growth plane 
is inclined at 45° to the principal direction while stage II is perpendicular. In the case 
(A) type, cracks are contained in the surface layers. This state occurs in torsion 
testing and the growth of these cracks is by mode II along the surface and mode III 
in depth. In the case (B) type, the crack growth situation is more dangerous. The cracks 
grow from the surface into the material. The stage I crack is driven inside by a 
mode II mechanism, but mode I plays an increasing role as the crack enters stage II. 
As can be seen in figure 7, the same life time of 500 cycles occurs when the plastic 
shear strain is 0.325 in a plane strain test and 0.6 in a pure torsion test. $\varepsilon_n$ is zero in 
both cases. This is because in the former case a dangerous case (B) type crack is 
generated, while in the latter case of the torsion test, a case (A) type crack propagates along 
the surface. 

Extending this approach, Kandil et al (1982) showed that the life, $N_f$, can be related 
to the strain criterion in the form 

$\Delta\gamma_{max} + S\,\Delta\varepsilon_n = A_1(\sigma'_f/E)(N_f)^{-\alpha} + A_2\,\varepsilon'_f(N_f)^{-\beta}$, (30) 

where $S$ = material parameter = unity, 

$A_1 = 2(1 + \nu_e) + S(1 - \nu_e) = 3.3$, 

$A_2 = 2(1 + \nu_p) + S(1 - \nu_p) = 3.5$. 

Figure 7. Representation of constant life contours on the Lamda plane. Two types of 
crack growth, (A) along the surface and (B) inside the material, are also shown. 
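
The constants in equation (30) and the way the relation can be inverted for life are sketched below. Several things here are assumptions made only for illustration: the elastic Poisson's ratio of 0.3 is the value that reproduces $A_1 = 3.3$, the fatigue constants ($\sigma'_f$, $E$, $\varepsilon'_f$, $\alpha$, $\beta$) are hypothetical, and the life is found by a simple bisection on a logarithmic scale between assumed bounds of 1 and $10^9$ cycles.

```python
import math


def kbm_constants(nu_elastic=0.3, nu_plastic=0.5, s=1.0):
    """A1 and A2 of equation (30) for given Poisson's ratios and material parameter S."""
    a1 = 2.0 * (1.0 + nu_elastic) + s * (1.0 - nu_elastic)
    a2 = 2.0 * (1.0 + nu_plastic) + s * (1.0 - nu_plastic)
    return a1, a2


def kbm_parameter(cycles, sigma_f, modulus, eps_f, alpha, beta, a1, a2):
    """Right-hand side of equation (30) at a given life."""
    return a1 * (sigma_f / modulus) * cycles ** (-alpha) + a2 * eps_f * cycles ** (-beta)


def solve_life(parameter, sigma_f, modulus, eps_f, alpha, beta, a1, a2, lo=1.0, hi=1e9):
    """Bisection on a log scale; valid because the right-hand side decreases with life."""
    for _ in range(200):
        mid = math.sqrt(lo * hi)
        if kbm_parameter(mid, sigma_f, modulus, eps_f, alpha, beta, a1, a2) > parameter:
            lo = mid  # predicted parameter still too high, so the life is longer
        else:
            hi = mid
    return math.sqrt(lo * hi)


a1, a2 = kbm_constants()
# Hypothetical material constants (MPa for stresses), for illustration only.
material = dict(sigma_f=900.0, modulus=200000.0, eps_f=0.3, alpha=0.1, beta=0.5)
target = kbm_parameter(1.0e4, a1=a1, a2=a2, **material)
print(a1, a2, solve_life(target, a1=a1, a2=a2, **material))
```

The round trip at the end (evaluate the parameter at $10^4$ cycles, then solve for the life) is only a consistency check on the inversion, not a prediction for any real material.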

2.3e The Lohr-Ellison approach: Taking into consideration the shear strain range 
that makes the crack propagate from the surface into the material, Lohr & Ellison 
(1980) proposed a critical plane theory, leading to the formulation of the relation 

$\Delta\gamma^* + k\,\Delta\varepsilon_n^* = C$. (31) 

$\Delta\gamma^*$ is the maximum shear strain acting on a plane which intersects the surface at 45° 
and will control fatigue life in the low cycle regime. $\Delta\varepsilon_n^*$ is the direct strain normal to this 
plane and may have a modifying influence. 

When the minimum principal strain is primarily normal to the surface, $\Delta\gamma^*$ is 
given by 

$\Delta\gamma^*/2 = \Delta(\varepsilon_1 - \varepsilon_3)_{max}/2$. (32) 

If the second principal strain is most nearly normal to the surface, the value is given by 

$\Delta\gamma^*/2 = \Delta(\varepsilon_1 - \varepsilon_2)/2$. (33) 

The normal strain to the plane of shear is given by 

$\Delta\varepsilon_n^*/2 = \Delta[\varepsilon_1 + \varepsilon_3\ (\mathrm{or}\ \varepsilon_2)]/4$, (34) 

where the third (or the second) principal strain is normal to the surface. 

A life relationship based on the parameter can be developed as 

$\Delta\gamma^* + k\,\Delta\varepsilon_n^* = 2.88(\sigma'_f/E)(2N_f)^{-\alpha} + 3.2\,\varepsilon'_f(2N_f)^{-\beta}$, (35) 

where $k = 0.4$. The constants depend on the value of $k$, as in the previous example. 

The above relation proposed by Lohr & Ellison (1980) is similar to the theory 
proposed by Kandil et al (1982), which yields the relation 

$\Delta\gamma_p + S\varepsilon_p = F(N_f)$. (36) 

Thus both theories, with suitable values for the constants $k$ and $S$, should fit the data 
equally well. 

2.3f Other strain based approaches: Walker (1970) proposed an effective strain 
parameter, $\varepsilon_{eff}$, for biaxial fatigue loading as 

$\varepsilon_{eff} = (\Delta\gamma_0)^m(\varepsilon_T)^{1-m}$, (37) 

where 

$\Delta\gamma_0 = (2/3)[\Delta\gamma_1^2 + (\Delta\gamma_1 - \Delta\gamma_2)^2 + \Delta\gamma_2^2]^{1/2}$, (38) 

$\varepsilon_T = \langle\varepsilon_1\rangle + \langle\varepsilon_2\rangle + \langle\varepsilon_3\rangle$. (39) 

$\Delta\gamma_i$ are the principal shearing strain ranges, $\varepsilon_i$ are the principal normal strains that give the 
maximum value of $\varepsilon_T$, and $\langle\varepsilon\rangle = \varepsilon$ if $\varepsilon > 0$ and zero otherwise. 

The $\Delta\gamma_0$ parameter, referred to as the octahedral shearing strain range, is considered 
the main parameter, coupled with the total strain $\varepsilon_T$. 

A total strain criterion in the form 

$\varepsilon_T = (2/3)^{1/2}[\varepsilon_1^2 + \varepsilon_2^2 + \varepsilon_3^2]^{1/2}_{max}$ (40) 

was proposed by Zamrik (1972). 

$\varepsilon_i$ are the three instantaneous principal strains and $\varepsilon_T$ is the maximum value of the 
right-hand side. This approach is similar to the von Mises criterion. The total strain $\varepsilon_T$ was 
related to $N_f$ in the form 

$\varepsilon_T = A(N_f)^{-\alpha}$, (41) 

and was found to give good correlation for both in-phase and out-of-phase combined 
torsion and push-pull straining of tubular specimens. 

2.3g Energy based approach: The plastic work approach uses uniaxial stress-strain data 
to compute the hysteresis energy per cycle resulting from bending and torsional 
straining, in-phase and out-of-phase. The plastic work per cycle is defined for this 
loading as 

$W_p = \oint(\sigma_x\,d\varepsilon_x^p + \tau_{xy}\,d\gamma_{xy}^p)$, (42) 

where $\sigma_x$ and $\tau_{xy}$ are the axial and torsional components of the stress tensor, and $d\varepsilon_x^p$ and 
$d\gamma_{xy}^p$ are the corresponding infinitesimal plastic strain increments. It is assumed that 

$W_p = A N_f^{\alpha}$, (43) 

where the constants $A$ and $\alpha$ are fitted from uniaxial strain life data. 
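
The integral in equation (42) can be approximated from sampled hysteresis loops, as in the sketch below. It is illustrative only: each loop is assumed to be given as an ordered, closed sequence of (strain, stress) points, the trapezoidal rule is used for the loop integral, and the rectangular loops and the constants $A$ and $\alpha$ are hypothetical. Over a closed cycle the elastic work cancels, so integrating against total strain gives the same loop area as integrating against plastic strain.

```python
def loop_work(strains, stresses):
    """Closed-loop integral of stress d(strain) by the trapezoidal rule."""
    work = 0.0
    n = len(strains)
    for i in range(n):
        j = (i + 1) % n  # wrap around to close the loop
        work += 0.5 * (stresses[i] + stresses[j]) * (strains[j] - strains[i])
    return work


def plastic_work_per_cycle(axial_loop, torsional_loop):
    """Equation (42): sum of the axial and torsional hysteresis loop areas."""
    return loop_work(*axial_loop) + loop_work(*torsional_loop)


def cycles_to_failure(plastic_work, coefficient, exponent):
    """Equation (43) inverted for N_f, given fitted constants A and alpha."""
    return (plastic_work / coefficient) ** (1.0 / exponent)


# Hypothetical idealised rectangular loops (strain list, stress list in MPa),
# traversed so that strain increases along the high-stress branch.
axial = ([-0.002, -0.002, 0.002, 0.002], [-300.0, 300.0, 300.0, -300.0])
torsional = ([-0.003, -0.003, 0.003, 0.003], [-150.0, 150.0, 150.0, -150.0])
work = plastic_work_per_cycle(axial, torsional)
print(work, cycles_to_failure(work, 100.0, -0.5))
```

With stresses in MPa and dimensionless strains the work comes out in MJ/m³ per cycle; the constants passed to `cycles_to_failure` must be fitted in the same units.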


3. Data analysis and discussion 

Figure 8(a-e) shows the correlation of experimental data with prediction based on (a) 
principal strain theory, (b) von Mises criterion, (c) maximum shear strain theory, (d) 
Kandil, Brown and Miller theory and (e) Lohr and Ellison approach. The material 
tested was 1045 steel and the results are for smooth thin-walled tubular specimens (Fash et al 
1985). It can be seen that the maximum principal strain theory and the von Mises criterion give good 
correlation with evenly distributed scatter. In the case of the maximum shear strain 
approach, the prediction is conservative in the low as well as the high cycle regions. So is the 
case with the Kandil-Brown-Miller approach. In the case of the Lohr-Ellison 
approach the scatter is wide and the torsion results consistently disagree with the 
prediction, which is always non-conservative. Fash et al (1985) also found that the 
aforementioned approaches offer considerably less correlation for notched shaft 
specimens. 

A comparison of the degree of correlation found using a number of failure criteria
as applied to automobile steel is shown in figure 9 (McDiarmid 1993). The octahedral
shear strain approach, the approach due to Brown & Miller (1973) and the modified
shear stress approach of McDiarmid (1993) appear to give fairly good correlation,
within a factor of ± 2. Other approaches give large scatter. The octahedral shear
strain criterion has the disadvantage of not being able to account for the effect of mean
stress or strain. The other two criteria can account for mean stress/strain within their
normal stress/strain terms. Thus the modified shear stress based on the critical plane
criterion of failure, i.e.,

$$\tau_a + k \, \sigma_{n,\max} = \text{constant} \tag{44}$$

could reasonably be accurate in correlating both the in-phase and 90° out-of-phase 
tension-torsion SAE experimental data. 

The SAE experimental programme (Tipton 1989) has shown that the von Mises criterion
provided reasonably good estimates of finite life under in-phase loading. In the case of
out-of-phase loading, non-conservative estimates are obtained. It was also observed by
Tipton that all the methods gave somewhat more conservative predictions for pure 
torsion or loadings with higher ratios of torsion to bending. Further, all prediction 
methods were conservative at long life and somewhat non-conservative at lower cycles, 
particularly for bending or loadings with larger ratios of bending to torsion. 


Figure 9. Degree of correlation (scatter width) based on different parametric approaches.

Multiaxial fatigue behaviour of notched specimens will depend on stress concentration
and stress gradient, as in uniaxial fatigue, and also on the specific type of notch
geometry and its influence on cracking behaviour.

Correlations made by conventional strain based methods (Tresca and von Mises) for 
in-phase tests are fairly sensitive to the Poisson’s ratio assumed (i.e., the degree of 
plasticity). The use of a variable Poisson’s ratio with the von Mises approach eliminates 
this sensitivity. However, the use of a variable Poisson’s ratio is invalid for out-of-phase 
loading. 

The ASME Boiler and Pressure Vessel Elevated Temperature Code Sec III uses the
von Mises correlation for non-proportional loading with Poisson’s ratio as 0.5 (the
fully plastic value). However, predictions made by this approach are generally worse
(more non-conservative) than predictions made by the same procedure with Poisson’s
ratio = 0.29 (the fully elastic value). All out-of-phase predictions using this approach
are non-conservative, regardless of Poisson’s ratio.

The Lohr-Ellison method makes life predictions similar to those of the Brown and 
Miller approach for bending dominated loading. However, for tests where torsional 
loading dominates, predictions are nonconservative. The application of this approach 
to out-of-phase loading is not clearly possible since cracks which are deemed harmless 
by this approach, could grow detrimentally due to rotating principal stresses. 

In the case of out-of-phase biaxial fatigue, constant amplitude stresses of different 
frequencies produce non-constant amplitude shear stress variations (Miller et al 1967). 
For these conditions the normal stress acting on the maximum shear stress plane is also 
of non-constant amplitude. The analysis of McDiarmid (1985) concludes that the 
maximum shear stress criterion using damaging shear stress amplitude appears to be 
a reasonable basis for fatigue life evaluation under out-of-phase biaxial stresses of 
different frequencies. 

In general out-of-phase stresses could be more damaging than in-phase stresses 
of the same magnitude, though opposite trends are also observed in certain materials. 
The total strain approach due to Zamrik (1972) can be applied to out-of-phase loading
by maximizing the function $\varepsilon_T$. However, the total strain expression reduces to
the von Mises criterion when the Poisson’s ratio is 0.5. Kanazawa et al (1977, 1979)
showed that out-of-phase loading in 1% Cr-Mo-V steel is more damaging than 
in-phase loading in the low cycle regime. Further, they observed that both the Tresca 
and von Mises type criteria are not conservative for out-of-phase multiaxial loading. 
They proposed that the shear strain range and the normal strain amplitude on the 
maximum shear plane should govern out-of-phase multiaxial fatigue, but offered no 
functional relationship. 

When $\varepsilon_1$, $\varepsilon_2$ and $\varepsilon_3$ rotate during every cycle in out-of-phase loading, a number of
cracking systems will start operating and endurance will be reduced because of the
large value of $\varepsilon_n$ that can be obtained. However, in some cases, the life time may be
extended due to crack growth on one plane being obstructed by other cracks on other
planes. Taking a simple biaxial loading, $\sigma_1 = \sigma_2$ and $\sigma_3 = 0$, three distinct cracking
systems can operate in the material. Depending on the phase angle $\phi$, one or more cracks
will grow, thereby either increasing or decreasing the endurance life. Figure 10 shows
the effect of phase angle $\phi$ on the endurance cycles in such a simple system of loading
(Miller & Brown 1984).

The plastic work life predictions are comparable to those made by the von Mises
and the Brown and Miller approaches for in-phase and are reasonably good for
out-of-phase loadings. However, the main difficulty with this approach is that when
plastic strains are small, slight variations in their computation can lead to large
differences in predicted life. Further, numerical difficulties may be encountered in
incremental plasticity routines, especially in 90° out-of-phase loading.

Figure 10. Effect of phase angle on the endurance life.

Several new equivalent strain criteria have been proposed in recent times, as other
criteria like Tresca, octahedral shear strain etc., do not give good predictions, even for
in-phase loading. The inadequacy of the octahedral shear strain approach is clearly
shown in the case of EN 15 R steel for in-phase tension-torsion test results as shown in
figure 11a. If $R(\varepsilon)$ is the ratio of $\gamma_{\text{oct}}$ to $\gamma_{\text{oct,torsion}}$, i.e.,

$$R(\varepsilon) = \gamma_{\text{oct}} / \gamma_{\text{oct,torsion}}, \tag{45}$$

it should be equal to unity if the octahedral shear strain criterion is valid. However, the
values of $R(\varepsilon_{pp})$ (push-pull loading) deviate significantly from unity, indicating the
inadequacy of the $\gamma_{\text{oct}}$ criterion.

Realising the above inadequacy of the von Mises criterion, Troshchenko et al (1993)
proposed a modified octahedral shear strain approach. The octahedral shear strain
$\gamma_{\text{oct}}$ is taken in the form

$$\gamma_{\text{oct}} = 0.816 \left[ (4/3)(1 + \nu^*)^2 \varepsilon_a^2 + \gamma_a^2 \right]^{1/2}, \tag{46}$$

for a material subjected to axial and torsional loading.

$\nu^*$ is the Poisson’s ratio corresponding to the achieved level of strains. The modified
parameter $\gamma_{\text{oct(modified)}}$ is a function of the state of strain and $N_f$. Thus

$$\gamma_{\text{oct(modified)}} = \psi(\varepsilon, N_f) \, \gamma_{\text{oct}}. \tag{47}$$

The function $\psi(\varepsilon, N_f)$ can be evaluated for different phase angles. The correlation
between the modified octahedral shear strain and the fatigue life is shown in figure 11b
for EN 15 R steel. The agreement appears to be good for both in-phase and out-of-phase
loading, though separate endurance curves for reversed tension-compression
and for torsion are required for the computation. The approach, which takes into
account the non-proportionality of the deformation process and the dependence of the
equivalent strain parameter on life time, is somewhat complicated and involves
detailed calculation.
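
The octahedral shear strain of equation (46) is straightforward to evaluate. The following is a minimal sketch, assuming that the two strain terms are the axial and shear strain amplitudes and that the factor 0.816 is the rounded value of (2/3)^(1/2); the function and argument names are hypothetical, and the correction factor of equation (47) is simply passed in as a number because its functional form is not given here.

```python
import math


def octahedral_shear_strain(eps_a, gamma_a, nu_star):
    """Octahedral shear strain for combined axial-torsional straining, eq. (46).

    eps_a   : axial strain amplitude (assumed meaning of the axial term)
    gamma_a : engineering shear strain amplitude
    nu_star : Poisson's ratio at the achieved strain level
    """
    inner = (4.0 / 3.0) * (1.0 + nu_star) ** 2 * eps_a ** 2 + gamma_a ** 2
    return math.sqrt(2.0 / 3.0) * math.sqrt(inner)  # sqrt(2/3) is approximately 0.816


def modified_octahedral_shear_strain(eps_a, gamma_a, nu_star, psi):
    """Eq. (47): scale gamma_oct by a correction factor psi supplied by the caller."""
    return psi * octahedral_shear_strain(eps_a, gamma_a, nu_star)
```

In pure torsion the expression reduces to 0.816 times the shear strain amplitude, and in push-pull loading with a Poisson's ratio of 0.5 it reduces to 1.414 times the axial strain amplitude.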

Figure 11. (a) Relation between octahedral shear strain and fatigue life for different
phase angles. (b) Correlation of fatigue life with the modified octahedral shear strain.

Sreeramesh & Bonacuse (1993) have analysed the behaviour of Haynes 188 alloy in in-phase
and out-of-phase axial-torsional fatigue at 760°C. The correlations based on the von
Mises criterion, Fatemi-Socie-Kurath (F-S-K) (1988) and Smith-Watson-Topper
(S-W-T) (1970) approaches are shown in figures 12a, b and c respectively.

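The von Mises equivalent strain range $\Delta\varepsilon_{eq}$ is given by

$$\Delta\varepsilon_{eq} = \frac{\left\{ \left[ (\Delta\varepsilon_x - \Delta\varepsilon_y)^2 + (\Delta\varepsilon_y - \Delta\varepsilon_z)^2 + (\Delta\varepsilon_z - \Delta\varepsilon_x)^2 \right] + (3/2) \left[ \Delta\gamma_{xy}^2 + \Delta\gamma_{yz}^2 + \Delta\gamma_{zx}^2 \right] \right\}^{1/2}}{2^{1/2} (1 + \nu_{\text{eff}})},$$

with

$$\nu_{\text{eff}} = (\Delta\varepsilon_e \nu_e + \Delta\varepsilon_p \nu_p) / \Delta\varepsilon_t. \tag{48}$$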

The von Mises equivalent strain range for in-phase tests has been computed with the 
maximum axial strain as the reference because in an in-phase test the axial and 
engineering shear strains reach their respective maxima simultaneously. However, in 
an out-of-phase test the axial and the engineering shear strains reach their respective 
maxima at separate points in a cycle. Therefore for each out-of-phase axial torsion test 
the equivalent strain range is computed twice: once with the maximum axial strain as 
the reference and a second time with the maximum shear strain as the reference. The 
larger of the two values is used for fatigue life prediction. 
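
The procedure described above can be sketched in a few lines. This is only an illustrative example under stated assumptions: the total strain range in the effective Poisson's ratio is taken as the sum of its elastic and plastic parts, the default elastic and plastic Poisson's ratios are placeholder values, and all function names are hypothetical.

```python
import math


def effective_poisson(d_eps_e, d_eps_p, nu_e=0.3, nu_p=0.5):
    """Effective Poisson's ratio of eq. (48).

    Assumes the total strain range is the sum of the elastic and plastic ranges.
    nu_e and nu_p are placeholder defaults, not values from the cited study.
    """
    d_eps_t = d_eps_e + d_eps_p
    return (d_eps_e * nu_e + d_eps_p * nu_p) / d_eps_t


def von_mises_strain_range(d_ex, d_ey, d_ez, d_gxy, d_gyz, d_gzx, nu_eff):
    """Von Mises equivalent strain range from normal and engineering shear strain ranges."""
    normal = (d_ex - d_ey) ** 2 + (d_ey - d_ez) ** 2 + (d_ez - d_ex) ** 2
    shear = 1.5 * (d_gxy ** 2 + d_gyz ** 2 + d_gzx ** 2)
    return math.sqrt(normal + shear) / (math.sqrt(2.0) * (1.0 + nu_eff))


def out_of_phase_strain_range(range_ref_axial, range_ref_shear):
    """For an out-of-phase test, use the larger of the two reference computations."""
    return max(range_ref_axial, range_ref_shear)
```

A quick consistency check is the uniaxial case: with transverse strain ranges equal to minus the effective Poisson's ratio times the axial range and no shear, the equivalent strain range equals the axial strain range, which is the normalisation that the denominator provides.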

The F-S-K parameter is given as

$$\gamma_{\max} \left[ 1 + k \left( \sigma_n^{\max} / \sigma_y \right) \right] = f(N_f). \tag{49}$$

Figure 12(a-c). Correlation of elevated temperature fatigue life of Haynes 188 alloy with
different parameters.


The F-S-K model has been successfully used to predict room temperature in- and out-of-phase
axial-torsional fatigue lives for materials that exhibit a shear mode of failure.

The S-W-T parameter is given by

$$\sigma_1^{\max} \, \frac{\Delta\varepsilon_1}{2} = \sigma'_f \varepsilon'_f (2N_f)^{b+c} + \frac{\sigma'^2_f}{E} (2N_f)^{2b}. \tag{50}$$

In the S-W-T model the magnitude and the plane of the maximum principal strain
amplitude are determined first for a given cycle of axial-torsional loading. Then $\Delta\varepsilon_1$
is multiplied by the maximum normal stress $\sigma_1$ that occurs on that plane. The constants
in (50) are obtained from uniaxial fatigue properties. The S-W-T parameter is
successful in predicting the room temperature fatigue life under in- and out-of-phase
axial-torsional loading, especially in materials where cracks initiate and propagate
perpendicular to the maximum principal strain direction.
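
Since the life appears on the right-hand side of the S-W-T relation in two power-law terms, it generally has to be found numerically. The sketch below assumes the commonly used form of the relation, with the uniaxial Basquin exponent b and Coffin-Manson exponent c (both negative), and solves for life by bisection on a logarithmic scale. The exponent symbols, function names and search bounds are assumptions made for illustration, not taken from the cited study.

```python
import math


def swt_rhs(n_f, sigma_f, eps_f, b, c, E):
    """Right-hand side of the assumed S-W-T relation at life n_f (cycles)."""
    two_n = 2.0 * n_f  # number of reversals
    return sigma_f * eps_f * two_n ** (b + c) + (sigma_f ** 2 / E) * two_n ** (2.0 * b)


def swt_life(param, sigma_f, eps_f, b, c, E, lo=1.0, hi=1.0e9):
    """Solve swt_rhs(N_f) = param for N_f by bisection in log space.

    Relies on the right-hand side decreasing with life, which holds for negative b and c.
    """
    if not (swt_rhs(hi, sigma_f, eps_f, b, c, E) <= param <= swt_rhs(lo, sigma_f, eps_f, b, c, E)):
        raise ValueError("parameter lies outside the searched life range")
    for _ in range(200):
        mid = math.sqrt(lo * hi)  # geometric midpoint
        if swt_rhs(mid, sigma_f, eps_f, b, c, E) > param:
            lo = mid  # parameter still too high at mid, so the life is longer
        else:
            hi = mid
    return math.sqrt(lo * hi)
```

Here `param` stands for the product of the maximum normal stress and the principal strain amplitude on the critical plane, evaluated beforehand from the measured hysteresis loop.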

The description of the results by the von Mises criterion appears to be the best among
the three approaches considered. The F-S-K parameter is very conservative and
under-predicts, whereas the scatter in the S-W-T approach is large and the prediction
is somewhat non-conservative, up to a factor of 4.

In high temperature multiaxial fatigue, damage accumulation is accelerated by the 
presence of mean component of stress and strain. The effect of time-dependent creep 
should also be taken into account in such cases. Further, ratchetting phenomenon will 
take place which will increase the strain accumulation. Hence the analysis becomes 
more complicated compared to ambient temperature multiaxial fatigue. 


4. Concluding remarks 

Important criteria and parameters for multiaxial fatigue are reviewed in this presentation.
The Tresca and von Mises criteria appear to be inadequate and hence approaches
which take into account both shear stress/strain and normal stress/strain acting on
critical planes as the controlling parameters have been developed. However, these are
also unable to predict the fatigue life under all conditions of loading. Often
the predictions are either conservative or non-conservative and the scatter
exceeds a factor of ± 2.

The stress-based approaches can be applied to high cycle fatigue where gross yielding 
is not encountered. In low cycle fatigue, strain-based criteria will be better suited. 
The energy-based approach shows promise in predicting the life. However, energy is
a scalar quantity, whereas fatigue crack growth is controlled by the stress/strain on certain
critical planes, which is a vector quantity. Further, accurate computation of the plastic work
where the plastic strain range is small will be difficult and will lead to erroneous results.

The stress-strain path in multiaxial loading condition is more important in the 
fatigue failure process (Wu & Yang 1987) and hence the deformation mode under 
multiaxial loading must also be considered for crack nucleation and its subsequent 
growth. 

Thus the methodology of life prediction in multiaxial fatigue is rather involved and,
as their development stands today, the approaches are not
complete in all respects.


List of symbols

A, B, C                                       constants;
E                                             Young’s modulus;
F                                             function;
K                                             ratio shear stress/normal stress;
N                                             number of cycles to failure;
$\nu$                                         Poisson’s ratio;
                                              exponents;
$\gamma$                                      shear strain;
$\Delta$                                      range;
$\Delta\sigma$                                stress range = $2\sigma_a$;
$\varepsilon_1, \varepsilon_2, \varepsilon_3$ principal strains;
$\lambda$                                     biaxial strain ratio, $\gamma/\varepsilon$;
$\sigma_1, \sigma_2, \sigma_3$                principal stresses;
$\sigma_n$                                    normal stress;
$\sigma_{eq}$                                 equivalent stress;
$\sigma_f$                                    fatigue strength coefficient;
$\sigma_m$                                    mean stress;
$\sigma_a$                                    alternating component of the stress;
$\tau$                                        shear stress;
$\phi$                                        phase angle.


Suffixes

1, 2, 3                                       principal directions.
High temperature low cycle fatigue

Abstract. Fatigue at high temperature is a complex phenomenon as it is
influenced by a number of time-dependent processes which become important
at elevated temperatures. These processes include creep, oxidation,
phase instabilities and dynamic strain ageing (DSA) which, acting either independently
or synergistically, influence fatigue behaviour, often lowering the
fatigue life. Current design approaches employ linear summation of fatigue
and creep damage with suitable factors on permissible damage to take care
of uncertainties in interaction between cyclic and time-dependent processes.
It is, therefore, important to develop a deeper understanding of the
processes that occur during high temperature fatigue so that realistic life
predictions can be made.

Results on the high temperature fatigue behaviour of austenitic stainless
steels, ferritic steels and nickel base alloys are presented here. The important
mechanisms of interaction of high temperature time-dependent processes
with fatigue under various conditions are discussed in detail.
Emphasis is placed on cyclic stress response, fatigue life, deformation
substructure and fracture behaviour. This is followed by a review of
important life prediction techniques under combined creep-fatigue loading
conditions. Life prediction techniques considered here include linear damage
summation, strain range partitioning, ductility exhaustion approach,
frequency modified and frequency separation methods, techniques based
on hysteresis energy and damage rate models, and methods based on
crack-cavitation interaction models.

Keywords. Low cycle fatigue; creep-fatigue interaction; environmental 
effects; dynamic strain ageing; life prediction techniques. 


1. Introduction 

Low cycle fatigue (LCF) is an important design consideration in structural integrity
analysis of components operating at high temperatures. Repeated thermal stresses are
generated as a result of temperature gradients which occur on heating and cooling
during start-up, shut-down and thermal transient conditions. LCF resulting from start-ups
and shut-downs occurs under essentially strain-controlled conditions, since the
surface region is constrained by the bulk of the component. Figure 1 shows schematically
the stress and strain cycles to which a boiler or a steam turbine component may be
subjected as a result of rapid start, steady state operation or a sudden shut-down.
Typical temperature-time transients experienced by the surface of thick components in
a fast reactor as a result of reactor trip (down shock) or secondary circuit failure (up
shock) and the resulting stress-strain cycles are shown in figure 2.

Figure 1. Schematic diagram illustrating thermal stresses/strains to which boiler or steam
turbine component may be subjected.




Figure 2. Surface hysteresis loops following (a) a reactor trip transient and (b) a heating
transient after secondary circuit failure in a fast breeder reactor.

To represent the component behaviour in a laboratory test, the thermal strains are
replaced by mechanical strains, introduced and controlled under isothermal conditions,
figure 3. The slow start-up/shut-down cycle is replaced by a symmetrical and
continuous fatigue cycle of equal strain rates in tension and compression with a hold
period at a constant peak strain to simulate the on-load period, i.e. creep-fatigue
interaction (figures 3b-d). Slow-fast and fast-slow strain-time wave forms represent
another category used to evaluate creep-fatigue interaction effects (figures 3e-f). In a
slow-fast cycle, the tension-going strain rate is lower than the compression-going strain rate, while in a
fast-slow cycle, the compression-going strain rate is lower than the tension-going strain rate. Tension hold
alone, compression hold alone, slow-fast and fast-slow wave forms are considered
unbalanced cycles.



Figure 3. Typical waveforms employed in LCF testing. $\Delta\sigma$ = total stress range,
$\Delta\varepsilon_t$ = total strain range and $\sigma_m$ = mean stress. (a) Continuous strain cycling;
(b) tension strain hold; (c) compression strain hold; (d) tension and compression strain hold;
(e) slow-fast cycling; (f) fast-slow cycling.

LCF behaviour of materials at high temperatures and low test frequencies is
influenced by the interaction of time-dependent processes like creep, oxidation, dynamic
strain ageing (DSA) and metallurgical instabilities with cyclic deformation. In
general, fatigue life decreases with increasing temperature and decreasing frequency.
A few exceptions, however, have been reported as in the case of type 304 Stainless Steel
(SS) (Bhanu Sankara Rao 1989), type 316 SS (Kanazawa & Yoshida 1975) and 12% Cr
ferritic steel (Kanazawa 1978). In the case of type 304 SS (Bhanu Sankara Rao 1989), the
observed increase in fatigue life with temperature correlated with the precipitation of
$M_{23}C_6$ carbides during testing. Carbide precipitation along grain boundaries inhibits
grain boundary sliding, thereby preventing grain boundary crack formation. Since
LCF life at elevated temperature is governed by intergranular crack initiation and
propagation, reduced tendency for grain boundary cracking due to precipitation leads
to enhanced life with increasing temperature (Bhanu Sankara Rao 1989).

Elevated temperature fatigue tests conducted in air generally yield lower life than
those carried out in vacuum, suggesting the deleterious effects of oxidation on fatigue
properties (Coffin 1972a; Solomon & Coffin 1973; Cook & Skelton 1979). High
temperature low cycle fatigue studies in ferritic steels such as 2.25 Cr-1 Mo (Sandhya
et al 1989) and 9 Cr-1 Mo steels (Choudhary et al 1991a) indicate that oxidation
enhances intergranular damage and consequently causes reduction in life. The effect of
oxidation was more dominant in tests conducted at low strain amplitudes. Fatigue-oxidation
interaction has also been observed to degrade the life in austenitic stainless
steels (Bhanu Sankara Rao et al 1986a). Creep-fatigue interaction behaviour of type
304 SS has been studied employing hold time tests (Bhanu Sankara Rao 1989). The
results of this study indicate that tensile hold caused more damage to life than either
compression hold or symmetrical (tension + compression) hold. Under tensile hold
conditions, the accumulation of intergranular creep damage in the form of cavities
enhanced the tendency for intergranular mode of fatigue crack propagation leading to
shorter fatigue life. Degradation in life has been observed (Hirakawa et al 1978;
Yamaguchi et al 1978) also under test conditions where creep and oxidation effects are
expected to be minimal. It has been shown that another time-dependent process,
namely, dynamic strain ageing, interacts with fatigue and leads to life reduction.

In this paper, the role of environment, DSA and creep in determining the LCF
behaviour is discussed first. This is followed by a discussion of important
techniques for life prediction under conditions of time-dependent fatigue. Empirical
techniques as well as those with a mechanistic basis are discussed.


2. Environmental effects on fatigue behaviour 

One of the major factors responsible for the degradation of fatigue properties at
elevated temperatures is the interaction of environment (oxygen or air) with the fatigue
process. In general, it is observed that materials exhibit superior fatigue properties in
vacuum or in an inert medium compared to those observed in air (Coffin 1963, 1972,
1973, 1979; Asada & Mitsuhashi 1980; Bressers & Roth 1983; Marshall 1983; Jianting
et al 1984). The influence of environment becomes more pronounced with lowering of
frequency (strain rate), decreasing strain amplitude, and imposition of hold during
fatigue testing. Depending on test frequency, three regimes of interaction of fatigue with
environment could be identified (figure 4): (a) frequency independent regime (where
time-dependent deformation mechanisms are not significant), (b) environment sensitive
regime, and (c) deformation mode and environment interaction regime (Coffin 1977).
At very high frequencies ($\nu > \nu_e$), the fatigue crack is open only for a short while, and
therefore enough time is not available for environmental interaction. Crack growth in
this regime is not dependent on environment. At lower frequencies ($\nu_e > \nu > \nu_m$),
environment influences the crack growth process and a change in the mode of crack
growth from transgranular to intergranular takes place. At still lower frequencies
($\nu < \nu_m$), the fracture mode is entirely intergranular and for a specific plastic strain
range, time to failure remains constant (independent of frequency) (Coffin 1977).

Figure 4. Effect of frequency on fatigue life at elevated temperatures (Coffin 1977).

A few experimental results are considered here to illustrate the role of environment
in determining fatigue life. The effect of frequency and strain range on LCF behaviour
of type 304 SS has been extensively investigated at elevated temperatures in air using
balanced cycles (Bhanu Sankara Rao 1989). Decreasing frequency was found to
degrade fatigue resistance of type 304 SS (figure 5) (Bhanu Sankara Rao 1989; Bhanu
Sankara Rao et al 1991). No creep damage in the form of grain boundary cavities and
cracks was noticed. It was also observed that as the frequency was lowered, crack
initiation changed from slip band induced transgranular to oxidation induced intergranular
mode (figure 6). The role of oxidation in determining fatigue life has been
clearly brought out in comparative tests in air and vacuum environments on A286,
figure 7 (Coffin 1972b). Fatigue life has been found to decrease with decrease in
frequency in air environment while life is independent of frequency in vacuum.



Figure 5. Influence of frequency on fatigue life as a function of grain size in 304 SS.









P Rodriguez and S L Mannan 



Figure 6. Crack initiation modes in type 304 SS LCF tested at 923 K. (a)
Transgranular crack initiation ($\Delta\varepsilon_t$ = 0.66%, $\nu$ = 1.0 Hz) and (b) oxidation assisted
transgranular crack initiation at $\nu$ = 0.001 Hz.


Strain amplitude also has significant influence in determining environmental effects.
During low strain amplitude fatigue tests, material is exposed for longer durations to
air environment at high temperatures. This provides sufficient time for oxidation to
affect the failure process. A two-slope behaviour observed in a 2.25 Cr-1 Mo steel has
been attributed to the enhanced oxidation effects at low strain amplitudes, figure 8
(Sandhya et al 1989).

Enhanced environmental interaction has also been reported in hold time tests. For
2.25 Cr-1 Mo steel, compressive holds have been found to be more damaging than
tensile holds (Teranishi & McEvily 1980; Challenger et al 1981). This has been
attributed to the behaviour of the oxide formed during the hold period (Teranishi
& McEvily 1980). The oxide spalls from the specimen surface after a tension hold,
leaving behind a relatively smooth surface. On the other hand, after a hold in
compression, the oxide cracks, facilitating early crack nucleation in the underlying
metal. This behaviour was later related to the tensile strain in the oxide by Challenger
et al (1981). According to these authors, the loading waveform determines the tensile
strain experienced by the oxide. Since the oxide formed during compression hold is
subjected to a tensile strain equal to the total strain range of the test on a strain reversal,
it is more likely to crack in a direction perpendicular to the applied stress. The
vulnerability of superalloys to compressive dwell periods has also been interpreted in
terms of environmental interaction on similar lines (Manson et al 1984). On the other
hand, Sandhya et al (1989) have found tensile holds to be more damaging in normalized
and tempered 2.25 Cr-1 Mo steel and have interpreted the results in terms of faster
crack growth rates under tensile holds. Accelerated crack growth rates due to oxidation
have been reported by various investigators (Smith et al 1969; Solomon & Coffin 1973;
Frandson et al 1974; James & Knecht 1975; Skelton 1978). Oxidation accelerates
fatigue crack initiation, and generally leads to increased number density of microcracks,
figure 9 (Bressers et al 1983; Reger & Remy 1986). The large reduction in
number of cycles for crack initiation is often associated with a change in the mode
of cracking from transgranular to intergranular. The transition in the mode of
fatigue crack initiation in a gamma-prime strengthened PM Astroloy and in a carbide
and solid solution strengthened alloy 800 H are compared in figure 10 (Bressers
1985). PM Astroloy when tested in vacuum shows a transition from transgranular
to intergranular mode of crack initiation at a temperature much higher compared
to that in air, whereas for alloy 800 H, the change in transition temperature was
very small. The trans- to intergranular transition in the PM Astroloy has been
attributed to environmental effects while in Alloy 800 H, it has been shown to be creep
controlled.

Figure 7. Plastic strain range vs fatigue life (cycles to failure) of A286 in air at
866 K and in vacuum at 293 K and 866 K. Numbers adjacent to test points indicate
frequency in cycles per minute (Coffin 1972b).

Figure 8. Coffin-Manson plots for 2.25 Cr-1 Mo steel at 773 K (abscissa: number of
reversals to failure, $2N_f$).

Figure 9. Oxidation induced frequency effect on the number of cycles spent in initiating
a crack of 0.3 mm in IN 100 at 1273 K (Reger & Remy 1986).

The effect of frequency on fatigue crack propagation rate has been studied by
Solomon & Coffin (1973) for A286 at 866 K, and by Scarlin (1977) on Nimonic 105 at
1023 K and IN738 LC at 1123 K. In the investigation on A286 (Solomon & Coffin
1973), it is found that, in air at very high frequencies, crack propagation is due to
frequency independent fatigue, with transgranular fracture (figure 11). However, as
frequency decreases the environmental effects interact with the fatigue process, thereby
increasing crack growth rate. At still lower frequencies, the influence of other time-dependent
processes such as creep becomes dominant, resulting in intergranular
fracture. The dependence of crack growth on frequency sets in at a lower frequency in
vacuum compared to air.

Figure 10. Transition in the mode of crack initiation of PM Astroloy and of Alloy 800 H as
a function of the applied strain rate and temperature in air and in vacuum (Bressers 1985).

Figure 11. Effect of frequency and environment on low cycle fatigue crack propagation of
A286 at 866 K (Solomon & Coffin 1973).

It must be pointed out that oxidation is not always detrimental to fatigue (Gell 
& Leverant 1973). Oxidation strengthening has been reported in Mar-M200 alloy and 
attributed to increased crack tip radius and blunting of the crack by oxidation. Further, 
corrosion products at the crack tip can cause closure during unloading which decreases 
the effective stress intensity range and thus retard cracking (Floreen & Raj 1985). 

Damage from oxidation is viewed as a synergistic phenomenon when plastic 
deformation or fracture occurs at a free surface or at an exposed crack tip. Brittle 
protective oxide films formed are ruptured exposing fresh nascent material and 
chemical attack of the material then proceeds until newly formed protective films 
intervene. With fatigue, this process of straining, film rupture and attack occurs 
repeatedly, leading to localization of the cyclic strain and an abundance of reaction 
products (McMahon & Coffin 1970; Sidey & Coffin 1979). Various mechanisms have 
been suggested to account for the effect of environment on fatigue life (Cook & Skelton 
1979; Duquette 1979; Ericsson 1979). At temperatures below and test frequencies above 
the range of intergranular cracking, the effect of oxidation on stage I cracking in planar 
slip alloys has been explained on the basis of slip step passivation mechanism reducing 
the degree of slip reversibility and thus enhancing crack nucleation at slip steps (Fujita 
1958). Alternatively, cyclically generated slip bands may concentrate oxygen through 
strain-induced vacancy generation, thus accelerating transition from slip band to 
microcrack by preventing the rewelding of nascent cracks (Thompson et al 1955). It has 
also been suggested that crack nucleation is speeded up as a result of dislocation
accumulation against surface oxide films, leading to cavity formation on the slip band
(Shen et al 1966). The preferential oxidation of second phases which intersect the
surface, such as carbides in nickel base alloys has also been invoked to rationalise fast 
crack initiation (Coffin 1973b; Reuchet & Remy 1983). In the intergranular failure 
regime, faster crack initiation is related to the existence of an easier path for chemical 
reaction. Inward diffusion of oxygen and its interaction with segregated elements or 
carbides present on the grain boundary can produce gaseous species, which may 
facilitate nucleation of cavities (Bricknell & Woodford 1981; Nich & Nix 1981; Dyson 
1982; Taplin et al 1983). Oxygen could promote cavitation by lowering the surface
energy and thereby decreasing the threshold stress for cavity nucleation (Yoo & Trinkaus
1983) or by enabling the reduction in the critical radius of cavity for growth
(Taplin et al 1984).

3. Dynamic strain ageing

DSA arises due to the interactions between solute atoms and mobile dislocations
during plastic deformation. Its characteristics in tensile deformation have been reviewed
in detail by several investigators (Mannan 1981; Rodriguez 1984; Abdel-Azim
Metwally 1986; Venkadesan 1991). Important manifestations of DSA in tensile deformation
include serrations on stress-strain curves, peak/plateau in flow stress variation
with temperature, negative strain rate sensitivity of flow stress, and increased work
hardening rate and decrease in tensile ductility with increasing temperature or decreasing
strain rate. Serrations occur on the stress-strain curve after the material has been
deformed to a critical strain $\varepsilon_c$. The value of $\varepsilon_c$ depends on test temperature, strain rate
and material condition. The characteristics of different types of serrations and the
experimental conditions that produce them have been discussed in detail by Rodriguez
(1984). Various models have been proposed to explain the phenomenon of DSA and
observed characteristics in tensile deformation (Cottrell 1953; Sleeswyk 1958; McCormick
1972; Mulford & Kocks 1979; Van den Beukel & Kocks 1982; Hayes & Hayes
1984; Kubin & Estrin 1990).

3.1 Effect of DSA on LCF 

Even though DSA has been investigated in detail in tensile deformation, its role in
influencing LCF has been examined only to a limited extent (Conway 1968; Bressers
& Verhegghee 1981; Bhanu Sankara Rao 1989; Bhanu Sankara Rao et al 1990, 1991;
Srinivasan et al 1991; Valsan et al 1994); Mannan (1993) has recently reviewed this
subject. The important manifestations of DSA in LCF include serrated flow in the plastic
portion of the stress-strain hysteresis loop (figure 12), rapid strain hardening
(figure 13), an increase in stress response with decreasing strain rate (negative strain
rate sensitivity) or increasing temperature, and a minimum in the plastic strain
component of the cycle at half life (Bhanu Sankara Rao et al 1990, 1991; Srinivasan et al



Figure 12. Stress-strain hysteresis loops exhibiting serrated flow in an LCF test
conducted at 673 K (304 SS; grain size: 700 μm; strain range: 1.2%).
Figure 13. Normalized cyclic hardening curves illustrating the influence of strain rate
at 823 K for type 304 SS (grain size: 310 μm; strain amplitude: ±0.4%).


1991; Valsan et al 1994). Figure 13 shows the influence of strain rate on the normalized
peak stress, $(\Delta\sigma/2)/(\Delta\sigma/2)_1$, for a type 304 SS at 823 K. When the strain rate is decreased
from $1.6 \times 10^{-2}$ to $1.6 \times 10^{-4}$ s$^{-1}$ there is an increase in the cyclic stress response.
Another example of the influence of DSA on cyclic stress response is shown in figure 14,
which presents results on Nimonic PE-16 alloy at 823 K over strain rates varying
from $3 \times 10^{-5}$ to $3 \times 10^{-2}$ s$^{-1}$ (Valsan et al 1994). The negative strain rate
sensitivity of cyclic stress response is clearly seen in this figure.

A comparison of LCF and tensile deformation (Bhanu Sankara Rao et al 1990) 
indicates that serrated flow occurs over a wider temperature range in LCF than in 
tensile deformation. Onset of serrations in LCF occurs at a lower temperature 
compared to that in monotonic deformation. This is attributed to the enhanced 
diffusion of solute atoms with the aid of non-equilibrium vacancies generated during 
fatigue cycling. Reverse straining generates vacancies at a greater rate than unidirectional
deformation (Johnson & Johnson 1965; Essman & Mughrabi 1979; Quesnel
& Tsou 1983; Bhanu Sankara Rao et al 1986b).



Figure 14. Cyclic stress response of Nimonic PE-16 as a function of strain rate at
823 K.
The occurrence of DSA during cyclic deformation influences the dislocation substructure.
In type 310 SS (Kanazawa et al 1988) the substructure changes from cells
at temperatures below 473 K to a planar arrangement at temperatures between
473 and 823 K (the DSA range) and back to a cell/subgrain structure beyond 823 K. In the
DSA regime, the formation of cells/subgrains is retarded because solute atoms lock
the dislocations, thereby restricting cross slip and climb processes. Detailed studies
carried out in the authors' laboratory on 316 LN SS over the temperature range
298-873 K have confirmed the transition from cells to planar slip bands
in the temperature range 573-873 K, where DSA effects are predominant (Srinivasan
et al 1991), figure 15. Further, studies conducted on 304 SS at 823 K over the strain
rate range $1.6 \times 10^{-2}$ to $1.6 \times 10^{-4}$ s$^{-1}$ (Bhanu Sankara Rao et al 1991) showed an
increase in slip planarity with decreasing strain rate. Typical intense planar
slip bands developed in the alloy at $1.6 \times 10^{-4}$ s$^{-1}$ are shown in figure 16. It may be
noticed that there is a uniform distribution of dislocations in the interband region.
It has been suggested that DSA enhances the degree of inhomogeneity of deformation
during LCF by solute locking of slow moving dislocations between slip bands.
Presumably, dislocation velocities inside the slip bands are too high for dynamic
ageing of mobile dislocations to take place and consequently DSA enhances the
partitioning of cyclic strains into separate regions marked by high and low amplitudes
of dislocation movement (Wilson & Tromans 1970; Abdel-Raouf et al 1973; Bhanu
Sankara Rao et al 1989; Valsan et al 1994).

DSA during LCF has been found to localize deformation. Figure 17 shows the
variation of slip band spacing 'i' with strain rate in PE-16 alloy (Valsan et al 1994).
The increase in 'i' at lower strain rates and higher temperatures is attributed to
enhanced DSA effects. Intergranular cracking due to impingement of slip bands has
been observed in many alloys (Tsuzaki et al 1983; Bhanu Sankara Rao et al 1988;
Bhanu Sankara Rao 1989; Valsan et al 1994) under conditions of dynamic strain
ageing.

It must be pointed out that planar slip generally enhances slip reversibility and
hence resistance to both transgranular crack initiation and propagation (Valsan 1991;
Valsan et al 1992). However, if stress concentrations associated with dislocation
pile-ups cause intergranular cracks, or localized deformation in widely spaced slip
bands causes transgranular cleavage cracks, then the development of slip planarity
will reduce fatigue life. The fracture mode has also been found to be influenced by DSA
in LCF. SEM studies on type 304 SS (Bhanu Sankara Rao et al 1990)
revealed that at the two higher strain rates ($1.6 \times 10^{-3}$ and $1.6 \times 10^{-2}$ s$^{-1}$) fatigue
cracks propagated transgranularly by a ductile striation mechanism, and by a mixed
transgranular and intergranular mode at the lowest strain rate ($1.6 \times 10^{-4}$ s$^{-1}$),
figure 18. It should be noted that the occurrence of serrated flow coincided with
the development of intergranular cracking. Also, where negative strain rate sensitivity
existed but no serrated flow occurred, crack propagation remained transgranular.
Intergranular decohesion at $\dot{\varepsilon} = 1.6 \times 10^{-4}$ s$^{-1}$ (figure 18c) was caused by the impingement
of slip bands on grain boundaries. Further, the higher stress response developed
during strain controlled LCF in the DSA range (figures 13 and 14) would lead to a
large stress concentration at the crack tip and increased crack growth rates (Bhanu
Sankara Rao 1989; Valsan et al 1994). Moreover, higher response stresses would act
to reduce the critical crack size for final fracture, thus reducing cyclic life (Bressers
& Verhegghee 1981).
Figure 16. Slip bands and uniform distribution of dislocations in the interband regions
(304 SS, grain size: 310 μm, $\dot{\varepsilon} = 1.6 \times 10^{-4}$ s$^{-1}$).



Figure 17. Variation of average slip band spacing 'i' with strain rate (s$^{-1}$) in
Nimonic PE-16.




Figure 18. Crack propagation modes as a function of strain rate at 823 K in type 304 SS.
(a) Transgranular stage II propagation ($\dot{\varepsilon} = 1.6 \times 10^{-2}$ s$^{-1}$); (b) transgranular
stage II propagation ($\dot{\varepsilon} = 1.6 \times 10^{-3}$ s$^{-1}$); and (c) mixed mode propagation
($\dot{\varepsilon} = 1.6 \times 10^{-4}$ s$^{-1}$).







4. Creep-fatigue interaction 

Creep-fatigue interaction refers to the situation in which the rate of damage accumulation
under complex loading differs from the linear summation of the damage rates produced
by the creep component and the cyclic component. This subject has recently been reviewed by
Rodriguez & Bhanu Sankara Rao (1993). Creep-fatigue interaction studies are generally
carried out by introducing a component of creep into the normal fatigue cycle
(figure 3). This can be done by means of a constant strain tensile (or compressive) dwell
(figures 3b-d) or by reducing the tensile (or compressive) strain rate in the cycle
(figures 3e and f). As the creep component of the cycle is increased by increasing the
tensile dwell period or by reducing the tensile strain rate, the failure mode also changes.
The failure modes observed can be categorized into three distinct regimes:

(1) fatigue dominated, 

(2) fatigue-creep interaction, and 

(3) creep dominated. 

These three types of failure modes are shown in figure 19. Fatigue dominated failure 
arises due to growth of surface fatigue cracks through the specimen, with no evidence of 
interaction with creep damage (figure 19a). During fatigue-creep interaction, creep 
cavitation damage is found within the material in addition to surface fatigue damage. 
Fatigue and creep damages initially develop independently and the likelihood of true 
interaction depends on the balance between them. In tensile hold time tests on 316 SS at 
elevated temperatures, it is found that initial fatigue crack growth rates are similar to 
those occurring under continuous cycling conditions as demonstrated by fatigue 
striation measurements (Wareing 1977). However, the fatigue crack eventually interacts
with creep damage, resulting in accelerated crack growth, a reduction in endurance
and creep-fatigue interaction failure (figure 19b). When such interaction occurs, the
failure path becomes mixed (trans- plus intergranular). Finally, under certain
testing conditions, the creep component of the cycle dominates and intergranular 
fracture due to the accumulation of grain boundary cavitation results (figure 19c); 
under these conditions there is no interaction with fatigue damage. 

To understand creep-fatigue interaction, let us first consider the microstructural characteristics
of creep failure and fatigue failure. Creep failure is caused by cavitation damage




Figure 19. Schematic diagram of failure modes during creep-fatigue interaction testing.
(a) Fatigue dominated, (b) fatigue-creep interaction, and (c) creep dominated.
while fatigue failure occurs by crack damage. The creep-fatigue failure can then be
viewed from two standpoints:

(1) influence of cyclic loading on cavitation damage, 

(2) influence of cavitation on crack initiation and propagation. 

Creep-fatigue failure can be dominated by either cavitation-damage or by crack- 
damage. Conceptual mechanisms for each of these have been proposed by Raj (1985). 
These are illustrated in figures 20 and 21. In the case of cavitation damage, the 
imposition of cyclic loading enhances the rate of damage formation (figure 20). It can be 
seen that the integrated stress-time area is smaller for cyclic loading than it is for 
monotonic loading, yet cavitation damage is greater. Influence of cavitation damage on 
fatigue crack initiation and propagation is illustrated in figure 21. Here cavitation may 
enhance either crack initiation or propagation. Experimental evidence indicates that in 
pure fatigue, only one or two cracks are initiated, whereas, in creep-fatigue many grain 
boundaries at the surface develop cracks, and the largest among them propagates as the 
major crack. In the case of cavitation enhanced crack propagation, cracks can grow by 
lengths much greater than crack tip opening displacement due to the linkage of cavities 
ahead of the crack tip. Tomkins & Wareing (1977), Wareing et al (1980) and Rie et al 
(1988) have analysed the problem of fatigue crack growth in the presence of grain 
boundary cavitation and have suggested that fast crack growth rates occur when the 
following condition is satisfied. 

$$\delta/2 = \lambda - p, \tag{1}$$

where $\delta$ is the crack tip opening displacement, and $p$ and $\lambda$ are the average cavity size and
spacing respectively.
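
As a quick numerical illustration of condition (1), the sketch below compares half the crack tip opening displacement with the ligament between neighbouring cavities, $\lambda - p$. The function names, the reading of the condition as an inequality ($\delta/2 \ge \lambda - p$ taken as "condition met") and the sample values are assumptions made for illustration only; they are not taken from the cited analyses.

```python
def cavity_linkage_margin(ctod, cavity_spacing, cavity_size):
    """Return ctod/2 - (cavity_spacing - cavity_size), all in one length unit.

    A margin >= 0 is read here (an assumption) as the crack tip opening
    spanning the ligament to the next cavity, i.e. condition (1) being met.
    """
    if cavity_size >= cavity_spacing:
        raise ValueError("cavity size must be smaller than cavity spacing")
    return ctod / 2.0 - (cavity_spacing - cavity_size)


def fast_growth_expected(ctod, cavity_spacing, cavity_size):
    """True when half the crack tip opening covers the inter-cavity ligament."""
    return cavity_linkage_margin(ctod, cavity_spacing, cavity_size) >= 0.0


if __name__ == "__main__":
    # Hypothetical values in micrometres, chosen only to exercise the check.
    print(fast_growth_expected(ctod=4.0, cavity_spacing=3.0, cavity_size=1.0))
    print(fast_growth_expected(ctod=2.0, cavity_spacing=3.0, cavity_size=1.0))
```

The margin form makes it easy to see how close a given cavity population is to the linkage condition as cavities grow (larger $p$) or multiply (smaller $\lambda$).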

Extensive creep-fatigue interaction tests conducted on several high temperature 
alloys have shown that addition of tensile dwell periods dramatically reduces life 
compared to continuous cycling (Berling & Conway 1970; Brinkman et al 1972; 
Conway et al 1975; Wareing 1975,1977,1981; Maiya & Majumdar 1977; Schmitt et al 
Figure 20. In a cavitation dominated failure, cyclic loading enhances the rate of damage
accumulation. For example, the integrated stress-time area is smaller for cyclic loading (b)
than it is for monotonic loading (a), yet the damage is greater for cyclic loading.
Figure 21. In crack dominated failure, cavitation can accelerate either crack initiation
or crack propagation. ($N_i$ = cycles for crack initiation; $\Delta a/\delta$ = (crack advance per
cycle)/(crack tip opening displacement)) (Raj 1985).


1979; Brinkman 1985; Meurer et al 1984; Bhanu Sankara Rao et al 1986a, 1993).
Balancing tensile dwell with compressive dwell has been shown to enhance life
compared to dwell in tension alone (figure 22) (Ellison & Patterson 1976). This has been
attributed to reduced intergranular creep damage due to the introduction of the compression
dwell. On the other hand, the endurance of 9Cr-1Mo steel has not been found to be
influenced by the introduction of tensile dwell (Wood et al 1979; Choudhary et al
1991a). This material is characterized by a high creep ductility, and no evidence of
intergranular cracking is found in the tensile dwell period tests. These results suggest
that creep ductility is a very important parameter in influencing endurance in hold time
tests. Figure 23 shows data for several ferritic steels (Miller et al 1984). It is evident that
the lower the ductility, the lower the fatigue-creep endurance. In addition, it is clear that
long dwell periods, small strain ranges and low ductility favour creep dominated
failures, whereas intermediate strain ranges, short dwell periods and high creep
ductility favour fatigue-creep interaction failures.

Extensive tests on austenitic alloys have revealed that influence of hold time on 
fatigue lives depends not only on ramp rate, position and length of the hold, but also on 
microstructure (Maiya & Majumdar 1977; Wareing 1981; Bhanu Sankara Rao 1989; 
Bhanu Sankara Rao et al 1993). Figure 24 illustrates the combined influence of tensile 
strain holds, ramp strain rate and temperature on the endurance of 304 SS (Schmitt 
et al 1979). There is a reduction in life as dwell period is increased at elevated 
temperature; however at temperatures below 823 K, hold period does not seem to have 
any significant influence. Apart from the accumulation of damage during the hold 
period, a substantial part of the damage is generated during loading and unloading 
periods as well (figure 24). 
Figure 22. Plot of total strain range vs. cycles to failure for 1CrMoV steel at 838 K
(Ellison & Patterson 1976).


The creep-fatigue interaction behaviour of 304 SS base metal and 308 weld metal has
recently been investigated by Bhanu Sankara Rao et al (1993). These tests were
conducted at 923 K employing a total strain range of 1.0% and a hold period of 1 minute
in tension, in compression, and in both tension and compression. Fatigue life of weld metal
Figure 24. The influence of tensile hold duration on fatigue life of type 304 SS
(a) for different temperatures, (b) for different strain rates (Schmitt et al 1979).


was inferior to base metal (Brinkman et al 1974; Bhanu Sankara Rao et al 1993). Effect 
of hold period on life was dependent not only on the material condition and duration of 
hold, but also on the position of the hold in the cycle. 304 SS showed a severe
degradation in life with a tension hold compared with continuous cycling. A compression
hold lowered life marginally, while for holds in both tension and compression, life
values lay between those for hold in tension alone and compression alone.
Mean stress was not found to be the reason for the observed differences in the 
behaviour. Fractographic investigations revealed bulk creep damage in tests with 
tension hold periods (figure 25a) while fracture modes associated with compression holds 
and continuous cycling were characterized by ductile striations (figures 25b and c). 

In nickel base alloys, compressive strain holds have been found to be more 
detrimental than tensile hold (Wells & Sullivan 1969; Lord & Coffin 1973; Ostergren 
1976a; Antunes & Hancock 1978). Development of tensile mean stress (Plumbridge 
& Ellison 1987), differences in the shapes of cavities formed during tension and 
compression (Wells & Sullivan 1969) and load shedding effects due to multiplicity of 
cracks in tensile dwell (Nazmy 1983a, 1983b) have been suggested as possible reasons 
for observed damaging effects of compression hold. 

Creep-fatigue interaction has also been investigated employing slow-fast and 
fast-slow types of waveforms (figures 3e and f). Generally maximum fatigue endurance 
has been observed in fast-fast cycling and minimum in slow-fast (Majumdar & Maiya 
1979; Bhanu Sankara Rao et al 1986c, 1988; Kempf et al 1987). Factors which
contribute to the observed differences in the failure modes and fatigue lives in these tests 
have been recently discussed by Rodriguez & Bhanu Sankara Rao (1993). Development 
Figure 25. Crack propagation in 1 min hold time tests on 304 SS at 923 K. (a) Tension
hold: intergranular propagation; (b) compression hold: transgranular propagation; and
(c) tension plus compression hold: mixed mode propagation.


of cavitation damage due to slow strain rates is considered to be responsible for reduced 
fatigue lives. 

Actual components in high temperature plants are subjected to complex cyclic 
loading situations where creep-fatigue interaction plays a dominant role. An in-depth 
knowledge of high temperature fatigue behaviour is necessary for design as well as life 
assessment and remnant life estimation of structural components. Various life prediction
methods applicable to high temperature fatigue conditions have been proposed and
these are reviewed in the following section.

5. Life prediction methods for high temperature low cycle fatigue

Several life prediction methods have been developed to assess fatigue life of the 
components under time-dependent fatigue conditions, and these include: 

(1) prediction based on monotonic tensile properties; 

(2) damage summation method; 

(3) frequency modified, and frequency separation methods; 

(4) strain range and strain energy partitioning methods; 

(5) hysteresis energy method; 

(6) ductility exhaustion approach; 

(7) damage rate model; 

(8) Tomkins and Wareing’s model; and 

(9) Cavitation model by Rie et al (1988). 

Detailed reviews of most of the models mentioned above are available in the 
literature (Lloyd & Wareing 1981; Miller et al 1984; Rodriguez et al 1989; Viswanathan
1989; Rodriguez 1993; Rodriguez & Bhanu Sankara Rao 1993). Marriott (1992) has 
recently reviewed the current trends in high temperature design rules covering creep 
and fatigue conditions. In the following section, the salient features of various life 
prediction methods are discussed. 

5.1 Method of universal slopes and 10% rule

This method has been formulated to predict fatigue life from strength and ductility
parameters derived from short time tension tests. The total strain amplitude is expressed as the
sum of the elastic strain amplitude, $\Delta\varepsilon_e/2$ (Basquin 1910), and the plastic strain
amplitude, $\Delta\varepsilon_p/2$ (Coffin 1954; Manson 1954), given by:

$$\Delta\varepsilon_e/2 = (\sigma'_f/E)(2N_f)^{-\gamma}, \tag{2}$$

$$\Delta\varepsilon_p/2 = \varepsilon'_f (2N_f)^{-\beta}, \tag{3}$$

where $\sigma'_f$ = fatigue strength coefficient, $E$ = elastic modulus, $\gamma$ = fatigue strength exponent,
$\varepsilon'_f$ = fatigue ductility coefficient, $\beta$ = fatigue ductility exponent and $N_f$ = number
of cycles to failure. Based on the results on a variety of materials, Manson (1965) has
suggested that the exponents and coefficients of these relationships can be universalised
and proposed a generalized form for the strain-life relationship (universal slopes
equation),

$$\Delta\varepsilon_t = 3.5(\sigma_u/E)N_f^{-0.12} + D^{0.6}N_f^{-0.6}, \tag{4}$$

where $\sigma_u$ = ultimate tensile strength, $D$ = ductility = ln(100/(100 − %RA)), RA = reduction
in area and $N_f$ = number of cycles to failure.

The deterioration in fatigue life at high temperatures, aided by intergranular crack
initiation and propagation, has been accounted for empirically in the 10% rule (Manson &
Halford 1968), which takes the life as 10% of that obtained from the universal slopes equation.
Further, a creep modified 10% rule has been proposed (Manson & Halford 1967) to
evaluate fatigue life in situations where creep effects decrease fatigue life
by more than a factor of 10.

Universal slopes method and its modifications are useful in the early stages of design 
when fatigue properties of a material of interest may not have been measured yet. 
However, these methods cannot adequately account for the deterioration in fatigue life 
by creep and other time-dependent processes at high temperatures. 
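
The universal slopes equation gives strain range as a function of life, so a life estimate for a given strain range requires a numerical inversion. The following is a minimal sketch under stated assumptions: the function names, the bisection bracket (1 to $10^9$ cycles) and the material values used in the usage lines are hypothetical, and the 10% rule is applied simply as one tenth of the universal slopes life.

```python
import math


def tensile_ductility(ra_percent):
    """D = ln(100 / (100 - %RA)) from the reduction in area."""
    return math.log(100.0 / (100.0 - ra_percent))


def universal_slopes_strain_range(n_f, sigma_u, e_modulus, ra_percent):
    """Total strain range from the universal slopes equation at life n_f."""
    d = tensile_ductility(ra_percent)
    elastic = 3.5 * (sigma_u / e_modulus) * n_f ** -0.12
    plastic = d ** 0.6 * n_f ** -0.6
    return elastic + plastic


def universal_slopes_life(strain_range, sigma_u, e_modulus, ra_percent,
                          n_min=1.0, n_max=1.0e9):
    """Invert the equation for n_f by bisection on a logarithmic scale."""
    def f(n):
        return universal_slopes_strain_range(n, sigma_u, e_modulus, ra_percent)

    if not f(n_max) <= strain_range <= f(n_min):
        raise ValueError("strain range outside the bracketed life interval")
    lo, hi = n_min, n_max
    for _ in range(200):
        mid = math.sqrt(lo * hi)
        if f(mid) > strain_range:  # strain range falls as life increases
            lo = mid
        else:
            hi = mid
    return math.sqrt(lo * hi)


def ten_percent_rule_life(strain_range, sigma_u, e_modulus, ra_percent):
    """High temperature estimate: 10% of the universal slopes life."""
    return 0.1 * universal_slopes_life(strain_range, sigma_u, e_modulus,
                                       ra_percent)


if __name__ == "__main__":
    # Hypothetical tensile data: sigma_u and E in MPa, RA in percent.
    props = dict(sigma_u=600.0, e_modulus=160.0e3, ra_percent=60.0)
    print(universal_slopes_life(0.01, **props))
    print(ten_percent_rule_life(0.01, **props))
```

Bisection is used here because the right-hand side decreases monotonically with life, which guarantees a single root inside the bracket.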

5.2 Linear damage summation - time and cycle fraction rule 

This approach involves the linear summation of time and cycle fractions, wherein time 
fractions are used as a measure of creep damage (Robinson 1952) and cycle fractions are 
used as a measure of fatigue damage (Miner 1945). Failure in a creep-fatigue situation is 
expressed as, 

$$n/N_f + t/t_r = 1, \tag{5}$$

where $n$ = number of cycles at a strain range, $N_f$ = pure fatigue endurance at the same
strain range, $t$ = time at a given stress and $t_r$ = time to rupture at that stress under pure
monotonic loading.

The ASME design criterion (1992) based on this summation rule proposes a limit for
the allowable damage $D$. Further, $N_f$ and $t_r$ are replaced by $N_d$ and $T_d$ respectively:

$$\sum_{j=1} (n/N_d)_j + \sum_{k=1} (t/T_d)_k \le D, \tag{6}$$

where $D$ = total allowable creep-fatigue damage value, $n$ = number of cycles of loading
condition $j$, $N_d$ = number of design-allowable cycles of loading condition $j$, $t$ = time
duration of load condition $k$ and $T_d$ = allowable time at a given stress intensity. The
values of $D$ differ from material to material: $D$ is unity for Alloy 800H but is in the
range 0.6 to 1.0 for types 304 and 316 stainless steels (ASME Code Case N47 1992).
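
The bookkeeping behind the time and cycle fraction rule can be sketched in a few lines. The example below is illustrative only: the function names and input layout are assumptions, and the numbers in the usage lines are hypothetical rather than design-allowable values from any code. The last helper applies the linear rule to a hold time test by assuming each cycle contributes one hold of fixed duration at a constant stress.

```python
def creep_fatigue_damage(cycle_blocks, hold_blocks):
    """Sum cycle fractions n/N_d and time fractions t/T_d.

    cycle_blocks: list of (n, N_d) pairs, one per cyclic loading condition.
    hold_blocks:  list of (t, T_d) pairs, one per stress/temperature condition.
    Returns (fatigue damage, creep damage, total damage).
    """
    fatigue = sum(n / n_d for n, n_d in cycle_blocks)
    creep = sum(t / t_d for t, t_d in hold_blocks)
    return fatigue, creep, fatigue + creep


def is_acceptable(cycle_blocks, hold_blocks, allowable_damage=1.0):
    """Check the summed damage against the allowable value D."""
    _, _, total = creep_fatigue_damage(cycle_blocks, hold_blocks)
    return total <= allowable_damage


def cycles_to_failure_with_hold(n_f, hold_time, t_r):
    """Solve n/N_f + n*hold_time/t_r = 1 for n (one hold per cycle, assumed)."""
    return 1.0 / (1.0 / n_f + hold_time / t_r)


if __name__ == "__main__":
    # Hypothetical duty: 100 cycles of one type, 50 h at one stress level.
    print(creep_fatigue_damage([(100, 1000)], [(50.0, 500.0)]))
    print(is_acceptable([(100, 1000)], [(50.0, 500.0)], allowable_damage=0.6))
    print(cycles_to_failure_with_hold(n_f=1000, hold_time=0.5, t_r=500.0))
```

Returning the fatigue and creep fractions separately allows each test or duty cycle to be plotted as a point on a creep-fatigue interaction diagram.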

A creep-fatigue interaction diagram based on the linear damage summation approach,
developed in our laboratory for 9Cr-1Mo steel thick section forging (Choudhary et al
1991b), is provided in figure 26a. This figure also incorporates thin section data from
Wood et al (1979). Figure 26b gives the creep-fatigue interaction diagram for 2.25Cr-1Mo
steel (Valsan et al 1988) based on the data available in the literature (Booker & Majumdar 1982;
Swindeman & Strizak 1982). It may be seen that for 9Cr-1Mo steel the data points lie
above the $D = 1$ line, while for 2.25Cr-1Mo steel damage values less than unity are
needed to encompass the test data.

The damage summation method is extensively employed in design codes in view of
its simplicity, as it requires only S-N curves and stress rupture curves for life prediction.
However, this approach has the following limitations:

(1) The value of damage D is specific for each material. 

(2) Tensile and compressive hold periods are considered equally damaging. In practice, 
compression holds are found to have a healing effect in austenitic steel (Bhanu 
Sankara Rao et al 1993, 1994) but in nickel-base superalloys they are more 
detrimental than tension holds (Wells & Sullivan 1969; Antunes & Hancock 1978). 

(3) The strain rate dependence of creep damage accumulation is not incorporated in 
this model. 

(4) Stress rupture properties are determined from virgin material, whereas materials
cyclically harden/soften depending on the alloy composition, metallurgical and
testing conditions.

(5) This method assumes load path independence for fatigue and creep damage.

5.3 Frequency modified and frequency separation methods

This approach is essentially a modification of the Coffin-Manson relationship for pure
fatigue (Coffin 1969, 1971, 1976, 1977). The time parameter is introduced to account for
the environmental and other time-dependent factors, by incorporating a frequency
term as follows:

$$\Delta\varepsilon_p = A_1 (N_f \nu^{k-1})^{-\beta}, \tag{7}$$

where $A_1$, $k$ and $\beta$ are temperature-dependent constants, $k$ is a material parameter
derived from the Eckel relationship (1951)

$$\nu^k t_f = \text{constant} = f(\Delta\varepsilon_p), \tag{8}$$

and $N_f \nu^{k-1}$ is the frequency modified life. Similarly, the Basquin relationship is modified
and summed to provide a relationship for the total strain range,

$$\Delta\varepsilon_t = A_1 (N_f \nu^{k-1})^{-\beta} + A_2 N_f^{-\beta'} \nu^{k'_1}, \tag{9}$$

where $A_2$, $\beta'$ and $k'_1$ are material and temperature-dependent constants. For treating
hold time effects, the frequency is calculated from $\nu = 1/(\tau + t_h)$, where $\tau$ is the time for
strain reversal and $t_h$ is the hold time in each cycle.
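
To make the role of the frequency term concrete, the frequency modified relation for the plastic strain range, $\Delta\varepsilon_p = A_1 (N_f \nu^{k-1})^{-\beta}$, can be solved for $N_f$. The sketch below does this and evaluates the frequency from the reversal and hold times. The function names and the constants used in the usage lines are hypothetical and chosen only for illustration; they are not fitted values for any alloy.

```python
def cycle_frequency(reversal_time, hold_time=0.0):
    """nu = 1 / (tau + t_h), with both times in the same unit."""
    return 1.0 / (reversal_time + hold_time)


def frequency_modified_life(plastic_strain_range, frequency, a1, beta, k):
    """Solve d_eps_p = A1 * (N_f * nu**(k - 1))**(-beta) for N_f."""
    return (a1 / plastic_strain_range) ** (1.0 / beta) * frequency ** (1.0 - k)


if __name__ == "__main__":
    # Hypothetical constants; times in seconds, frequency in Hz.
    a1, beta, k = 0.5, 0.6, 0.8
    nu_continuous = cycle_frequency(reversal_time=10.0)
    nu_with_hold = cycle_frequency(reversal_time=10.0, hold_time=90.0)
    print(frequency_modified_life(0.01, nu_continuous, a1, beta, k))
    print(frequency_modified_life(0.01, nu_with_hold, a1, beta, k))
```

For $k < 1$ the predicted life falls as the hold time lengthens (lower $\nu$), while $k = 1$ recovers the frequency independent Coffin-Manson form.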

The concept of total cycle time does not adequately predict the fatigue behaviour for
complex wave shapes. The effects of wave shape on damage are associated with tension
going time and hysteresis loop imbalance. The frequency separation equation can be
written as:

$$N_f = (A_1/\Delta\varepsilon'_p)^{1/\beta} (\nu_t/2)^{1-k}, \tag{10}$$

where $\Delta\varepsilon'_p$ represents the loop imbalance (Coffin 1977) and the term $(\nu_t/2)^{1-k}$ accounts
for the time spent in the tension part of the cycle. $\Delta\varepsilon'_p$ is the equivalent plastic strain
defined as:

$$\Delta\varepsilon'_p = \Delta\varepsilon_p [((\nu_c/\nu_t)^{k_1} + 1)/2]^{\beta/\beta'}, \tag{11}$$

$$k_1 = k'_1 + \beta'(k - 1), \tag{12}$$

where $\nu_t$ = tension-going frequency, $\nu_c$ = compression-going frequency, and $\beta'$ and
$k'_1$ are the appropriate constants derived from the Basquin law. The frequency
separation method has been found to give a good correlation between the experimental
and the predicted lives for a variety of wave shapes and dwell times, figure 27 (Coffin
1977).


Advantages of frequency based approaches: 

(1) Total strain is a major correlating parameter and hence is of direct use to designers. 

(2) Subdivision of inelastic strains into various components is not made as in the strain
range partitioning method. Here the inelastic strain arising from a host of deformation
processes is considered to be inseparable.

(3) Implicit in the method is an assumption that significant damage for a cycle occurs 
near the peak tensile strain and is affected by strain rate in reaching this maximum. 

Figure 27. Fatigue life prediction for various heats of type 304 SS at 867 K by the
frequency separation method (Coffin 1977).




Disadvantages:

(1) It is implied that the fatigue life decreases continuously with decrease in frequency.
However, in certain materials homogenization of deformation (Valsan et al 1994)
and thermal ageing cause improvement in fatigue life (Bhanu Sankara Rao 1989).

(2) No distinction is made between tension and compression dwell periods or
their damaging effects, which are material dependent.

(3) Variable temperature effects are not accounted for. 

(4) Analysis assumes that hysteresis loop is closed. Hence it cannot take care of strain 
ratchetting. 

(5) Evaluation of the equation needs determination of several parameters. 

(6) For complex loops, determination of frequency is difficult. 


5.4 Strain range partitioning approach 

In the strain range partitioning (SRP) method (Manson et al 1971), the total inelastic strain
range is partitioned into four possible components depending on the direction of
straining (tension or compression) and the type of inelastic strain (creep or time-independent
plasticity), figure 28 (Hirschberg & Halford 1976):

$\Delta\varepsilon_{pp}$: tensile plastic strain reversed by compressive plastic strain;

$\Delta\varepsilon_{cp}$: tensile creep strain reversed by compressive plastic strain;

$\Delta\varepsilon_{pc}$: tensile plastic strain reversed by compressive creep strain;

$\Delta\varepsilon_{cc}$: tensile creep strain reversed by compressive creep strain.

Partitioning of a generalized hysteresis loop is illustrated in figure 29 (Hirschberg
& Halford 1976). Only three of these four generic strain ranges can exist in a given

Figure 28. Idealised hysteresis loops for defining individual partitioned strain range-life
relationships: (a) PP, (b) CP, (c) PC and (d) CC type of cycles (Hirschberg & Halford 1976).


hysteresis loop, since $\Delta\varepsilon_{pc}$ and $\Delta\varepsilon_{cp}$ cannot co-exist. By choosing suitable experiments,
relations between inelastic strain and life can be determined for a material of interest
for each of these strain ranges:

$$N_{jk} = A_{jk} (\Delta\varepsilon_{jk})^{\alpha_{jk}},$$

where $j$ and $k$ represent plasticity or creep (Manson et al 1971; Hirschberg & Halford
1976). The damage fractions resulting from each of the partitioned strain range
components are summed up by an interaction damage rule (figure 30),

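$$F_{pp}/N_{pp} + F_{pc}/N_{pc} \ (\text{or } F_{cp}/N_{cp}) + F_{cc}/N_{cc} = 1/N_{pred}. \tag{13}$$

Here $N_{pp}$, $N_{pc}$, $N_{cp}$ and $N_{cc}$ are obtained from the four generic life relationships as
shown in figure 30, and $F_{pp} = \Delta\varepsilon_{pp}/\Delta\varepsilon_{in}$; $F_{pc} = \Delta\varepsilon_{pc}/\Delta\varepsilon_{in}$; $F_{cp} = \Delta\varepsilon_{cp}/\Delta\varepsilon_{in}$ and
$F_{cc} = \Delta\varepsilon_{cc}/\Delta\varepsilon_{in}$, where $\Delta\varepsilon_{in}$ is the total inelastic strain range.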
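
The interaction damage rule is a weighted harmonic combination of the generic lives, which the sketch below evaluates. It is a minimal illustration under stated assumptions: the function name and dictionary layout are hypothetical, the generic lives $N_{jk}$ are supplied directly by the caller (assumed to have been read from each life relationship at the total inelastic strain range), and the numbers in the usage lines are invented for the example.

```python
def srp_predicted_life(strain_ranges, generic_lives):
    """Interaction damage rule: 1/N_pred = sum(F_jk / N_jk).

    strain_ranges: dict with keys among 'pp', 'pc', 'cp', 'cc' giving the
                   partitioned inelastic strain range components.
    generic_lives: dict with the same keys giving the life N_jk from each
                   generic relationship (supplied by the caller).
    """
    if strain_ranges.get("pc", 0.0) > 0.0 and strain_ranges.get("cp", 0.0) > 0.0:
        raise ValueError("PC and CP strain ranges cannot co-exist in one loop")
    total = sum(strain_ranges.values())
    if total <= 0.0:
        raise ValueError("total inelastic strain range must be positive")
    damage = 0.0
    for kind, value in strain_ranges.items():
        if value > 0.0:
            fraction = value / total  # F_jk = d_eps_jk / d_eps_in
            damage += fraction / generic_lives[kind]
    return 1.0 / damage


if __name__ == "__main__":
    # Hypothetical loop: 75% PP and 25% CP strain, with invented generic lives.
    print(srp_predicted_life({"pp": 0.003, "cp": 0.001},
                             {"pp": 2000.0, "cp": 500.0}))
```

Because the fractions $F_{jk}$ sum to unity, the predicted life always lies between the shortest and longest generic life involved, which is the basis of the bounding property claimed for SRP.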

In several high temperature alloys SRP has been found to predict
life satisfactorily within a factor of 2 (Halford et al 1972; Hirschberg & Halford 1976; Hoffelner
et al 1983). Several modifications to the SRP approach have subsequently been proposed.
A ductility normalized SRP has been suggested to take into account the effect
Figure 29. Defining par¬

Figure 30. Definition of terms for an interaction damage rule (Hirschberg
& Halford 1976).


of variables like environment, heat to heat variations, degree of cold work, heat 
treatment, radiation damage and thermal exposure (Halford et al 1977). A total strain 
range version of the original SRP has been put forth to evaluate creep-fatigue 
interaction under low strain conditions (Halford & Saltsman 1983; Saltsman & 
Halford 1985), where inelastic strains cannot be determined with sufficient accuracy 
to make reliable life predictions. Recently, SRP has been modified by Duan et al 
(1988) to enable life prediction in high strength low ductility alloys. This approach
synthesises the energy concept of Ostergren (1976b) and the inelastic strain partitioning
of the SRP.

The time dependency of the strain range partitioning approach
for CP cycles has been examined in detail by Kalluri & Manson (1984). For a CP cycle, the time
taken to introduce a given amount of creep strain depends on the position and the
type (stress or strain) of hold. For example, the creep strain accumulated during stress
relaxation in a tensile strain hold test can be obtained within a much shorter time in
a corresponding stress hold test. Hence, the time required to complete the hysteresis
loop differs, and suitable modifications to the CP life relationships incorporating
the time parameter have therefore been proposed to take into account various time-dependent
damaging processes (Kalluri & Manson 1984).

Some of the advantages of SRP include the following: 

(a) SRP can be used to establish upper and lower bounds on life from the most
conservative and least conservative of the partitioned strain range versus life relationships;

(b) Wave shape effects are treated directly;

(c) Fatigue relationships are relatively temperature insensitive. The effect of temperature
is to alter creep and plasticity in a cycle;

(d) Environmental effects are introduced through the basic ductility normalized relationship.

This approach suffers from the following disadvantages:

(a) Accurate knowledge of the cyclic history of service components is required, and
complex loops are inherently difficult to partition;

(b) Hysteresis loops need to be closed; 

(c) Life predictions may be inaccurate in situations where the dominant degrading 
mechanism is environmental; 

(d) If the failure relationships lie close to each other or when there is a cross over 
between them, choice of a strain range component that causes maximum damage 
becomes difficult. 

5.5 Hysteresis energy approach 

Ostergren (1976b) has proposed a damage function based on net tensile hysteresis 
energy for predicting fatigue life under different frequencies and hold time conditions. 
Further, the mean stress concept is introduced to take into account the detrimental 
effects of compression holds in some nickel base superalloys (Leven 1973; Lord 
& Coffin 1973; Ostergren 1976b). The tensile hysteresis energy is approximated as 
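$\Delta W_T = \sigma_T \Delta\varepsilon_p$, where $\sigma_T$ is the tensile stress range and $\Delta\varepsilon_p$ is the plastic strain range. It is
then assumed that universality of the Coffin-Manson law applies to the hysteresis energy
and hence

$$\sigma_T \Delta\varepsilon_p N_f^{\beta} = \text{constant}. \qquad (14)$$

Here $\sigma_T = \sigma_m + \Delta\sigma/2$, where $\sigma_m$ is the mean stress and $\Delta\sigma$ is the stress range corresponding
to a given $\Delta\varepsilon_p$. The above equation has been modified for frequency and hold time
conditions, during which time-dependent damage becomes significant, as

$$\sigma_T \Delta\varepsilon_p N_f^{\beta} \nu^{\beta(k-1)} = \text{constant}, \qquad (15)$$

where $\nu$ is the frequency and $\beta$ and $k$ are material constants.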

Frequency is suitably defined depending upon the damaging effects of compression and 
tension holds. In this way, the beneficial and harmful effects of compression holds could 
be accounted for. 
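
As a rough illustration, the hysteresis energy relation can be solved for life once the constants are known. The sketch below assumes the frequency-modified form $\sigma_T \Delta\varepsilon_p N_f^{\beta} \nu^{\beta(k-1)} = \text{constant}$, which reduces to the unmodified relation when $k = 1$. The function names and all numerical values are hypothetical and are not taken from Ostergren's data.

```python
def tensile_stress(sigma_mean, stress_range):
    """sigma_T = sigma_m + (stress range)/2, as defined in the text."""
    return sigma_mean + stress_range / 2.0


def ostergren_life(sigma_t, delta_eps_p, const, beta, freq=1.0, k=1.0):
    """Solve sigma_T * d_eps_p * N_f**beta * freq**(beta*(k-1)) = const for N_f.

    With k = 1 the frequency term drops out and the unmodified
    hysteresis energy relation is recovered.
    All inputs are illustrative; units only need to be self-consistent.
    """
    if min(sigma_t, delta_eps_p, const, beta, freq) <= 0.0:
        raise ValueError("sigma_t, delta_eps_p, const, beta and freq must be positive")
    energy = sigma_t * delta_eps_p            # tensile hysteresis energy
    freq_term = freq ** (beta * (k - 1.0))    # time-dependent correction
    return (const / (energy * freq_term)) ** (1.0 / beta)


if __name__ == "__main__":
    sigma_t = tensile_stress(sigma_mean=0.0, stress_range=400.0)
    print(ostergren_life(sigma_t, 0.01, const=20.0, beta=0.5))
    print(ostergren_life(sigma_t, 0.01, const=20.0, beta=0.5, freq=0.01, k=0.8))
```

With $k < 1$, lowering the frequency shortens the predicted life, which is the qualitative trend the frequency term is meant to capture.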

5.6 Ductility exhaustion approach 

In the ductility exhaustion approach (Edmunds & White 1966) it is assumed that the 
specimen failure occurs when the accumulated creep relaxation strain in a hold time 
test becomes equal to the monotonic creep ductility of the material. Life predicted using 
this approach, however, is found to be highly conservative. It was proposed (Priest 
& Ellison 1980,1981) that only a fraction of the relaxed strain should be compared with 
creep ductility. This fraction was defined by a transition strain rate above which matrix 
deformation dominates and contributes to fatigue damage and below which grain 
boundary damage accumulates and creep becomes important (figure 31). Here $\Delta\varepsilon_c$
corresponds to the creep strain accumulated in a hold cycle, $\Delta\varepsilon_p$ is the “effective”
plastic strain component (i.e. the part of the inelastic strain range not associated with
grain boundary damage) and $\Delta\varepsilon_{pp}$ is the plastic strain corresponding to pure fatigue.
Transition strain rates corresponding to the damage processes are obtained from
deformation mechanism maps, typically as shown in figure 32 for 1Cr-1Mo-V steel,
565°C, grain size 23 μm (Priest & Ellison 1981).

Figure 31. Partitioning of a typical cyclic-hold hysteresis loop (Priest & Ellison 1981).


In this method, creep and fatigue are considered to be independent damaging 
processes which compete to cause failure. The one that reaches its critical ductility 
exhaustion in the least number of cycles dominates the mode of failure. A creep-fatigue 
loading situation is creep dominated when $N_c \Delta\varepsilon_c = D_c$, where $D_c$ is the tensile creep
ductility and $N_c$ is the life. Rupture ductility of a material is not a constant but depends
on temperature, strain rate, metallurgical variables and state of deformation (i.e.,
constrained or unconstrained). Fatigue damage dominated lifetimes are similarly
expressed by the equation $N_p \Delta\varepsilon_p = D_p$, where $N_p$ is the fatigue dominated lifetime and
$D_p$ is the fatigue ductility defined by the strain intercept at one reversal from the
Coffin-Manson plot. For creep-fatigue interaction, the two mechanisms are summed
linearly as

$$1/N_c + 1/N_p = 1/N_{pred}. \qquad (16)$$
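
The ductility exhaustion bookkeeping can be sketched in a few lines. The example below evaluates the creep dominated life $N_c = D_c/\Delta\varepsilon_c$, the fatigue dominated life $N_p = D_p/\Delta\varepsilon_p$ and their linear summation $1/N_c + 1/N_p = 1/N_{pred}$. The function name and the input values are illustrative assumptions only; they are not data from Priest & Ellison.

```python
def ductility_exhaustion_life(d_eps_c, d_eps_p, creep_ductility, fatigue_ductility):
    """Return (N_c, N_p, N_pred, dominant_mode) for one cycle shape.

    d_eps_c           : creep strain accumulated per cycle
    d_eps_p           : "effective" plastic strain per cycle
    creep_ductility   : D_c, tensile creep ductility
    fatigue_ductility : D_p, strain intercept at one reversal
    """
    if d_eps_c <= 0.0 or d_eps_p <= 0.0:
        raise ValueError("strain components must be positive")
    n_c = creep_ductility / d_eps_c          # cycles to exhaust creep ductility
    n_p = fatigue_ductility / d_eps_p        # cycles to exhaust fatigue ductility
    n_pred = 1.0 / (1.0 / n_c + 1.0 / n_p)   # linear summation of the two modes
    mode = "creep" if n_c < n_p else "fatigue"
    return n_c, n_p, n_pred, mode


if __name__ == "__main__":
    # Hypothetical inputs chosen only to show the arithmetic.
    print(ductility_exhaustion_life(0.001, 0.004, 0.1, 0.8))
```

The mechanism that exhausts its ductility in fewer cycles is reported as dominant, mirroring the competition between creep and fatigue described above.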

Figure 32. Variation in dominant damage mechanisms with temperature and strain rate (1Cr-Mo-V steel; d = 23 μm) (Priest & Ellison 1981).

Good predictions have been reported by the application of this technique to
1 Cr-Mo-V steel data, figure 33 (Priest & Ellison 1981). This method has been used 
successfully for predicting creep-fatigue life by a number of investigators for both ferritic 
and austenitic steels (Miller et al 1982; Priest et al 1983; Tomkins 1983; Wood et al 1988) 
and also for 17 Cr-8Ni-2Mo weld metal (Wood et al 1988). Damage healing effects 
can be accommodated by the ductility exhaustion technique (Priest et al 1983). This 
method can also be employed for slow-fast cycling conditions (Tomkins 1983). 

Ductility exhaustion method has the following limitations. 

(a) Severe damaging effects of compressive dwells seen in nickel base alloys are difficult 
to predict by this method. 

(b) Ductility is not a unique material property. Coffin-Manson plots exhibit two- 
slope behaviour under certain circumstances; determination of fatigue ductility in those 
situations is uncertain. Variability in creep ductility from heat to heat makes it difficult 
to select appropriate ductility values for creep damage. Further, tensile creep 
ductility may not be the appropriate one to use in compression hold conditions. 

(c) Determination of transition strain rate in hold tests needs elaborate metallographic 
investigations. 

(d) Deformation mechanism maps used for establishing the damage regime boundaries 
are only as good as the equations from which they are derived. Semi-empirical nature 
of the maps casts doubts with respect to their applicability to different sets of data. 

Figure 33. Comparison of observed and predicted lives by ductility exhaustion approach for various cycle shapes (data from Priest & Ellison 1981).

5.7 Damage rate model

A strain-based approach which takes into account the rate of damage development has
been suggested by Majumdar & Maiya (1976a, 1976b, 1979) and Maiya & Majumdar
(1977). It is assumed that there are two types of damage, namely cracks (fatigue) and
cavities (creep) in creep-fatigue situation. If the two damage mechanisms are additive, 
the damage rate is given by the sum of equations: 

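$$\frac{1}{a}\frac{da}{dt} = \begin{cases} T\,|\varepsilon_p|^m\,|\dot\varepsilon_p|^{k_a} & \text{(for tension)}, \\ C\,|\varepsilon_p|^m\,|\dot\varepsilon_p|^{k_a} & \text{(for compression)}, \end{cases} \qquad (17)$$

and

$$\frac{1}{c}\frac{dc}{dt} = \begin{cases} G\,|\varepsilon_p|^m\,|\dot\varepsilon_p|^{k_c} & \text{(for tension)}, \\ -G\,|\varepsilon_p|^m\,|\dot\varepsilon_p|^{k_c} & \text{(for compression)}. \end{cases} \qquad (18)$$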

Equation (17) describes the crack damage due to fatigue and (18) describes the
cavitation damage due to creep. $T$, $C$, $G$, $k_a$, $k_c$ and $m$ are material parameters which are
functions of temperature, environment and metallurgical state of the material; $|\varepsilon_p|$ and $|\dot\varepsilon_p|$
are current absolute values of plastic strain and strain rate respectively, and $a$ and $c$ are
the crack size and cavity size respectively at time $t$. $T$ and $C$ are included to account for
differences in growth rates occurring in tension and compression. $G$ or $-G$ is used to
denote tension or compression. Final failure is calculated as the reciprocal of the sum of
crack and cavity damage. 
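
To make the crack damage equation concrete, the sketch below integrates $(1/a)\,da/dt = \{T \text{ or } C\}\,|\varepsilon_p|^m |\dot\varepsilon_p|^{k_a}$ over one symmetric triangular cycle at constant plastic strain rate and then counts the cycles needed for the crack to grow from an initial size to a final size. It rests on simplifying assumptions that are not from the source: the tension half of the cycle is identified with positive plastic strain, cavitation is ignored, and every parameter value and function name is hypothetical. Under these assumptions the per-cycle increment has the closed form $2(T + C)\,\varepsilon_a^{m+1}\,\dot\varepsilon^{k_a - 1}/(m + 1)$, where $\varepsilon_a$ is the plastic strain amplitude, which is used as a check.

```python
import math


def crack_growth_increment(eps_amp, eps_rate, T, C, m, k_a, steps=4000):
    """Midpoint-rule integral of (1/a) da/dt over one triangular cycle.

    The plastic strain follows 0 -> +eps_amp -> 0 -> -eps_amp -> 0
    at constant absolute strain rate eps_rate.
    """
    period = 4.0 * eps_amp / eps_rate
    dt = period / steps
    increment = 0.0
    for i in range(steps):
        phase = (i + 0.5) / steps              # midpoint of the time step
        if phase < 0.25:
            eps = 4.0 * eps_amp * phase
        elif phase < 0.75:
            eps = eps_amp * (2.0 - 4.0 * phase)
        else:
            eps = eps_amp * (4.0 * phase - 4.0)
        coeff = T if eps >= 0.0 else C         # assumption: tension <=> eps >= 0
        increment += coeff * abs(eps) ** m * eps_rate ** k_a * dt
    return increment                           # equals the change in ln(a) per cycle


def closed_form_increment(eps_amp, eps_rate, T, C, m, k_a):
    """Analytical value of the same integral, used as a cross-check."""
    return 2.0 * (T + C) * eps_amp ** (m + 1.0) * eps_rate ** (k_a - 1.0) / (m + 1.0)


def cycles_to_failure(a0, af, eps_amp, eps_rate, T, C, m, k_a):
    """Cycles for the crack to grow from a0 to af with a fixed cycle shape."""
    return math.log(af / a0) / crack_growth_increment(eps_amp, eps_rate, T, C, m, k_a)


if __name__ == "__main__":
    args = (0.005, 1e-3, 2.0, 1.0, 1.0, 0.8)   # hypothetical parameters
    print(crack_growth_increment(*args), closed_form_increment(*args))
    print(cycles_to_failure(10e-6, 2e-3, *args))
```

Because the growth law is proportional to the current crack size, the life depends on the crack sizes only through $\ln(a_f/a_0)$.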

For the cases where the fatigue and creep damages are interactive, crack damage (17) 
has been modified as, 

$$(1/a)\,da/dt = \{T \text{ or } C\}\,(1 + \alpha \ln(c/c_0))\,|\varepsilon_p|^m\,|\dot\varepsilon_p|^{k_a}, \qquad (19)$$

and the expression for cavitation damage remains the same. 

In the cavity growth formulation, (18), it is assumed that the cavities are widely
separated and hence growth occurs independently. Further, the presence of cracks does
not influence the growth of cavities. However, cavities of size greater than $c_0$ interact
with the crack and increase the crack growth rate. This is considered in the interactive
damage rule by introducing a term $(1 + \alpha \ln(c/c_0))$ into (19).

In the interactive damage rule, failure is assumed to occur when the crack length
$a$ reaches the size for failure $a_f$. If $\alpha = 0$, the fatigue crack growth is unaffected by
cavities and (19) reverts to the continuous cycling form, (17).

The interactive damage rate model has been found to satisfactorily predict fatigue 
lives for several high temperature alloys (Majumdar & Maiya 1976a). The main 
advantage of this model is that it takes into account microstructural damage parameters
and their evolution with time. Secondly, it incorporates interaction between
fatigue and creep damage modes. Strain history of components can be calculated more 
reliably than their stress history, making this method more useful to designers. 
Moreover, the model can also be applied to situations when the stress-strain hysteresis 
loops are not closed. However, determination of a large number of parameters involved 
in these equations needs elaborate experimentation. Further, definition of initial crack 
and cavity sizes needs careful consideration. 

5.8 Tomkins and Wareing’s model 

Tomkins & Wareing (1977) and Wareing et al (1980) have extended the model 
proposed by Tomkins (1968) for fatigue crack growth to include the influence of 
cavitation. In Tomkins’ model (1968) it is assumed that crack initiation is a negligible
phase and the entire life is spent in crack propagation. During crack growth, a new
crack surface is generated by a shear decohesion mechanism at the crack tip. The extent
of decohesion is related to the extent of intense crack tip deformation ($D$) and the
applied plastic strain range. In other words, the factors determining the crack growth 
rate per cycle are the magnitude of the applied stress-strain field, the current crack 
length and a material strength parameter. Fatigue endurance is determined by the 
initial crack size, rate of crack growth and the critical crack size. 

In this model, the crack propagation rate is expressed as, 

$$da/dN = \Delta\varepsilon_p D / 2^{1/2}, \qquad (20)$$

where

$D = 2^{1/2}\,[\sec(\pi\sigma_T/2T) - 1]\,a$;
$\sigma_T = K' \Delta\varepsilon_p^{n'}$;
$K'$ = cyclic strain hardening coefficient;
$n'$ = cyclic strain hardening exponent;
$T$ = approximately equal to cyclic UTS;
$a$ = instantaneous crack length;
$D$ = extent of deformation at the crack tip;
$\Delta\varepsilon_p$ = applied plastic strain range.

For $\sigma_T/T < 0.6$ and a power law for $\sigma_T$, integration of (20) gives the Coffin-Manson law,

$$\Delta\varepsilon_p N_f^{\alpha} = [(8/\pi^2)(T/K')^2 \ln(a_f/a_0)]^{\alpha}, \qquad (21)$$

where

$\alpha = 1/(2n' + 1)$;
$a_f$ = final crack length; 2/3 of the specimen section;
$a_0$ = initial crack length (10 μm for ductile materials).

From (21) it follows that the slope of the fatigue life curve is related only to the slope
of the cyclic stress-strain curve. The position of the fatigue curve depends on the ratios
($T/K'$) and ($a_f/a_0$). As $T$ and $K'$ are both strength parameters, variations in strength
levels alone will not affect endurance for cycling at a given strain range. As the ratio
$a_f/a_0$ appears in logarithmic form in (21), it must change drastically to effect a significant
positional change. The slope of the cyclic stress-strain curve is then the most
significant factor affecting endurance.
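
The integration step leading from the crack growth law to the Coffin-Manson form can be written out as follows. This is a sketch that takes the growth law as $da/dN = \Delta\varepsilon_p D/2^{1/2}$ with $D = 2^{1/2}[\sec(\pi\sigma_T/2T) - 1]\,a$ and uses the small-argument expansion $\sec x - 1 \approx x^2/2$, which is assumed here to be the approximation intended by the restriction to low $\sigma_T/T$.

```latex
\begin{align*}
\frac{da}{dN} &= \Delta\varepsilon_p \left[\sec\!\left(\frac{\pi\sigma_T}{2T}\right) - 1\right] a
  \;\approx\; \Delta\varepsilon_p \,\frac{\pi^2}{8}\left(\frac{\sigma_T}{T}\right)^{2} a, \\
\frac{da}{a} &= \frac{\pi^2}{8}\left(\frac{K'}{T}\right)^{2} \Delta\varepsilon_p^{\,2n'+1}\, dN
  \qquad \text{using } \sigma_T = K'\,\Delta\varepsilon_p^{\,n'}, \\
\ln\frac{a_f}{a_0} &= \frac{\pi^2}{8}\left(\frac{K'}{T}\right)^{2} \Delta\varepsilon_p^{\,2n'+1}\, N_f, \\
\Delta\varepsilon_p\, N_f^{\alpha} &= \left[\frac{8}{\pi^2}\left(\frac{T}{K'}\right)^{2} \ln\frac{a_f}{a_0}\right]^{\alpha},
  \qquad \alpha = \frac{1}{2n'+1}.
\end{align*}
```

The exponent $\alpha$ depends only on $n'$, which is why the slope of the life curve follows from the slope of the cyclic stress-strain curve alone.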

Applicability of Tomkins’ model has been examined for austenitic stainless
steels (Wareing 1988), ferritic steels (Wareing 1988; Choudhary et al 1991b) and
a magnesium alloy (Wareing 1988). In the case of austenitic stainless steels, good
agreement between predicted and experimental lives has been obtained for various
testing and microstructural conditions. However, major differences between the
predicted and experimental lives were noticed at low strain ranges (figure 34), where
the number of cycles required for crack initiation becomes significant. The ability
of the crack propagation model, based primarily on a stress parameter, to predict
fatigue endurance at ambient and elevated temperatures is a consequence of the
balanced nature of the cycle, which precludes the formation of creep damage at elevated
temperatures.

Figure 34. Coffin-Manson plots for type 316 SS at various temperatures (Wareing 1988).

Crack propagation during creep-fatigue situation occurs in three distinct phases: 
Phase I is characterised by ductile striations and the growth rate in a tensile hold 
test is identical to that observed for the same applied strain amplitude without 
dwell period in the cycle. Phase II which occupies a major portion of the fracture 
surface occurs very rapidly compared to phase I and follows intergranular path. The 
reason for the large increase in growth rate is the attainment of cavity linkage at 
the crack tip. This triggers the unzipping of the cavitated material by the crack tip 
displacement field. Finally phase III, unstable fracture, occurs in a single cycle. 
The criterion for the onset of phase II is given by (1). An approximate solution for $\delta$ in
a fully plastic field is given by (Tomkins 1975):

$$\delta/2 = \ln(\sec(\pi\sigma_T/2T))\,[(2T/E) + (2\Delta\varepsilon_p/(1 + n'))]\,a. \qquad (22)$$

Because phase II is so rapid, the life of a specimen failing by this mechanism is the number of cycles
for initiation and phase I growth, i.e., the number of cycles to attain the critical crack
size $a'_f$, commensurate with the prevailing cavity spacing. For rapid crack initiation,
this is obtained by substituting $a_0$ (10 μm) and $a'_f$ in Tomkins’ model.

Predictions of this model are compared with experimental data for 316 SS at 843 K 
for various tensile hold time tests in figure 35. Continuous cycling data are in good 
agreement with the prediction of (21). For the hold time tests, the data at the highest 
strains are in line with the prediction (1 and 22) for a cavity spacing of 1 μm. With
decreasing strain range, the data are more in line with prediction based on spacings of
0.2 and 0.1 μm. However, at the lowest strain range (0.15%), prediction overestimates
life data. For strain ranges greater than 0.15%, fracture surfaces were predominantly
intergranular with fatigue striations close to the specimen surface, thus confirming the 
model proposed for crack-cavity interaction during creep-fatigue failure. 

Figure 35. Comparison of predicted and experimental data of fatigue life in type 316 SS tested at 843 K with various tensile hold periods (Wareing 1988).


5.9 Cavitation model of Rie et al 

A model has been proposed by Rie et al (1988) which takes into account the nucleation
and growth of r-type cavities on the grain boundaries under creep-fatigue
conditions. Fatigue life is defined by unstable crack advance; this happens if the
crack tip opening displacement becomes equal to the spacing of the nucleated intergranular
cavities, (1), which leads to unzipping of cavitated material by the crack tip
displacement field.

Suitable expressions for $\lambda$, $p$ and $\delta$ have been derived based on physical models of
cavity nucleation, cavity growth and crack-tip opening displacement due to creep-fatigue.
A cycle-dependent cavity nucleation under cyclic creep and low cycle fatigue
conditions with superimposed hold time is considered for evaluating the number of
cavities at the grain boundaries perpendicular to the load axis. Cavity spacing, $\lambda$, is then
determined from the cavity density. While the nucleation of cavities is governed by
deformation of the matrix, cavity growth is controlled by diffusion (Miller et al 1979).
The crack-tip opening displacement includes an elastic term plus a contribution due to
plastic deformation and thermally activated time-dependent processes (Ilschner 1982).
The fatigue life is then estimated using the criterion of the critical cavity configuration. The
predicted and experimental fatigue lives are compared in figure 36. The method is
capable of predicting lives even for conditions of long hold periods. This model is
physically more reasonable than the model of Tomkins & Wareing (1977), even
though the failure criterion for unstable crack growth, (1), is the same for both. In Tomkins
and Wareing’s model, arbitrary empirical values are assumed for the ($\lambda - p$) term to get the
best fit to the experimental fatigue life, whereas in the model of Rie et al (1988) suitable
mathematical expressions for $\lambda$ and $p$ are derived from theoretical considerations,
including the contribution to $\delta$ due to creep.

Figure 36. Comparison of experimental life and predicted life following the cavitation model proposed by Rie et al (1988) in AISI type 304 SS.


6. Concluding remarks 


In this review, the complex nature of high temperature low cycle fatigue phenomenon 
has been highlighted. It is shown that various time-dependent phenomena like 
oxidation, dynamic strain ageing and creep interact with cyclic loading to determine 
life of high temperature components. The importance of a particular time-dependent 
process depends on the material, its microstructural condition and test parameters like 
strain range, frequency, position and duration of hold and temperature. Important life 
prediction techniques for components subjected to high temperature fatigue have been 
briefly reviewed. The applicability of these approaches is dependent on material and 
service parameters. A complete understanding of the different processes that occur
during time-dependent low cycle fatigue is required before a unified approach applicable
to different conditions could be developed.

Oxidation can cause severe degradation in fatigue life influencing both crack 
initiation and propagation processes. The influence of environment becomes more 
pronounced with lowering of frequency, decreasing strain amplitude and imposition of 
hold during fatigue testing. In tests on 304 SS, it was observed that as the frequency was 
lowered, crack initiation changed from transgranular (induced by slip band) to 
intergranular mode (induced by oxidation). From comparative tests in air and vacuum 
environments, it is seen that while life decreases with decrease in frequency in air 
environment, frequency has little effect in vacuum. Oxidation effects become more 
important at low strain amplitudes. Depending on the material under consideration, 
either tensile or compressive holds could be more damaging. This has been attributed 
to the behaviour of oxide formed during the hold period on subsequent loading. 
A variety of mechanisms have been identified for oxidation-fatigue interaction that 
contribute to deterioration in life. 






Dynamic strain ageing (DSA) has been found to exert a significant influence on LCF 
behaviour at high temperatures. Occurrence of DSA manifests during total strain- 
controlled fatigue tests in the form of serrations in stress-strain hysteresis loops, 
increased cyclic work hardening, reduced plastic strain range and increase in response 
stress with decrease in strain rate or increase in temperature. Further, DSA causes 
localisation of plastic flow leading to enhanced planarity of slip and widely-spaced slip 
bands. Impingement of slip bands on grain boundaries causes increased grain boundary
decohesion, leading to reduced fatigue life.

Unbalanced slow-fast cycling and tensile hold conditions introduce cavitational 
damage in addition to fatigue cracking. In creep-fatigue interaction, cyclic loading 
increases creep cavitation damage and cavitation has been found to enhance fatigue 
crack initiation and propagation. Rapid crack growth occurs under conditions where 
fatigue crack can link-up with grain boundary cavities. The susceptibility of a material 
to creep-fatigue damage is sensitively dependent on the creep ductility of the material. 
Creep brittle materials are particularly sensitive to interaction damage. While generally 
tensile holds are more detrimental to life in most of the materials, compressive holds 
have been found to be deleterious for nickel base alloys. 

Several life prediction techniques have been proposed for components under creep- 
fatigue interaction. These range from purely empirical to mechanistic models based on 
microstructural observations. Best known among these techniques are linear damage 
summation rule, strain range partitioning, frequency modification approaches and models 
based on micromechanistic considerations like ductility exhaustion, damage rate and 
fatigue crack-cavity interaction. To be reliable, the techniques must take into account the 
physical damage processes occurring during creep-fatigue interaction. The understanding 
of creep-fatigue interaction behaviour would aid in extrapolation of short term laboratory 
data to long term service conditions, involving complex loading situations under operating 
conditions for a wide range of materials. While linear damage rule does not have a 
mechanistic basis, it is still the best available tool to design engineers, because of its 
simplicity. Refinements to the linear damage approach could be achieved by replacing
the creep damage term $t/T_d$ by $\varepsilon/\varepsilon_r$, where $\varepsilon$ is the accumulated creep strain and $\varepsilon_r$ is the
appropriate rupture ductility, thereby introducing deformation and fracture into the failure
criterion. The creep damage term ($\Delta\varepsilon_c/D_c$) of the ductility exhaustion method could also be
incorporated in the linear damage rule to predict a realistic damage fraction due to creep
rather than the term $t/T_d$. Fatigue crack-cavitation interaction models appear quite
promising as they are based on sound mechanistic considerations. These would be 
particularly applicable to situations where extensive creep cavitation is encountered. 


The authors gratefully acknowledge the collaboration of Dr M Valsan, A Veeramani 
and G Sasikala in the preparation of this paper. 
Stress corrosion cracking and component failure: Causes and
prevention 

U K CHATTERJEE 

Department of Metallurgical & Materials Engineering, Indian Institute of 
Abstract. The century-old problem of stress corrosion cracking (SCC) in
metals and alloys has been reviewed highlighting the metallurgical factors
like grain boundary precipitation and grain boundary segregation contributing
to the process, the electrochemical aspects of the phenomenon and
the current knowledge of the mechanisms which can be broadly classified as
dissolution-based and cleavage-based models. The introduction covers
some field examples of SCC. SCC test methods and preventive measures for
SCC have also been discussed.

Keywords. Stress corrosion cracking; SCC tests; grain boundary precipitation;
grain boundary segregation; repassivation; dissolution mechanisms;
cleavage mechanisms; preventive measures.


1. Introduction 

Stress corrosion cracking (SCC) may be defined as the delayed failure of alloys by 
cracking when exposed to certain environments in the presence of a static tensile stress. 
The importance of a conjoint action of corrosion and stress as well as the nature of 
stress is reflected in the definition. The stress level at which the failure occurs is well 
below the stress required for a mechanical failure in the absence of corrosion. Again, 
corrosion alone in the absence of stress does not produce cracking. The stress can be 
applied or residual. A compressive stress does not cause stress corrosion cracking,
rather it is used to prevent this phenomenon. Environmentally induced crack propagation
due to cyclic loading is not stress corrosion cracking, but is defined as corrosion
fatigue. Environmentally induced cracking in the presence of stress is also encountered
in non-metallic materials, e.g. glass and alumina in water or organic polymers in polar 
solvents. The discussion in the present paper will, however, be confined to SCC in 
metallic materials. 

The earliest report on SCC is probably the occurrence of ‘season cracking’ in brass 
cartridge cases in ammonia-bearing environments in the beginning of the present 
century. Caustic embrittlement of riveted steel boiler plates is another classical 
example of SCC encountered in the early steam-driven locomotives. The causes of SCC 
in this case were the residual stress developed due to riveting operations and sodium 
hydroxide present in these areas. Many alloy-environment combinations leading to
SCC were discovered with the passage of time. It was recognized with concern that
almost all alloys of engineering interest were susceptible to this type of catastrophic
degradation in some environment or other, the only silver lining being that not all alloys
were susceptible to SCC in all environments. It was also noticed that pure metals
were immune to SCC in the environments in which their alloys showed ready and rapid 
cracking. A list of alloy-environment systems exhibiting SCC is presented in table 1. 

It should be remembered that the table is not exhaustive; newer combinations and 
newer environments causing SCC in a particular alloy are being regularly discovered 
and added to this list. The so-called ‘specificity’ of the environment with respect to an 
alloy is no longer true. The immunity of pure metals to SCC is also no longer valid. 
Nevertheless, still only a small number of corrodents cause SCC in a particular alloy 
and SCC of a pure metal is still confined to laboratory tests. 

Practical examples of SCC in engineering alloys, on the other hand, are abundant 
and varied. Intensive SCC was observed in a high-pressure autoclave made of 18-8 
stainless steel (Fontana & Greene 1967). The failure in service took place in a matter of 
hours and was traced to the build up of chloride concentration on the outside surface 
because of evaporation of the water used for cooling which originally contained only 
a negligible amount of chloride. Yet another interesting example of SCC was the 
development of longitudinal cracks in a cold-drawn brass tube which had its origin in 
the bird droppings that provided the ammonia necessary for corrosion (Barer & Peters 
1970). The occurrence of SCC arising from chloride-contaminated moisture has been 
reported in high-strength aluminium alloy components in low-flying British military 
aircraft which led to the ban on the use of the material till a solution was found through 
cladding of the material with pure aluminium. Carbonate-bicarbonate environments 
have been identified as the probable cause of cracking in natural gas transmission steel 
pipe lines (Parkins 1980). 

Numerous cases of SCC in chemical process plants and power-generation plants 
have been reported in the literature. Oxygenated pure or impure high temperature 
water has caused failures in steam generator shells made of carbon or low-alloy steels 
(Congleton et al 1985). Extensive failures have taken place in austenitic 18Mn-4Cr 
steel rotor end-retaining rings in contamination with oxygenated high temperature 
water (Viswanathan 1982). SCC of austenitic stainless steel pipes caused by the same 
environment in boiling water reactors (BWR) has cost the world’s nuclear power 
industries as much as 10 billion dollars (Staehle 1990). SCC of nickel based alloy 600 in 
reducing high temperature water and supercritical steam in pressurized water reactors 
(PWR) has been reported (Coriou et al 1969). Room temperature cracking of sensitized 
austenitic stainless steels produced by polythionic acid was first experienced in 
catalytic reformers used in the petroleum industry (Dravnicks & Samans 1957). 
Titanium alloys, so reputed for their excellent corrosion resistance in some of the most 
aggressive media, are known to fail by SCC in many environments including red 
fuming nitric acid (RFNA), hot molten salts, methanol, liquid dinitrogen tetroxide
(N₂O₄) and salt water (Blackburn et al 1972). Hot salt SCC of titanium alloys was first
discovered at fingerprints on creep test specimens (Jackson & Boyd 1966). Dinitrogen 
tetroxide is used as an oxidant in rocket engines. Interestingly, SCC in this environment 
is inhibited by the traces of NO. 

Stress corrosion cracking thus affects the performance of metallic components, 
whether mundane or exotic, in environments of diverse chemical nature, some of which 

Table 1. Alloy-environment systems exhibiting SCC.

Alloy                        Environment

Aluminium alloys             Air with water vapour
                             Potable water
                             Seawater
                             NaCl solutions
                             NaCl-H₂O₂ solutions

Carbon steels                Caustic NaOH solutions
                             Calcium, ammonium and sodium nitrate solutions
                             HCN solutions
                             Acidified H₂S solutions
                             Anhydrous liquid ammonia
                             Carbonate/bicarbonate
                             CO/CO₂ solutions
                             Seawater

Copper alloys                Ammoniacal solutions
                             Amines
                             Nitrites

Nickel alloys                Caustic alkaline solutions
                             High-temperature chloride solutions
                             High purity steam
                             Acidic fluoride solutions

Stainless steels
  Austenitic                 Hot acid chloride solutions
                             NaCl-H₂O₂ solutions
  Austenitic (sensitized)    NaOH-H₂S solutions
                             Seawater
                             Concentrated caustic solutions
                             Neutral halides: Br⁻, I⁻, F⁻
                             Polythionic acids (H₂SₙO₆)
                             Sulphurous acid
                             Pressurized hot water containing 2 ppm dissolved oxygen
  Ferritic                   H₂S, NH₄Cl, NH₄NO₃, hypochlorite solutions
  Martensitic                Caustic NaOH solutions

Titanium alloys              Red fuming nitric acid
                             Hot salts, molten salts
                             N₂O₄
                             Methanol halide


are apparently innocent. No wonder then that the last four decades witnessed a spurt of 
research activities in this area for an understanding of the phenomenon as well as for its 
prevention. A commendable review of activities in the period 1965-1990 has been made 
by Newman & Proctor (1990). 


2. General features of SCC 

The following features are generally common to SCC: 

(1) Pure metals are almost immune to cracking in environments where their alloys 
crack readily. 

(2) The existence of a tensile stress, either applied or present as a residual stress, is 
essential for the cracking process. 

(3) SCC for a given alloy usually occurs more readily in a rather specific environment. 

(4) The microscopic appearance of stress corrosion cracks, which are either intergranular
or transgranular (figures 1 & 2), is that of a brittle fracture although the
metal itself may still be quite ductile in a mechanical test. The brittle nature of
fracture is quite evident in SEM fractography (figure 3).

(5) In order for SCC to occur a threshold stress or stress intensity must be exceeded. 
(6) SCC, in general, can be accelerated by the application of anodic currents and can
be stopped by the application of cathodic currents showing that the process is, at 
least partly, electrochemical. 

Stress corrosion cracking is a complicated subject. It is an outcome of material- 
environment-stress interaction. Metallurgical and electrochemical factors and also the 
manner in which a specimen is stressed greatly influence the process. Some of them will be 
discussed in the subsequent sections. Before that, it will be useful to introduce the various 
SCC testing methods and the parameters used for ascertaining the susceptibility to SCC. 


3. Testing methods 

The SCC testing methods can be categorized as below. 

(1) Constant-strain or constant-deformation tests; 

(2) tests on statically loaded smooth samples; 

(3) tests on statically loaded precracked samples; 

(4) slow strain-rate testing (SSRT). 

Constant-strain tests make use of U-bend, C-ring, bent-beam specimens or tensile 
specimens in a small loading frame with nuts on the threaded ends of the specimen. 
Although the stress distribution within the specimen cannot be precisely known and 
the stress conditions are difficult to duplicate from specimen to specimen, U-bend 
specimens have found wide use in evaluating qualitatively the resistance of various 
alloys to various solution conditions. Time to failure, $t_f$, is the parameter used to
measure the resistance to SCC. 

Tests on statically loaded smooth samples are usually conducted at various fixed stress 
levels, and the time to failure is measured. A dead-weight load is hung from one end of 
the specimen, directly or by means of a lever system. Most of the constant deformation 
specimens can be adapted to constant load by adding a spring in the device. Figure 4 
illustrates the typical results obtained from this type of test. A threshold stress below which 
the time to failure approaches infinity is encountered in some systems, whereas in other 
systems it is not observed. Time to failure, t_f, is essentially made up of two components: 
the time for crack initiation, t_i, and the time for crack propagation, t_p, so that 

t_i + t_p = t_f. 

Different metallurgical and electrochemical variables do affect t_f, but it is difficult, if 
not impossible, to ascertain these effects separately on t_i and t_p. In smooth specimens 
usually t_i >> t_p, whereas in practical situations a pit or a surface roughness feature acts as 
an already initiated crack and the question of its propagation under different variables 
assumes more importance. The use of pre-cracked specimens, notched or 
fatigue-precracked, and the application of linear elastic fracture mechanics (LEFM) 
techniques to stress corrosion crack propagation have evolved as a consequence. 

Figure 1. Intergranular stress corrosion cracks in 70-30 brass exposed to ammoniacal 
solution. 

Figure 3. SEM fractograph of intergranular SCC of type 316 stainless steel in 
polythionic acid showing brittle features. 

Tests on statically loaded precracked samples are usually conducted with a constant 
applied load, and the velocity of crack propagation, da/dt, is measured as a function of 
the stress-intensity factor, K. The value of K is calculated from K = σC^{1/2}, where σ is 
the applied stress and C is the crack length. Figure 5 gives the schematic representation 
of a typical da/dt versus K plot. Three regions in the plot are identified as stages I, II 
and III. No crack propagation is observed below some threshold stress-intensity level, 
K_ISCC. In stage I, the crack-propagation rate increases rapidly with the stress-intensity 
factor. In stage II, the crack-propagation rate approaches some constant velocity, 
referred to as the plateau velocity, which is characteristic of the alloy/environment 
combination and is a result of rate-limiting diffusion of the reactant species to the crack 
tip. The effects of variations in composition, changes in heat treatment, electrochemical 
variables and changes to the environment are reflected in the plateau velocity, and such 
data are helpful in alloy selection. Alloys which have a low plateau velocity can be chosen 
in preference to alloys with a high plateau velocity under similar environmental 
conditions. 
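The threshold idea in the da/dt versus K description can be illustrated with a short numerical sketch. It uses the relation K = σC^{1/2} exactly as stated in the text; practical specimen geometries normally need a geometry-dependent calibration factor, which is not included here. The function names, the stress, the crack length and the K_ISCC value are all hypothetical and chosen only for illustration.

```python
import math


def stress_intensity(stress_mpa, crack_length_m):
    """Stress-intensity factor K = sigma * C**0.5, as given in the text.

    Returns K in MPa*m^0.5. No geometry correction factor is applied
    (assumption: the simplified relation from the text is sufficient).
    """
    if stress_mpa < 0 or crack_length_m < 0:
        raise ValueError("stress and crack length must be non-negative")
    return stress_mpa * math.sqrt(crack_length_m)


def crack_growth_expected(k, k_iscc):
    """True when K exceeds the threshold K_ISCC, i.e. stage I or beyond."""
    return k > k_iscc


# Hypothetical example: 200 MPa applied stress, 2.5 mm crack,
# and an assumed threshold of 8 MPa*m^0.5 for the alloy/environment pair.
k_example = stress_intensity(200.0, 0.0025)      # about 10 MPa*m^0.5
will_grow = crack_growth_expected(k_example, 8.0)
print(f"K = {k_example:.2f} MPa*m^0.5, growth expected: {will_grow}")
```

In this sketch a crack is only flagged as growing once K is above the assumed threshold; the plateau velocity of stage II would have to come from measured da/dt data and is not modelled.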



Figure 4. Time-to-failure and applied stress relationship obtained in a constant-load type 
SCC test (axis label: applied stress, σ). 

Figure 5. General relationship between stress corrosion crack velocity and stress intensity 
factor. 


The earliest studies involving precracked specimens and LEFM were carried out by 
Brown & Beachem (1965) at the US Naval Research Laboratory. They showed that 
high strength titanium alloys, which were thought to be immune to SCC in dilute 
aqueous chloride environments on the basis of conventional, smooth specimen stress 
corrosion tests, were in fact highly susceptible when evaluated using fatigue precracked 
specimens. Actually, these alloys are resistant to pitting in these environments and 
stress corrosion crack initiation through pitting does not take place. However, 
pre-existing flaws and defects make these alloys susceptible to SCC. 

Slow strain rate testing is a procedure in which a smooth or a pre-cracked specimen 
exposed to the corrosive environment is strained to failure at a low crosshead speed 
(10^-5 to 10^-9 m/s). The strain to failure in the corrosive environment and the strain to 
failure in an inert environment can then be plotted against strain rate or, more 
usefully, the ratio of these two measurements can be plotted, as shown in figure 6. The 
ratio of other tensile measurements, such as reduction in area and ultimate tensile 
strength, may also be plotted. The plot in figure 6 shows similar behaviour of the two 
alloys at high strain rates, with the ductility ratio approaching unity, indicating that 
failure is through mechanical fracture as SCC crack initiation has not been possible at 
these high strain rates. Alloy B shows SCC susceptibility in a narrow range of strain 
rate; a higher rate of repassivation of the alloy in the environment resists SCC at slower 
strain rates. Alloy A shows the behaviour of an alloy which does not repassivate in the 
environment concerned. The importance of repassivation in SCC will be discussed later. 
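The ductility ratio used in slow strain rate testing can be sketched as a simple calculation. The example below divides a ductility measurement obtained in the aggressive environment by the same measurement in the inert reference environment, and picks out the crosshead speed at which the ratio is lowest. The function name and all data values are hypothetical; they only mimic the kind of behaviour described for an alloy that repassivates at slow strain rates.

```python
def ductility_ratio(value_in_environment, value_in_inert):
    """Ratio of a ductility measure (e.g. elongation) in the aggressive
    environment to the same measure in the inert reference environment.
    A ratio near 1 suggests purely mechanical failure; a low ratio
    suggests susceptibility to SCC."""
    if value_in_inert <= 0:
        raise ValueError("inert-environment value must be positive")
    return value_in_environment / value_in_inert


# Hypothetical results: (crosshead speed in m/s,
#                        elongation % in environment, elongation % inert)
results = [
    (1e-5, 30.0, 30.0),   # fast: mechanical fracture, ratio 1.0
    (1e-7, 15.0, 30.0),   # intermediate: SCC, ratio 0.5
    (1e-9, 27.0, 30.0),   # slow: repassivation resists SCC
]

ratios = [(speed, ductility_ratio(env, inert)) for speed, env, inert in results]
most_susceptible = min(ratios, key=lambda pair: pair[1])
print("lowest ductility ratio at crosshead speed", most_susceptible[0], "m/s")
```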

The slow strain rate technique has the advantage over the constant-strain or 
constant-load test methods in that the experiment can be completed in a short period of 
time to ascertain the susceptibility to SCC of an alloy in a given environment. The test 
conditions, however, are more aggressive than the other methods. So it can be said that 
an alloy showing resistance in this test should be resistant in service with similar 
corrosive conditions, but an alloy showing SCC in a slow strain-rate test would not 
necessarily fail in service, where a forced, continuous strain is absent. 


4. Metallurgical aspects of SCC 


Virtually all alloys are susceptible to SCC in the appropriate environments. The 
susceptibility to SCC is affected by the chemical composition of the alloy, grain size and 
preferential orientation of grains, composition and distribution of precipitates, 
dislocation interactions and the progress of phase transformation. Bulk alloy composition 
can affect passive-film stability and phase distribution (for example, chromium in 
stainless steel), minor alloying elements can cause local changes in passive-film forming 
elements (for example, carbon in stainless steel causing sensitization), impurity elements 
can segregate to grain boundaries and cause local difference in the corrosion rate (for 
example, phosphorus in nickel or nickel-base alloys), and inclusions can cause changes in 
the local crack-tip chemistry as the cracks intersect them (for example, manganese 
sulphide in steel). 

Figure 6. Strain-to-failure plots resulting from slow strain rate testing. (a) Schematic 
of typical ductility versus strain rate behaviour of two different types of alloys 
tested by the slow-strain-rate technique, (b) schematic of the ductility ratio versus 
strain-rate behaviour of two different types of alloys. The ductility ratio is the ratio 
of a ductility measurement such as elongation, reduction in area, or fracture energy 
measured in the aggressive environment to that obtained in the inert reference 
environment. 

The effect of bulk alloy composition on SCC susceptibility is best exemplified in 
stainless steels and copper alloys. Figure 7 shows the effect of nickel content on SCC in 
stainless steels exposed to boiling 42% magnesium chloride solution, a common SCC 
medium used in the laboratory to simulate the cracking in hot chloride environments 
(Copson 1959). The beneficial effect of high nickel content is clearly visible. However, 
a ferritic stainless steel devoid of nickel, which is generally resistant to SCC in most 
common service environments that attack austenitic stainless steels, becomes 
susceptible with the addition of 1.5 to 2% nickel. Duplex stainless steels with high 
chromium and low nickel content are more resistant to SCC than austenitic stainless 
steels. Pure copper, which is almost immune to SCC in ammoniacal environments, becomes 
readily susceptible on alloying with 1% silicon or 0.2% phosphorus (Thompson & Tracy 
1949). Brasses with a zinc content below 15% are resistant to SCC. Addition of a third 
element such as Si or Al to 70Cu-30Zn brass changes the mode of cracking from 
intergranular to transgranular (Baumann 1962). 

Grain-boundary precipitation and grain-boundary segregation play a major role in 
intergranular stress corrosion cracking. Austenitic stainless steels heated in the 
temperature range of 500 to 850°C develop a chromium-depleted zone along the grain 
boundary through the precipitation of chromium carbide, a phenomenon known as 
sensitization. Susceptibility to intergranular SCC and the crack growth rate are related 
to the degree of sensitization. Intergranular SCC of sensitized austenitic stainless steels 
is a recurring problem in cooling-water piping of boiling-water nuclear power plants 
(Gordon 1987). The Ni-Cr-Fe alloy 600 also gets sensitized through chromium 
depletion at the grain boundary and has been reported to fail by intergranular SCC in 
high temperature pressurized water (Pessal et al 1979). Grain-boundary precipitation 
has been identified as a contributing factor in the intergranular SCC of aluminium 
alloys (Hollingsworth & Hunsicker 1987). 7000 Series alloys (Al-Zn-Mg-Cu) are most 
susceptible in the peak-aged condition; the susceptibility is reduced with overaging. 

Figure 7. Effect of nickel content on the susceptibility to SCC of stainless steel wires 
containing 18-20% chromium in boiling MgCl2 solution. 

Grain boundary segregation of alloying elements or impurities has been identified as 
the causative factor of intergranular SCC in many alloys. Grain-boundary enrichment 
of magnesium in Al-Mg alloys accounts for the increased anodic activity or possible 
formation of magnesium hydride along the grain boundaries (Pow et al 1981; Holroyd 
& Seamans 1985). Grain boundary enrichment of impurities such as phosphorus, 
sulphur, carbon and silicon contributes to the intergranular SCC of iron-base alloys, 
austenitic stainless steels and nickel-base alloys. The enrichment of impurities in the 
grain boundaries can be as high as 50% within a region 1 to 2 nm thick, facilitating the 
propagation of a stress corrosion crack along the grain boundary (Jones & Ricker 1992). 

The effect of variation of carbon content on the SCC of very low carbon steels was studied 
by Long & Uhlig (1965). Carbon segregation at the grain boundary was considered to 
provide suitable imperfection sites for adsorption of nitrates to promote SCC, and at 
a very low level of bulk carbon content such segregation was not attainable. Parkins 
(1969) discussed the role of carbon and/or nitrogen segregation to grain boundaries in 
SCC of carbon steels and found experimentally that about 0.01% C was required to 
cause SCC in nitrate or caustic environments, which is above the room temperature 
solubility limit. Phosphorus segregation has been shown by several authors to promote 
intergranular SCC of low alloy steels (Cr-Mo or Ni-Cr-Mo-V) in caustic or water 
environments at relatively oxidizing potentials (Harrison et al 1977; Burnstein & 
Woodward 1983; Bandyopadhyay & Briant 1983). Phosphorus segregation has also 
been identified as a contributory factor in the intergranular SCC of austenitic stainless 
steels containing less than 0.002% carbon (Jones 1985) and also in nickel alloy 600 
(Guttman et al 1981). Grain-boundary segregation of impurities is deemed responsible 
for the intergranular SCC of pure iron (Parkins 1990). 

In transgranular SCC, alloying effect on slip planarity is a major factor. A number of 
crack growth models have been proposed based on the planar slip-localized corrosion 
processes. Planar slip occurs in alloys with low stacking-fault energy, alloys containing 
ordered phases, or alloys exhibiting short or long range ordering. Addition of nickel to 
stainless steels (Pickering & Swann 1963) and of manganese to copper (Chatterjee et al 
1970) has been reported to produce planar arrays of dislocations by lowering the 
stacking-fault energy, and consequently a susceptibility to transgranular SCC. Alloys, 
on the other hand, are prone to dealloying. It has been suggested that the dealloyed 
layer acts as a cleavage crack initiator in brass, copper, gold and stainless steels 
(Sieradzki & Newman 1987). 


5. Electrochemical aspects of SCC 

A number of environmental parameters influence stress corrosion cracking 
susceptibility, both in terms of crack initiation and crack growth, as well as the cracking 
mode in an alloy. Important among these parameters are temperature, nature of solute 
species and its concentration, pH and the electrochemical potential. Although the 
notion of strict ‘specificity’ of an environment is no longer considered valid, only a few 
environments can induce SCC in any particular alloy system, and the susceptibility 
varies significantly with potential and pH. 

In his pioneering work, Mattsson (1961) showed the dependence of the cracking 
time of alpha brass on the solution pH. The results were discussed in the background of 
the potential-pH diagram for the Cu-H2O-NH3 system. The formation of an oxide film 
(Cu2O) and its rupture providing an active dissolution front were thought to be 
essential requirements for the SCC in this system. The SCC of brass has been shown to 
be potential dependent (figure 8) with cracking mode changing over from intergranular 
to transgranular beyond a certain value of applied anodic potential (Sircar et al 1972). 
For SCC of mild steel in various environments, potential-pH conditions for severe 
cracking susceptibility have been identified (Congleton et al 1985) which also coincide 
with the regions of thermodynamically stable protective films (figure 9). 

For an inherently reactive metal like mild steel most of the exposed surface will 
remain inactive if the surface is passivated. Nitrates, hydroxides and carbonates, which 
are anodic inhibitors of corrosion of carbon steels in appropriate circumstances, have 
the characteristic of passivating the surface. Along with this inactivity of most of the 
surface, the environment must also permit the formation of soluble species at the crack 
tip, so that dissolution and hence crack propagation can occur. Both these 
conditions call for a border-line passivity. The ranges of potential within which 
cracking occurs are therefore expected to coincide with those ranges within which 
soluble species form. Such ranges are just above the active-passive potential zone in the 
anodic polarisation curves. For austenitic stainless steels in chloride media the 
transgranular SCC takes place at potentials around the breakdown potential and the 
crack initiation is usually associated with pit formation. These ranges are shown 
schematically in figure 10. 

Figure 8. Effect of applied potential (mV, SCE scale) on the time-to-fracture of 
α-brass in Mattsson’s solutions of pH 7 containing 1 g mol/l NH4+ and different 
compositions of dissolved copper. 

The effects of additions in causing the free corrosion potential to move in relation to 
the potential range for cracking have considerable practical significance. The addition 
of small amounts of nitrates to concentrated NaOH causes the free corrosion potential 
to become significantly more positive than the cracking potential in alkali, and failure is 
no longer observed. The free corrosion potential of some mild steels is likely to be at the 
boundary of cracking range in hydroxide so that the stress corrosion does not occur. 
The addition of a small amount of lead oxide causes the free corrosion potential to 
move into the cracking range and the failure then occurs readily at these potentials. The 
potential and pH conditions inside the crack are not the same as those in the bulk of the 
solution or at the surface of the alloy and so a correlation may not always be possible 
with the potential-pH diagram. However, it is reasonable to believe that the conditions 
prevailing inside the crack essentially meet the electrochemical requirements presented 
above for SCC to occur. 

Figure 9. Relationship between pH/potential conditions for severe cracking 
susceptibility of mild steel in various environments and the stability regions for solid 
and dissolved species on the potential-pH diagram. Note that severe susceptibility is 
encountered where a protective film (phosphate, mixed carbonate, magnetite etc.) is 
thermodynamically stable, but if ruptured, a soluble species (Fe2+, HFeO2-) is 
metastable. 

Figure 10. Schematic anodic polarization curve showing zones of susceptibility to stress 
corrosion cracking. 

If stress corrosion crack propagation involves dissolution at the crack tip, then 
crack velocities may be expected to be related through Faraday’s law to the current 
density at the crack tip according to the equation, 

v = i_a M / (z F ρ) 

where v is the crack velocity, i_a is the anodic current density, M is the atomic (molar) 
mass of the metal, z is the valency of the solvated ions, F is Faraday’s constant and ρ is 
the density of the metal. Taking the effective current density as the largest difference 
between fast and slow sweep rate polarisation curves, or the maximum current densities 
observed in scraping or straining electrode experiments at appropriate potentials, a plot 
of these current densities against calculated crack velocities for a variety of stress 
corrosion systems yields a straight line (Parkins 1979). In some cases the correlation is 
good, which corroborates the mechanism postulating both initiation and propagation of 
cracks through dissolution. Deviations can be accounted for by attributing to dissolution 
a role in crack initiation only, which then triggers a mechanical fracture propagation step. 
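As a worked illustration of the Faraday relation between crack-tip current density and crack velocity, the following sketch evaluates v = i_a M / (z F ρ) in SI units. The function name and the current density of 1 A/cm^2 are assumptions made only for the example; the atomic mass and density are nominal values for iron, and a valency of 2 is assumed.

```python
FARADAY = 96485.0  # Faraday's constant, C/mol (rounded)


def faraday_crack_velocity(current_density, molar_mass, valency, density):
    """Crack velocity (m/s) from v = i_a * M / (z * F * rho).

    current_density : A/m^2 (anodic current density at the crack tip)
    molar_mass      : kg/mol
    valency         : number of electrons per dissolved atom
    density         : kg/m^3
    """
    if valency <= 0 or density <= 0:
        raise ValueError("valency and density must be positive")
    return current_density * molar_mass / (valency * FARADAY * density)


# Illustrative case: iron dissolving as Fe2+ at an assumed 1 A/cm^2.
v_iron = faraday_crack_velocity(
    current_density=1.0e4,   # 1 A/cm^2 expressed in A/m^2 (assumed value)
    molar_mass=55.845e-3,    # kg/mol, iron
    valency=2,               # Fe -> Fe2+ (assumption)
    density=7870.0,          # kg/m^3, nominal for iron
)
print(f"dissolution-limited crack velocity: {v_iron:.2e} m/s")
```

Under these assumptions the velocity comes out in the range of a few times 10^-7 m/s. Comparing such a calculated value with a measured crack velocity is the kind of check that the current density versus velocity plot in the text represents.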

‘Specificity’ of environment also can be explained from the electrochemical 
standpoint. In very general terms, it is clear that potent solutions will need to promote 
a critical balance between activity and passivity, since a highly active condition will 
result in general corrosion or pitting, while a completely passive condition cannot, by 
definition, lead to stress corrosion. Whilst the relative inactivity of all exposed surfaces 
except the crack tip may be derived from a noble metal film in the case of alloys 
containing sufficiently noble elements, for the great majority of engineering alloys 
inactivity at exposed surfaces is the result of the presence of oxide films overlaying 
metal surfaces. It is not surprising therefore to find that the alloys of high inherent 
corrosion resistance (such as those based upon aluminium or titanium, or the austenitic 
stainless steels, that readily develop protective films) require an aggressive ion, such as 
a halide, to promote stress corrosion cracking. On the other hand, to crack the metals of 
low inherent corrosion resistance, such as carbon steels or magnesium base alloys, we 
require the presence of an environment that is itself partially passivating. Thus, carbon 
steels can be made to fail in solutions of anodic inhibitors, such as hydroxides or 
carbonates, and the cracking of magnesium-base alloys is achieved with an appropriate 
mixture of CrO4^2- and Cl^- ions, but not with either of these species alone. 


6. Mechanisms of SCC 

Any mechanism of stress corrosion cracking should explain the characteristic features of 
the process, viz. the susceptibility of an alloy in a limited number of environments, the 
role of stress and the nature of cracking, apart from the influence of various 
metallurgical and environmental factors in the cracking process. Since stress corrosion 
cracking essentially involves a corrosive environment, mechanisms based on localized 
dissolution of metal have been postulated. On the other hand, the essential brittle fracture 
feature of stress corrosion cracking has led to the postulation of mechanisms based on 
cleavage. There are several variations of these two basic mechanisms. According to 
some models, the so-called continuous electrochemical mechanisms, the initiation and 
propagation of cracks are both dissolution-controlled. According to the periodic 
electrochemical-mechanical mechanism, the initiation of cracks or the embrittlement 
of the region ahead of the crack tip is dissolution-controlled while the 
propagation is a mechanical brittle fracture step. There is a third group, where the role 
of dissolution is de-emphasized in the embrittlement process. 

6.1 Dissolution mechanisms 

According to this mechanism a crack advances by preferential dissolution at its tip. The 
restriction of the dissolution process at the crack tip has been attributed to either 
a pre-existing active path or to strain-generated active path. 

The pre-existing active path mechanism: This is the oldest mechanism proposed (Dix 
1940). Grain-boundary segregation or precipitation, as discussed in the previous 
section, brings about substantial change in the microchemistry of the grain boundary 
with respect to the bulk alloy. These segregated solutes or precipitated phases may act 
as anode in the local cell or, by acting as an efficient cathode, may cause the dissolution 
to be localized upon the immediately adjacent matrix. The role of stress here is to keep 
the crack open for the accessibility of the corrosive medium to the crack tip. The stress 
concentration at the crack tip producing yielding has also been considered to accelerate 
dissolution due to the so called ‘mechanochemical effect’ (Hoar & West 1962). 
Intergranular stress corrosion cracking (IGSCC) of sensitized stainless steel in various 
environments, including high purity water, of carbon steels exposed to nitrate, 
hydroxide or carbonate-bicarbonate solutions and of aluminium alloys exposed to chloride 
solutions have been explained in these terms. However, a propensity for intergranular 
corrosion is no guarantee for IGSCC. It has been shown that though the nickel-base 
superalloy IN718 and Ticolloy show intergranular corrosion in NaCl solution, in 
deaerated solutions they may crack transgranularly (Wang et al 1992). 

Strain-generated active path mechanism: The SCC in alloys which do not have a 
pre-existing path has been explained by dissolution of a strain-generated active path, and 
there are two distinct models: the film-rupture model (Logan 1952) and the slip-step 
dissolution model (Vermilyea 1969). According to the film-rupture model, the stress 
opens up the crack and the localized plastic deformation at the crack tip ruptures the 
passivating film, exposing the bare metal which dissolves rapidly, resulting in crack 
extension. The crack tip remains bare if the rate of film rupture at the crack tip exceeds 
the rate of repassivation. If the situation is reversed, the crack tip repassivates completely 
and is periodically ruptured by the emergence of slip steps, and the bare slip-step 
sustains dissolution till the next repassivation. 

Since many of the stress corroding systems are associated with film formation and 
the SCC occurs under some sort of borderline passivity condition (discussed earlier), 
most investigators now agree that film rupture is essential to initiate cracking. The 
emerging slip-steps at the surface as a result of deformation also provide an explanation 
for transgranular crack initiation. However, controversy persists with regard to crack 
propagation by continued dissolution. For example, transgranular cracking would be 
expected to grow on the active slip plane, where deformation would cause continual 
film rupture, contrary to the experimental fact that cracks grow on planes of the type 
{100} or {110} (Liu et al 1980). Also, a crack growing by electrochemical dissolution at 
an unfilmed crack tip should leave a relatively smooth, featureless surface, but 
crystallographic cleavage, often with arrest marks (for transgranular SCC), and well-defined 
grain boundary (for intergranular SCC) features are often present on fracture surfaces. 
Although the slip-step dissolution model accounts for the presence of arrest marks, it 
fails to account for the matching opposite fracture surfaces. 

A corrosion tunnel model (Swann & Pickering 1963) has emerged where dissolution and 
mechanical fracture have been combined. It assumes that a fine array of small corrosion 
tunnels form at the emerging slip steps. These tunnels grow in diameter and length until 
the stress in the remaining ligaments causes ductile deformation and fracture. However, 
such a model should produce a grooved fracture surface with evidence of microvoid 
coalescence at the broken ligaments, which has not been observed experimentally. So 
this model has been modified subsequently (Silcock & Swann 1979) by suggesting that 
the application of a tensile stress results in a change in the morphology of corrosion 
damage from tunnels to thin, flat slots. The width of the corrosion slots approaches 
atomic dimensions and a close correspondence of matching surfaces is expected. 

6.2 Cleavage mechanisms 

Several mechanisms have been postulated to explain the cleavage type cracking 
encountered in SCC. These are: hydrogen-assisted cracking, tarnish-rupture, film induced 
cleavage, adsorption induced cleavage and atomic surface mobility mechanisms. 

Hydrogen assisted cracking is very often described by the decohesion model in which 
atomically dissolved hydrogen acts to weaken the interatomic bonds in the plane-strain 
region of the crack tip by lowering the surface energy γ_s in the Griffith equation, 

σ_c = (2Eγ_s/πc)^{1/2}, 

where σ_c is the fracture stress necessary to cause the propagation of an elliptical crack of 
length 2c and E is the Young’s modulus. Blocked glide planes have been considered to 
provide the initial Griffith cracks (Petch & Stables 1952). It has also been suggested (Jani 
et al 1990) that hydrogen decreases the stacking fault energy to induce coplanar 
deformation at the tip. Embrittlement would then result from Lomer-Cottrell supersessile 
dislocations on the intersecting slip planes. A somewhat similar model has been 
presented to explain the hydrogen assisted cracking in Fe-3Si single crystals and the 
observed crack arrest marks (Chen & Gerberich 1991). The embrittlement of Nb, Ta, 
Ti, V and Zr is due to the stress-assisted formation of a brittle hydride phase ahead of the 
crack tip which facilitates crack growth by cleavage, with cracks arresting at the 
boundary where the relatively tough matrix is encountered. Another particle of hydride 
then forms in the region of the crack tip and the process is repeated, resulting in 
discontinuous crack growth. 
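The effect of a reduced surface energy on the Griffith fracture stress can be shown with a small numerical sketch. It evaluates σ_c = (2Eγ_s/πc)^{1/2} for a reference surface energy and for a surface energy halved by hydrogen. The function name and all input values (modulus, surface energy, crack half-length, and the factor of two reduction) are hypothetical and serve only to illustrate the square-root dependence.

```python
import math


def griffith_fracture_stress(youngs_modulus, surface_energy, half_crack_length):
    """Griffith fracture stress sigma_c = sqrt(2*E*gamma_s / (pi*c)).

    youngs_modulus    : Pa
    surface_energy    : J/m^2 (gamma_s)
    half_crack_length : m (c, for a crack of total length 2c)
    """
    if half_crack_length <= 0:
        raise ValueError("half crack length must be positive")
    return math.sqrt(
        2.0 * youngs_modulus * surface_energy / (math.pi * half_crack_length)
    )


# Illustrative inputs (assumed, not taken from the text).
E = 200e9        # Pa
GAMMA_S = 2.0    # J/m^2, without hydrogen
C = 1e-6         # m

sigma_clean = griffith_fracture_stress(E, GAMMA_S, C)
sigma_hydrogen = griffith_fracture_stress(E, GAMMA_S / 2.0, C)  # assumed halving

print(f"sigma_c without hydrogen: {sigma_clean / 1e6:.0f} MPa")
print(f"sigma_c with halved gamma_s: {sigma_hydrogen / 1e6:.0f} MPa")
print(f"ratio: {sigma_hydrogen / sigma_clean:.3f}")
```

Because σ_c scales with the square root of γ_s, halving the surface energy lowers the fracture stress by a factor of about 0.71 rather than 0.5.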

Hydrogen assisted crack growth in the manner described has been suggested as the 
SCC mechanism for ferritic steels, nickel base alloys, austenitic stainless steels, aluminium 
alloys and intermetallic compounds. The most probable source for atomic hydrogen to 
enter into the metal is the cathodic reduction of hydrogen ions accompanying the 
anodic dissolution in the aqueous environments. Materials with impurities like 
sulphur, phosphorus, antimony and tin segregated at grain boundaries have been found to 
be more susceptible to hydrogen-induced cracking, because these impurities act as 
hydrogen-evolution poisons (Latanision & Opperhauser 1974). On the other hand, the 
growing stress corrosion cracks have been stopped effectively in low-strength ductile 
alloys by cathodic polarisation, refuting the validity of hydrogen-assisted SCC in these 
alloys. Again, all the environments and conditions under which SCC is encountered do 
not produce hydrogen and some of them produce surface films which constitute an 
effective barrier to hydrogen entry (Hardie 1990). These restrict the universal 
applicability of hydrogen-induced cracking as an SCC mechanism. 

The adsorption-induced cleavage mechanism or the stress-sorption cracking mechanism 
(Uhlig 1959) is based on the hypothesis that adsorption of environmental species lowers 
the interatomic bond strength and the stress required for cleavage fracture, an idea 
similar to the decohesion model of hydrogen-induced cracking. Adsorption is assumed 
to be potential dependent which accounts for the stoppage of SCC by cathodic 
polarisation below a critical potential. The specificity of the species for inducing SCC in 
a particular alloy can also be conveniently explained in terms of preferential 
adsorption. However, this model does not explain how the crack maintains an atomically 
sharp tip in a normally ductile material and it also fails to explain the discontinuous 
nature of crack propagation (Parkins 1990). 

The tarnish-rupture mechanism was originally proposed to explain discontinuous 
transgranular crack growth of α-brass single crystals in ammoniacal solutions 
(Edeleanu & Forty 1960; Forty & Humble 1963). According to this model, a brittle 
surface film or tarnish forms on the metal which fractures under the applied stress. The 
exposed bare metal reacts rapidly with the environment, the film grows again and the 
cycle of growth and fracture of the tarnish is repeated. The model was modified 
subsequently (McEvily & Bond 1965; Beavers et al 1972) for intergranular SCC 
proposing that the tarnish penetrates along the grain boundary ahead of the crack tip. 
At present, experimental results are insufficient to confirm or refute this mechanism, 
though some workers (Pinchback et al 1976) have questioned the validity of 
grain-boundary penetration in all systems. 

The film-induced cleavage mechanism developed as a consequence of the 
tarnish-rupture model (Pugh 1985; Sieradzki & Newman 1985). In this model it is assumed that 
dissolution leads to the formation of a surface film and a crack growing in the film 
propagates further into the underlying metal. From an initial arrest position, the crack 
advances by cleavage for a limited distance, after which the crack becomes progressively 
blunted by plastic deformation until the propagation stage is repeated. The film may be 
a dealloyed layer or oxide, in which nanoporosity is considered to be the key feature 
leading to the injection of a sharp crack into the substrate. Experimental results showing 
the correlation of SCC with dealloying have been given for Cu-Zn and Cu-Al alloys 
(Sieradzki et al 1987), and also for stainless steel (Newman et al 1989). Certain objections 
have been put forward against this mechanism. The crack advance distance as measured 
from arrest marks is typically of the order of a few micrometers and doubts have been raised 
(Gerberich & Chen 1990) as to whether a cleavage crack could penetrate that far from an 
initiating film that is only of the order of 30 nm thick. Dealloyed layers are also unlikely to 
have sufficient adhesion to the substrate or inherent brittleness to sustain cracking. 

The atomic surface mobility mechanism (Galvele 1987) suggests that many forms of 
environmentally induced cracking, including hydrogen embrittlement, SCC and 
liquid-metal embrittlement, grow by the capture of surface vacancies at the crack tip 
and counter-current surface diffusion of atoms away from the crack tip. The coefficient 
of surface self-diffusion will dominate the crack growth rate and the role of the 
environment is to change that diffusivity, which is directly related to the presence of 
low melting surface compounds. However, it has been pointed out (Parkins 1990) that 
although carbon steel cracks in nitrates which form low-melting compounds with iron, 
it also cracks in the presence of high melting point Fe3O4. Oriani (1990) puts an 
opposite viewpoint by suggesting that the flux of atoms in the region of a stressed crack 
tip should be the reverse of what the mechanism requires, by analogy with diffusional 
creep. It has also been argued (Newman & Procter 1990) that many of the correlations 
postulated in this mechanism would work equally well if the criterion were surface 
energy reduction, rather than increase in diffusivity, so that it becomes 
indistinguishable from earlier surface energy models. 

The proposed mechanisms of SCC thus differ immensely from one another, some
suggesting an exclusive role of dissolution while others give dissolution a
marginal or no role at all in the cracking process. While the proponents of a particular
mechanism have produced enough evidence in support of the mechanism, observations 
not conforming to the mechanism have also been cited in plenty. Though there have been 
attempts from time to time to find a unified theory explaining all features of SCC in all the 
alloy-environment systems, it appears more reasonable to believe that the mechanism 
varies from system to system. It is also probable that no particular mechanism operates 
exclusively in one system but that more than one mechanism is at play. Decades ago, 
Parkins (1976) proposed that the different mechanisms of stress corrosion should be 
considered as occurring within a continuous spectrum, with a gradual transition from one 
to the other as the dominance of corrosive processes is replaced by stress or strain leading to 
a brittle fracture (table 2). The picture appears to have remained unchanged. 


7. Preventive measures 

Control of stress : Lowering of applied stress below threshold stress or stress intensity 
by suitable changes in design and lowering or removal of residual stress by annealing
treatment are effective means to reduce the incidence of SCC in service. The seasonal
cracking of brass cartridge cases was prevented by proper stress relief annealing 
treatment. It is recommended practice to give such a treatment to brass tubes after 
drawing or to steel tubes after welding for use in sour oil wells. However, annealing may 
be impractical for some stainless steels which sensitize and become susceptible to 
intergranular attack. 

Since a tensile component of stress is required in SCC, it can be prevented by putting 
the surface of a component into compression, e.g. by shot-peening. The treatment 
needs to be applied uniformly. It will not be effective if pitting occurs on the 
compressive layer. 

Control of corrosion : The elimination or reduction of the damaging species is 
desirable, but often difficult to achieve in practice. An initially non-damaging
concentration of the species may become high in crevices or under high-temperature
conditions, bringing about failures. However, the prevention of intergranular SCC of
sensitized stainless steel, a recurring and expensive problem in cooling-water piping of 
boiling water nuclear power plants, has been achieved by minimizing both dissolved 
oxygen and chloride (Williams 1959). 

Use of inhibitors reduces or even eliminates SCC, possibly by moving the corrosion 
potential outside the range of cracking. The addition of small amounts of nitrates to 
concentrated NaOH prevents SCC of steel, as discussed in section 4. Substances like 
H 3 P0 2 , Na 2 0 4 and CO(NH 2 ) 2 , which may be expected to form insoluble products 
with iron, retard or prevent cracking in nitrates. Addition of traces of NO to N2O4, or of
1-2% H2O to RFNA or CH3OH/HCl mixtures, prevents SCC of titanium alloys in
these media (Blackburn et al 1972). The addition of water causes anodic inhibition by
shifting the potential to the safe passive potential range. However, there are some 
practical limitations for the use of inhibitors. Many failures occur in steam or under 
condensation conditions, and in both these cases the transport of inhibitors to sites of 
crack initiation is not feasible. 

Increasing the corrosion rate to reduce SCC might appear to be a ridiculous
proposition. But since SCC is a form of highly localized corrosion, extending corrosion
over the whole of the surface will usually lessen the probability of such failures. This
approach is unlikely ever to be a permanent remedy. It is employed in making up
mixtures containing HCl to clean austenitic stainless steel parts in chemical plants; the
corrosion rate is maintained above 10 mpy (Laque & Copson 1965).

Electrochemical protection : Cathodic protection will control SCC in alloys which 
crack by anodic dissolution mechanism, but is likely to accelerate hydrogen-induced 
cracking, particularly in high strength alloys. SCC failure of Kraft continuous digesters 
in the pulp and paper industry in a NaOH-Na2S environment at 140°C has been reported
(Singbell & Garner 1987) to have been mitigated by the application of anodic 
protection. 

Material selection : Choosing a different alloy resistant to the particular environment 
is a popular option to prevent SCC. An alloy having the lowest plateau velocity, as 
discussed in section 3, can be chosen from among a number of susceptible alloys. 
Mitigation of SCC by alloy development has been a rare endeavour. However, it is 
reported (Newman & Procter 1990) that a relatively inexpensive stainless steel has been
developed by Japanese workers which resists SCC up to 140°C in crevice tests with
20% NaCl in which 304 stainless steel will fail at 60°C. The addition of copper raises the 
lower critical potential for SCC. Minimisation of phosphorus content in austenitic 
stainless steels is also a key approach to make them resistant to chloride induced SCC. 

The cladding of high strength aluminium alloys with pure aluminium to mitigate 
SCC in aircraft components has been mentioned in § 1. The relatively low susceptibility 
of pure metals to SCC has been utilized in this preventive measure. 


Liberal help has been taken from the review articles of Scully (1979), Newman 
& Procter (1990), Parkins (1992), Jones & Ricker (1992) and the text book by Jones 
(1992) in the preparation of this article.

Abstract. Physically-based creep Continuum Damage Mechanics (CDM)
is briefly reviewed to introduce the formalism necessary for mathematical 
modelling of creep and fracture in the presence of a chemically-interacting 
fluid environment. A recently developed theory of creep in precipitation- 
hardened alloys is presented in the form of a constitutive equation and two 
important creep/environment interactions are discussed in detail. Spallation
of oxide scales during creep of a low-alloy ferritic steel is one and is
modelled using the new creep constitutive equation which incorporates
particle-ageing as the principal intrinsic damage mechanism. Lack of
experimental data has meant that only qualitative support for the model
predictions could be given. Carbon dioxide gas bubble formation along
grain boundaries in nickel alloys is the other form of environmental attack
considered; a new kinetic model based upon oxygen diffusion-control is
described and shown to be in quantitative agreement with a large experimental
dataset.

Keywords. Continuum damage mechanics; creep modelling; reaction- 
front kinetics; diffusion; degradation; damage; gas bubble formation. 


1. Introduction 

Mechanical test data for creep design are mainly procured in a laboratory air 
environment, whereas materials entering service in component form are often exposed 
to oxidising, sulphidising or carburising atmospheres. The potential for material 
degradation (or ‘damage’) in these environments is an unwelcome irritant to those 
concerned with predicting long-term component performance in service using data 
from short-term laboratory tests. However, environmentally-induced degradation of 
a material is only one of many mechanisms that can adversely affect a component’s 
creep lifetime and its quantification cannot therefore be considered in isolation: the
state of stress experienced by a component and its fabrication route are two well-researched
areas that can also reduce lifetimes to well below those predicted simply on
the basis of uniaxial data. For example, material in bent pipework can be made much
less ductile (with a consequent reduction in lifetime relative to virgin material) because
of plastic strain-induced grain boundary cavitation; similarly, low ductility fractures
are induced by a combination of local changes of microstructure and triaxial stresses
created during welding. Predictive modelling of creep/environment interactions must
therefore be within an overall life prediction framework.

A list of symbols is given at the end of the paper.

The present paper first presents a brief state-of-the-art review of physically-based 
creep Continuum Damage Mechanics (CDM). This mechanics has evolved over the 
past decade to deal quantitatively with design and remanent life prediction in situations 
where a variety of material damage mechanisms can control failure. The damage 
equations have been incorporated individually into a new constitutive description of 
creep that is relevant to nickel-base superalloys, ferritic steels and aluminium alloys. 
A multi-damage version is then used to advance further the quantification of two 
important creep/environment interactions. 

2. Mechanics framework for creep lifetime prediction 

In an effort to ‘build bridges’ between engineers engaged in lifetime prediction and 
metal scientists endeavouring to understand the underlying micromechanisms of 
material degradation, Ashby & Dyson (1984) proposed what has become known as 
physically-based creep Continuum Damage Mechanics (CDM). Four broad categories 
of material-damage were identified, each containing more than one micromechanism: 
(i) loss of external load-bearing section due to deformation at constant volume; (ii) loss 
of effective internal load-bearing section due to voiding or cracking; (iii) a progressive 
softening caused by degradation of the dislocation and/or particulate microstructure; 
(iv) a surface weakening due to chemical interaction with the environment. A non-dimensional
‘damage parameter’, $S$, was defined for each micromechanism and its rate
of evolution and interaction with strain rate, $\dot{\varepsilon}$, quantified. When each micromechanism
acted alone, the strain rate was found to be represented generally by

$$\dot{\varepsilon} = \dot{\varepsilon}(\sigma, T, S), \tag{1}$$

while the damage rate had one of four forms:

$$\dot{S} = \dot{S}(\sigma, T, S), \tag{2}$$

$$\dot{S} = \dot{S}(T, S), \tag{3}$$

$$\dot{S} = \dot{S}(\sigma, T, S, R), \tag{4}$$

$$\dot{S} = \dot{S}(T, S, R). \tag{5}$$

In (1)-(5), $\sigma$ is the applied uniaxial stress, $T$ is temperature and $R$ is the radius of
a cylindrical testpiece.

Only (3) - describing damage due to coarsening of particles - and (5) - describing 
damage due to internal oxidation - are independent of deformation rate. Equations (4) 
and (5) describe damage due to chemical interactions with the environment and thus 
depend on radius R - the larger the testpiece, the smaller the interaction. 





Table 1. Creep damage categories, mechanisms, parameters, rates and effects on uniaxial strain rates using a hyperbolic sine law material.

Ashby & Dyson (1984) used a material model with a power law stress function to
formulate explicit pairs of equations consisting of (1) combined with various appropriate
forms of the damage equations (2)-(5). Over the past decade, several developments
have taken place: a recognition of the importance of a two-damage formulation for 
particle-strengthened engineering alloys, even under constant applied stress (Dyson 
& Gibbons 1987); a multiaxial formulation for use in finite element analyses of 
components (Othman et al 1993); a methodology (CRISPEN) for extracting model 
parameters to fit certain equation-sets (Ion et al 1986; Barbosa et al 1988); and 
development of a new material model for precipitation-hardened materials that is 
based on a hyperbolic sine stress function (Dyson 1994). 

A hyperbolic sine stress function rationalises the stress dependent ‘n’ values widely 
reported in the literature on precipitation-hardened engineering alloys. In addition, the 
new theory has suggested a totally different interaction between particle-coarsening 
and strain rate which has been validated using creep data obtained on virgin and aged 
1Cr1/2Mo steel (Dyson & Osgerby 1993). Table 1 summarises the state of the art for
single damage CDM models using the hyperbolic sine formulation.

In the light of the above brief review of the status of CDM, the next section presents 
some new information on creep modelling in the presence of damage due to interaction 
with oxygen in the surrounding environment. 


3. Modelling of creep/oxygen-environment attack

Exposure of pure metals or solid solution alloys to air during creep often results in 
lower creep rates and longer lives compared to data obtained in vacuum. The reason is
simply the higher creep strength and greater volume of the surface oxide, and the effect is
facilitated by a combination of a weak alloy with a fast-growing oxide. In contrast,
ferritic steels and nickel-base superalloys often display creep lifetimes in air that are
much shorter than in vacuum or inert atmosphere. Two quite different micromechanisms
have been identified as being responsible: (i) in low-alloy ferritic steels,
oxides have been observed to spall, reducing the load-bearing section and accelerating 
creep (Cane & Manning 1981); (ii) in nickel-base superalloys, oxygen penetrates into 
the material during heat treatment and creates a surface zone of grain boundary 
cavitation which manifests itself in much reduced ductilities and lifetimes during 
subsequent creep at lower temperatures (Pandey et al 1984). 

3.1 Modelling damage due to oxygen penetration 

The mechanism of degradation when oxygen penetrates into a nickel-base material at 
high temperatures is believed to be carbon dioxide bubble formation along grain 
boundaries. The experiments of Bricknell & Woodford (1982) demonstrated conclusively
that carbon dioxide gas bubbles were formed along grain boundaries of dilute
nickel-carbon alloys when exposed to air at temperatures of the order of 1000°C. Their 
presence seriously reduced ductilities in subsequent creep at lower temperatures. Much 
evidence has been accumulated by Bricknell & Woodford (1982), Pandey et al (1984) 
and Osgerby & Dyson (1990a) that points to a similar effect occurring in nickel-base
superalloys: creep ductilities are reduced even more dramatically than in nickel-carbon
alloys, but the evidence for a carbon dioxide gas reaction, although strong,
remains largely circumstantial since bubbles have never been found after heat treatment
alone. Carbon monoxide formation has been detected by mass spectroscopy in one 
system after pre-exposure and creep (Pandey et al 1992). 

One of the major findings of Pandey et al (1984), that under certain conditions (higher
temperatures) oxygen is able to penetrate into the material more easily under reduced
oxygen potentials, has now been substantiated by other investigations (Pandey et al
1986; Osgerby & Dyson 1990b). This finding has focussed attention on the need for
a better understanding of the kinetics of carbon dioxide formation in addition to the
kinetics of formation of barrier oxides such as alumina and chromia. CDM also
requires a quantitative damage evolution rate which necessitates better knowledge of
the kinetics of formation of both CO2 gas bubbles and oxide scales under reduced
pressure. Kinetic data on carbon dioxide gas bubble formation are sparse but Iacocca
& Woodford (1988) report an activation energy for the rate of advance of the bubble
interface that is close to that for volume self-diffusion in nickel, $D_v$. Oxygen
diffuses interstitially in nickel with an activation energy approximately 0.6 times that of $D_v$
(Park & Alstetter 1988); the implication must therefore be that more than one
activation process is important in determining the rate of advance of the reaction-front.
The next section proposes a kinetic model for CO2 gas bubble formation and
exemplifies its potential using data from a nickel-carbon alloy.


3.1a New kinetic model for CO2 gas bubble formation: Consider figure 1, which illustrates
the planar concentration profile of dissolved oxygen in nickel containing a
surface film of NiO that maintains a concentration $C'_{[O]}$ for all time, $t$. The reaction-front
occurs at a distance $d$ below the surface at an oxygen concentration of $C_{[O],R}$. If the
diffusivity of oxygen in nickel, $D_{[O]}$, is reasonably assumed to be independent of
distance $x$, then the oxygen profile as a function of $x$ and $t$ is given in standard
treatments (Crank 1956) as

$$C_{[O]} = C'_{[O]}\,\mathrm{erfc}\left[x/2(D_{[O]}t)^{1/2}\right] \tag{6}$$

and therefore

$$C_{[O],R} = C'_{[O]}\,\mathrm{erfc}\left[d/2(D_{[O]}t)^{1/2}\right]. \tag{7}$$

When the ratio $C_{[O],R}/C'_{[O]}$ is independent of temperature, then $D_{[O]} \propto d^2/t$ and
the activation energy of the reaction-front kinetics becomes equal to that for diffusion.
This will not be true in general since $C_{[O],R}$ and $C'_{[O]}$ are both functions of temperature.

Figure 1. Schematic representation of the oxygen concentration profile in nickel ahead of a surface layer of NiO.

$C'_{[O]}$ is maintained at a level proportional to the square root of the dissociation
pressure $P_{(O_2)}$ of NiO and is given by

$$C'_{[O]} = K(T)\exp\left[\Delta\mu^{\circ}_{\langle NiO\rangle}/RT\right], \tag{8}$$

where $K(T)$ is the equilibrium constant for solution of oxygen in nickel and $\Delta\mu^{\circ}_{\langle NiO\rangle}$ is
the standard free energy of formation of NiO.

The nucleation condition for CO2 bubble formation is reached when $P_{(CO_2)} = 2\gamma/r_c = P^*$,
where $\gamma$ is the surface tension and $r_c$ the critical bubble radius. Consideration of the CO2
reaction kinetics at the bubble/matrix interface gives

$$C_{[O],R} = K(T)\left[P^*/a_{[C]}\right]^{1/2}\exp\left[\Delta\mu^{\circ}_{(CO_2)}/2RT\right], \tag{9}$$

where $a_{[C]}$ is the activity of carbon in nickel and $\Delta\mu^{\circ}_{(CO_2)}$ is the standard free energy of
formation of CO2.

The ratio $C_{[O],R}/C'_{[O]}$ can now be calculated using (7)-(9):

$$\frac{C_{[O],R}}{C'_{[O]}} = \left[\frac{P^*}{a_{[C]}}\right]^{1/2}\exp\left[\frac{\Delta\mu^{\circ}_{(CO_2)} - 2\Delta\mu^{\circ}_{\langle NiO\rangle}}{2RT}\right] = \mathrm{erfc}\left[\frac{d}{2(D_{[O]}t)^{1/2}}\right]. \tag{10}$$

3.1b Application to a nickel-carbon alloy: Kubaschewski & Alcock (1979) have
tabulated equations for $\Delta\mu^{\circ}_{(CO_2)}$ and $\Delta\mu^{\circ}_{\langle NiO\rangle}$:

$$\Delta\mu^{\circ}_{(CO_2)} = -[3.96 \times 10^{5} + 0.84T]\ \mathrm{J\,mol^{-1}},$$

$$\Delta\mu^{\circ}_{\langle NiO\rangle} = -[2.35 \times 10^{5} - 85.6T]\ \mathrm{J\,mol^{-1}}.$$

Substituting into (10) gives

$$\mathrm{erfc}\left[d/2(D_{[O]}t)^{1/2}\right] = 3.55 \times 10^{-5}\left[P^*/a_{[C]}\right]^{1/2}\exp(4450/T). \tag{11}$$

The activity coefficient for carbon in nickel, $\gamma_C$, has been derived previously (Dyson
1982):

$$\gamma_C = 2\exp[4763/T]. \tag{12}$$

Substituting (12) into (11) gives

$$\mathrm{erfc}\left[d/2(D_{[O]}t)^{1/2}\right] = 2.5 \times 10^{-5}\left[P^*/C_{[C]}\right]^{1/2}\exp(2069/T), \tag{13}$$

where $C_{[C]}$ is the atom fraction of carbon in nickel.
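
The step from (11) to (13) can be checked by hand. The short derivation below is a sketch that assumes the usual relation $a_{[C]} = \gamma_C C_{[C]}$ between activity and atom fraction, which the text implies but does not write out.

```latex
\begin{align*}
\left[\frac{P^*}{a_{[C]}}\right]^{1/2}
  &= \left[\frac{P^*}{\gamma_C\,C_{[C]}}\right]^{1/2}
   = \left[\frac{P^*}{C_{[C]}}\right]^{1/2} 2^{-1/2}\exp\!\left(-\frac{4763}{2T}\right),\\[4pt]
3.55\times10^{-5}\times 2^{-1/2} &\approx 2.5\times10^{-5},\\[4pt]
4450 - \tfrac{1}{2}(4763) &= 2068.5 \approx 2069 .
\end{align*}
```

Both the prefactor and the exponent of (13) are recovered, so the only physical input added at this step is the activity coefficient (12).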

Homogeneous nucleation of carbon dioxide gas bubbles in nickel requires pressures
of the order of $10^{5}$ atmospheres and is an unlikely event (Dyson 1982). We will assume
$P^*$ to be one tenth of this value, implying heterogeneous nucleation at sites having
defects a few nanometres in radius. This is consistent with gas bubbles lying only along
grain boundaries. Substituting for $P^*$ in (13) gives

$$\mathrm{erfc}\left[d/2(D_{[O]}t)^{1/2}\right] = 2.5 \times 10^{-3}\left[C_{[C]}\right]^{-1/2}\exp(2069/T). \tag{14}$$

The experiments of Iacocca & Woodford (1988) used an alloy containing 135 ppm of
carbon by weight, i.e. $C_{[C]} = 6.6 \times 10^{-4}$. Thus, from (14),

$$\mathrm{erfc}\left[d/2(D_{[O]}t)^{1/2}\right] = 9.8 \times 10^{-2}\exp(2069/T). \tag{15}$$

In order to derive an apparent activation energy, values of the parameter $d^2/t$ were
calculated as a function of temperature using tables of the complementary error function
and the equation for $D_{[O]}$ given by Park & Alstetter (1988):

$$D_{[O]} = 4.9 \times 10^{-6}\exp(-19700/T). \tag{16}$$

Values of $d^2/t$ are plotted against $1/T$ in the conventional manner in figure 2, from
which an activation energy of 211 kJ mol$^{-1}$ can be derived. This activation energy is
substantially greater than the diffusion activation energy of 163.8 kJ mol$^{-1}$ input
through (16). Figure 2 demonstrates that the frequently used practice of deducing
activation energies for oxygen diffusion from measurements on reaction-front kinetics
can be very misleading.
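
The calculation described in this subsection can be sketched numerically. The Python below is an illustration only, not the authors' procedure: it replaces the error-function tables with a bisection on `math.erfc`, assumes a temperature range of 800-1100°C (the text does not state the range used for figure 2), and assumes the diffusivity in (16) is in m$^2$ s$^{-1}$. All function names are hypothetical.

```python
import math

R_GAS = 8.314  # gas constant, J mol^-1 K^-1


def carbon_atom_fraction(weight_ppm, m_c=12.011, m_ni=58.693):
    """Convert carbon content in ppm by weight to atom fraction in nickel."""
    w = weight_ppm * 1e-6
    n_c = w / m_c
    n_ni = (1.0 - w) / m_ni
    return n_c / (n_c + n_ni)


def erfc_rhs(temp_k, c_carbon, p_star_atm=1.0e4):
    """Right-hand side of (13): 2.5e-5 [P*/C]^(1/2) exp(2069/T)."""
    return 2.5e-5 * math.sqrt(p_star_atm / c_carbon) * math.exp(2069.0 / temp_k)


def inverse_erfc(y):
    """Solve erfc(z) = y for z >= 0 by bisection (erfc decreases with z)."""
    lo, hi = 0.0, 6.0
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if math.erfc(mid) > y:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def oxygen_diffusivity(temp_k):
    """Equation (16); units ASSUMED to be m^2 s^-1 (not stated in the text)."""
    return 4.9e-6 * math.exp(-19700.0 / temp_k)


def d2_over_t(temp_k, c_carbon):
    """Reaction-front constant d^2/t = 4 z^2 D, where erfc(z) equals the RHS."""
    z = inverse_erfc(erfc_rhs(temp_k, c_carbon))
    return 4.0 * z * z * oxygen_diffusivity(temp_k)


def apparent_activation_energy(temps_k, c_carbon):
    """Least-squares slope of ln(d^2/t) against 1/T, returned in J mol^-1."""
    xs = [1.0 / t for t in temps_k]
    ys = [math.log(d2_over_t(t, c_carbon)) for t in temps_k]
    x_mean = sum(xs) / len(xs)
    y_mean = sum(ys) / len(ys)
    num = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    den = sum((x - x_mean) ** 2 for x in xs)
    return -(num / den) * R_GAS


c_carbon = carbon_atom_fraction(135.0)        # alloy of Iacocca & Woodford
temps = [1073.0 + 50.0 * i for i in range(7)]  # 800-1100 C, an assumed range
q_apparent = apparent_activation_energy(temps, c_carbon)
q_diffusion = 19700.0 * R_GAS                  # activation energy in (16)
print(f"C_[C] = {c_carbon:.2e}")
print(f"apparent Q = {q_apparent / 1000:.0f} kJ/mol, "
      f"diffusion Q = {q_diffusion / 1000:.1f} kJ/mol")
```

Under these assumptions the fitted slope comes out well above the diffusion activation energy, because the erfc argument itself rises with temperature. The exact value depends on the temperature window chosen for the fit.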

It has often been assumed (Bricknell & Woodford 1982; Iacocca & Woodford 1988) 
that oxygen diffuses faster down grain boundaries and so gives rise to the CO2 gas
bubbles found preferentially along grain boundaries. However, even a cursory inspection
of the rates of transport of substitutional solutes through grains relative to those
along grain boundaries indicates that at these high temperatures (greater than 0.7 $T_m$),
most of the flux will be transported through the grains. Dyson & Hondros (1985)
demonstrated this quantitatively and also pointed out that even when grain boundary
transport is dominant, reaction-front kinetics will obey a $t^{1/4}$ law with a low activation
energy and not the $t^{1/2}$ law with a high activation energy observed experimentally for
CO2 bubble formation. A further piece of evidence favouring mass transport of oxygen
through the grain interiors is shown in figure 3, which compares the tracer volume 
diffusion data of nickel in a Ni-20% Cr alloy (Frost & Ashby 1982) with the relatively 
recent data of oxygen in nickel by Park & Alstetter (1988). The less complete and older 
data of Kerr (1972) are also included in figure 3. It is clear that oxygen diffusivities are 
considerably greater than tracer nickel diffusivities and that oxygen diffuses with 
a lower activation energy. Thus oxygen almost certainly diffuses interstitially in nickel
and will have little or no tendency to diffuse faster along grain boundaries. The reason
for the appearance of CO2 bubbles only along grain boundaries cannot therefore be
rapid transport down the boundaries, as Woodford and co-workers have
consistently suggested, but must be a consequence of preferential nucleation. This is
consistent with the assumption of a constant CO2 pressure term, $P^*$, in §3.1a.

Figure 2. Comparison of predicted reaction-rate constant for CO2 bubble formation (solid line giving an activation energy of 211 kJ mol$^{-1}$) with experimental data.

Figure 3. Comparison of experimental diffusivities of tracer Ni in a nickel-chromium alloy with two sets of measurements of oxygen diffusion in nickel. (a) Frost & Ashby (1982), (b) Park & Alstetter (1988), (c) Kerr (1972).

The model predictions of $d^2/t$ can be compared in figure 2 with the CO2 penetration
rate data of Woodford and co-workers on nickel and the single datum point of Pandey
et al (1986) on the nickel superalloy X-750. The model predictions fall slightly below the
lower limit of the experimental dataset but the agreement must be considered satisfactory
considering the independent data input to the model and the large experimental scatter
(due in part to the use of cylindrical section thicknesses which, according to Iacocca
& Woodford (1988), were too small relative to the diffusion distance and so overestimated
$d^2/t$ because of an assumed infinite plane slab of material in their analysis).
Better agreement with the dataset could have been achieved using the (poorer pedigree) 
oxygen diffusivity data of Kerr (1972). 

3.1c Incorporation into the CDM framework : During creep, the presence of a subsurface
layer containing cavity nuclei implies pre-existing damage in the material. The
damage parameter and its effect on creep rate are identical to those given in table 
1 under the internal oxidation mechanism; in this particular case of pre-exposure to the 
environment, the damage accumulation rate is zero. Pandey et al (1986) and Osgerby 
& Dyson (1990a) showed that for the wrought Ni-base superalloy, Inconel X-750, the 
measured creep rate in testpieces exposed to air at 1000°C for various times showed 
good agreement with model predictions of penetration depth when $f$ was assigned
a value of 1/3, a value that corresponds to cavitation on every transverse grain
boundary in the environmentally damaged region. 



3.2 Modelling damage due to periodic oxide-spallation


The equation-set developed to describe uniaxial creep behaviour of low-alloy ferritic 
steels in the absence of environmental attack (Dyson & Osgerby 1993) is 


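$$\left.\begin{aligned}
\dot{\varepsilon} &= \dot{\varepsilon}_0 \sinh\left[\sigma(1-H)/\{\sigma_0(1-S)(1-\omega)\}\right],\\
\dot{H} &= (h'/\sigma)\left[1-(H/H^*)\right]\dot{\varepsilon},\\
\dot{S} &= (K'/3)(1-S)^{4},\\
\dot{\omega} &= (1/3\varepsilon_u)\,\dot{\varepsilon}.
\end{aligned}\right\} \tag{17}$$

Under the usual laboratory conditions of constant load, there is a further constraint on
the change of stress with time given by $\dot{\sigma} = \sigma\dot{\varepsilon}$.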

The model material parameters $\dot{\varepsilon}_0$, $\sigma_0$, $h'$, $H^*$, $K'$ and the uniaxial ductility $\varepsilon_u$ are
generally functions of temperature, but Dyson & Osgerby (1993) found that the
primary creep parameters $h'$ and $H^*$ could be regarded as constants without significant
change in predicted lifetimes. Two of the model parameters, $\dot{\varepsilon}_0$ and $K'$, are exponentially
related to temperature with activation energies dictated by physics to lie within well-defined
narrow bands. The parameter $\sigma_0$ is related to the volume fraction and initial
dispersion of the particles. The coarsening of the dispersion with time is given by the
Greenwood (1956), Wagner (1961) or Lifshitz-Slyozov (1961) expression for particle
coarsening, $\dot{S}$, and the variation of particle volume-fraction with temperature is
currently accounted for in an empirical way.
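
Because the coarsening term in (17) does not depend on stress or strain, it can be integrated on its own. The worked step below is a sketch that assumes the exponent on $(1-S)$ is 4, that $K'$ is constant (isothermal conditions) and that $S = 0$ at $t = 0$; the closed form is not given in the text.

```latex
\begin{align*}
\dot{S} = \frac{K'}{3}(1-S)^{4}
\;&\Rightarrow\;
\int_{0}^{S}\frac{dS}{(1-S)^{4}} = \frac{K'}{3}\,t
\;\Rightarrow\;
\frac{1}{3}\left[(1-S)^{-3}-1\right] = \frac{K'}{3}\,t,\\[4pt]
S(t) &= 1-(1+K't)^{-1/3}.
\end{align*}
```

On this reading $(1-S)^{-3}$ grows linearly with time, which has the cube-law form of the coarsening expressions cited above.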

Table 2 gives values for the intrinsic (i.e. in the absence of environmental-attack) 
model material parameters determined for a normalised and tempered 1% Cr 1/2% 
Mo ferritic steel over the temperature range 530-625°C (Dyson & Osgerby 1993). 

Equation-set (17) can be integrated numerically to give lifetimes under a variety of 
imposed external constraints. Figure 4 is an example under constant load and
temperature: two temperatures, 550°C and 625°C, are given and at each temperature,
lifetimes are predicted both with and (artificially) without the ageing term $\dot{S}$ operating.
The importance of particle ageing during the test in curtailing lifetimes, particularly 
under service conditions, is amply demonstrated in figure 4: at a typical service stress of 
40 MPa at 550°C, ageing will reduce lifetimes by more than two orders of magnitude. 
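
A minimal sketch of such a numerical integration is given below, in Python. It is not the authors' code. The parameter values are hypothetical placeholders, since the table 2 values are not reproduced in this text; the explicit Euler scheme and the failure criterion (cavitation damage reaching 1/3, i.e. strain reaching the uniaxial ductility) are assumptions made for illustration.

```python
import math


def creep_lifetime(sigma_initial, params, ageing=True,
                   max_strain_step=1.0e-4, max_time_step=50.0, t_limit=1.0e7):
    """Integrate equation-set (17) under constant load; return (life, strain).

    Explicit Euler stepping, limited in both time and strain increment.
    Failure is ASSUMED to occur when omega reaches 1/3 (strain = eps_u).
    Stresses are in MPa and times in hours.
    """
    eps0, sigma0 = params["eps0"], params["sigma0"]
    h_prime, h_star = params["h_prime"], params["h_star"]
    k_prime = params["k_prime"] if ageing else 0.0  # switch off the ageing term
    eps_u = params["eps_u"]

    t = strain = hard = soft = omega = 0.0
    sigma = sigma_initial
    while omega < 1.0 / 3.0 and t < t_limit:
        rate = eps0 * math.sinh(
            sigma * (1.0 - hard) / (sigma0 * (1.0 - soft) * (1.0 - omega)))
        dt = min(max_time_step, max_strain_step / rate)
        d_strain = rate * dt
        hard += (h_prime / sigma) * (1.0 - hard / h_star) * d_strain  # primary creep
        soft += (k_prime / 3.0) * (1.0 - soft) ** 4 * dt              # particle ageing
        omega += d_strain / (3.0 * eps_u)                              # cavitation
        sigma += sigma * d_strain   # constant load: d(sigma) = sigma d(eps)
        strain += d_strain
        t += dt
    return t, strain


# Hypothetical parameters chosen only to exercise the code (NOT table 2 values).
demo_params = {
    "eps0": 1.0e-7,     # h^-1
    "sigma0": 30.0,     # MPa
    "h_prime": 1.0e4,   # MPa
    "h_star": 0.4,
    "k_prime": 1.0e-4,  # h^-1
    "eps_u": 0.1,
}

life_aged, strain_aged = creep_lifetime(100.0, demo_params, ageing=True)
life_fresh, strain_fresh = creep_lifetime(100.0, demo_params, ageing=False)
print(f"with ageing: {life_aged:.3g} h, without ageing: {life_fresh:.3g} h")
```

With these placeholder values the ageing case fails sooner than the non-ageing case, which is the qualitative trend described for figure 4. The size of the reduction depends entirely on the parameters and carries no quantitative meaning here.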

Environmental-attack by oxygen to create an oxide which then spalls periodically 
was identified as a problem for remanent lifetime prediction by Cane & Manning 
(1981). They pointed out that data obtained in a laboratory on testpieces (approximate 
diameter 10 mm) may give a pessimistic estimate of remanent lifetime of components 
that are often in excess of 100 mm in section: testpieces would fail prematurely due
to the greater rate of loss of load-bearing area in a 10 mm section size compared to
100 mm.

Table 2. Intrinsic model material parameters for a 1 Cr 1/2 Mo ferritic steel.

Figure 4. Model predictions of creep lifetime as a function of stress at two temperatures. The solid lines refer to material undergoing particle-ageing and the dashed lines to a hypothetical “non-ageing” material. Model parameters for 1Cr 1/2Mo steel (Dyson & Osgerby 1993).

Various models have been proposed for the damage rate, $\dot{S}_c$, due to this mechanism
(Manning 1981; Ashby & Dyson 1984; Dyson & Osgerby 1989). It was shown by Dyson
& Osgerby (1989) that these could all be written in the following way:

$$\dot{S}_c = (2/R)\,\kappa\,\dot{\varepsilon}^{\alpha}, \tag{18}$$

where $\alpha$ is either 0.4 or 0.5 and $\kappa$ is a term containing the parabolic rate constant for
oxidation, $K_p$; $\kappa$ is proportional to either $K_p^{0.5}$ or $K_p^{0.6}$. Table 1 gives the formalism due
to Ashby & Dyson (1984).

Equation (18) was incorporated into equation-set (17) to assess the effect of spallation 
on predicted creep lifetimes. The creep strain rate equation becomes 

$$\dot{\varepsilon} = \dot{\varepsilon}_0 \sinh\left[\sigma(1-H)/\{\sigma_0(1-S)(1-S_c)(1-\omega)\}\right]. \tag{19}$$

The spalling criterion proposed by Manning (1981) has been used here to give an
equation for $\dot{S}_c$:

$$\dot{S}_c = \frac{2}{R}\left[\frac{E}{4\gamma_i}\right]^{0.2} K_p^{0.6}\,\dot{\varepsilon}^{0.4}, \tag{20}$$

where the oxide modulus $E = 200$ GPa and the oxide/metal interfacial energy
$\gamma_i \approx 2$ N m$^{-1}$. $\gamma_i$ could be considerably lower than this and the value used thus represents a lower
bound on $\dot{S}_c$. However, even a ten-fold decrease in $\gamma_i$ would result in only a 60% increase
in $\dot{S}_c$. A much more sensitive parameter is $K_p$. Data for $K_p$ as a function of temperature
have been reported for a number of low-alloy ferritic steels (Pinder 1977). A mean line
through these data has been calculated for the present work:

$$K_p = 7.7 \times 10^{-6}\exp[-13000/T]\ \mathrm{m^{2}\,h^{-1}}. \tag{21}$$

Equation (21) applies only to the temperature range 500-625°C.

Figure 5. Model predictions of creep lifetime in air and in vacuum as a function of stress at two temperatures. Model parameters for 1Cr 1/2Mo steel (Dyson & Osgerby 1993).

Figure 6. Illustrating that the predicted effect of air exposure on a 1Cr 1/2Mo steel at 625°C is qualitatively similar to the measured effect on a 1/2Cr 1/2Mo 1/4V steel.

Lifetimes were calculated at 550°C and 625°C using (19)-(21), along with the
intrinsic model parameters given in table 2 and assuming a testpiece diameter, $2R$,
equal to 10 mm (figure 5). The predictions cannot yet be verified for this alloy since
there do not appear to be experimental data in the literature. However, qualitative
support for the model is shown in figure 6, which reproduces experimental data in air
and argon using a 1%Cr 1/2%Mo 1/4%V steel (Cane & Townsend 1985). The
important point to note is the similarity in the relative lifetimes between argon and air,
since the longer absolute lifetimes of the CrMoV steel simply reflect its greater intrinsic
creep resistance.
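
The sensitivities quoted in this section for the spallation damage rate can be checked with a few lines of Python. This is a sketch under stated assumptions: equation (20) is taken to have the form $(2/R)[E/4\gamma_i]^{0.2}K_p^{0.6}\dot{\varepsilon}^{0.4}$, the strain rate used is an arbitrary illustrative value, and the function names are hypothetical.

```python
import math


def parabolic_rate_constant(temp_c):
    """Equation (21): K_p in m^2 h^-1, quoted as valid for 500-625 C only."""
    if not 500.0 <= temp_c <= 625.0:
        raise ValueError("equation (21) is quoted only for 500-625 C")
    return 7.7e-6 * math.exp(-13000.0 / (temp_c + 273.15))


def spallation_damage_rate(strain_rate, radius_m, temp_c,
                           modulus_pa=200.0e9, gamma_i=2.0):
    """Spallation damage rate in h^-1, ASSUMING equation (20) has the form
    (2/R) [E/(4 gamma_i)]^0.2 K_p^0.6 (strain rate)^0.4.

    strain_rate is in h^-1, radius_m in m, gamma_i in N m^-1.
    """
    k_p = parabolic_rate_constant(temp_c)
    return ((2.0 / radius_m) * (modulus_pa / (4.0 * gamma_i)) ** 0.2
            * k_p ** 0.6 * strain_rate ** 0.4)


rate = 1.0e-6  # h^-1, an arbitrary illustrative strain rate

# Section-size effect: 10 mm diameter testpiece against a 100 mm section.
size_ratio = (spallation_damage_rate(rate, 0.005, 550.0)
              / spallation_damage_rate(rate, 0.05, 550.0))

# Ten-fold decrease in interfacial energy.
gamma_ratio = (spallation_damage_rate(rate, 0.005, 550.0, gamma_i=0.2)
               / spallation_damage_rate(rate, 0.005, 550.0, gamma_i=2.0))

# Temperature effect through K_p between 550 C and 625 C.
temp_ratio = (spallation_damage_rate(rate, 0.005, 625.0)
              / spallation_damage_rate(rate, 0.005, 550.0))

print(f"10 mm vs 100 mm section: x{size_ratio:.1f}")
print(f"gamma_i reduced ten-fold: x{gamma_ratio:.2f}")
print(f"625 C vs 550 C: x{temp_ratio:.2f}")
```

On these assumptions the damage rate scales inversely with section size, and a ten-fold drop in $\gamma_i$ raises it by a factor of $10^{0.2}$, in line with the increase of about 60% stated above. The temperature ratio illustrates why $K_p$ is described as the more sensitive parameter.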

4. Concluding remarks 

Physics-based creep continuum damage mechanics is emerging as a powerful quantitative
tool in design and remanent lifetime prediction. Its computer-friendly format
makes it ideal for finite element work, which will become increasingly less specialised as
the ratio of price to computational power continues to fall. This paper has focused on
two mechanisms of creep/environmental interaction; there are almost certainly others
to identify but the strength of a physics-based approach is that these can be accommodated
without prejudicing the conclusions of existing models.

Attack of nickel-base alloys by oxygen diffusing into the metal from the environment 
and forming CO2 bubbles is now a well-studied phenomenon, but work still needs to be
done to validate predictions of the CDM model, particularly in extending the temperature/
time range of applicability of the damage-rate kinetics. The paper has made some
progress, particularly with regard to dispelling the idea prevalent in the literature that the
process is controlled by oxygen diffusion along grain boundaries. A kinetic model 
utilising oxygen diffusion within the matrix has been proposed which can account 
satisfactorily for the large experimental dataset on this subject. 


The authors would like to thank Dr M C Pandey (Defence Metallurgical Research Lab, 
Hyderabad) for general discussions and for bringing to their attention the papers by 
Iacocca & Woodford (1988) and Park & Alstetter (1988).

List of symbols 

A dot over any symbol signifies the derivative with respect to time 

a_[C]  activity of carbon in nickel;

C_[C]  atom fraction of carbon dissolved in nickel;

C_[O]  oxygen concentration;

C_[O]^R  oxygen concentration at reaction-front;

C_[O]  oxygen concentration at scale interface;

D_[O]  diffusivity of oxygen in nickel;

D_v  volume self diffusion coefficient;

d  distance of reaction-front from surface;

E  Young’s modulus;

H  internal variable describing material hardening during primary creep;

H*  saturation value of H;

h'  coefficient for material hardening during primary creep;

K'  coefficient for material softening due to precipitate coarsening;

K_p  parabolic rate constant for oxidation;

K(T)  equilibrium constant for solution of oxygen in nickel;

n  stress exponent for power law creep;

P_(CO2)  pressure of carbon dioxide in bubble;

P_(O2)  dissociation pressure of nickel oxide;

P*  critical pressure for nucleation of carbon dioxide gas bubbles;

R  radius of cylindrical testpiece;

r_c  critical radius for gas bubble;

S  internal variable describing material softening;

S_c  internal variable describing material softening due to corrosion;

T  temperature;

t  time;

α  exponent for damage due to oxide scale spallation;

γ  surface tension;

γ_i  oxide/metal interfacial energy;

ε̇  creep strain rate;

ε̇_0  material parameter in sinh law;

ε_u  uniaxial creep ductility;

κ  coefficient for damage due to oxide scale spallation;

Δμ_(NiO)  standard free energy of formation for nickel oxide;

Δμ_(CO2)  standard free energy of formation for carbon dioxide;

σ  applied uniaxial stress;

σ_0  material strength parameter in sinh law;

ω  internal variable describing material softening and fracture due to creep
cavitation.

The effect of hydrogen induced cracking on the integrity of
steel components 

INDRANIL CHATTORAJ 

National Metallurgical Laboratory, Jamshedpur 831007, India 

Abstract. The occurrence of hydrogen embrittlement is a much researched
phenomenon, known to cause mechanical property degradation and catastrophic
failures. The ductility loss brought about by hydrogen ingress is
encountered even in unstressed bodies where such cracking is termed 
hydrogen induced cracking (HIC) and is in phenomenological contrast to 
catastrophic failures encountered by stressed bodies subjected to hydrogen 
producing environments. This article will discuss HIC in some detail. This 
form of cracking is especially detrimental and often observed in oil country 
tubular goods (OCTG) which are subjected to sour gas. Consequently, the 
significance of HIC is most appreciated by oil companies at various stages 
of oil extraction, transportation and storage. In this article the chemical and
metallurgical genesis of HIC and its harmful impact on material and component
integrity are discussed. MnS inclusions have been noted to be
extremely harmful in promoting this form of cracking. Similarly, centreline segregation
in the ingot stage and deoxidation practices during steelmaking were found
to affect HIC. Some case studies of HIC obtained from the literature are
presented. The variables affecting the propensity to HIC are provided in 
brief. Suitable measures to reduce or eliminate HIC in steels are also 
discussed. 

Keywords. Hydrogen-induced cracking; embrittlement; decohesion theory;
hydrogen absorption. 


1. Introduction 

Hydrogen-induced cracking (HIC) is a phenomenon widely encountered by components 
exposed to environments capable of introducing hydrogen into the component material, 
especially sour environments to which oil and gas installations and transmission systems 
are exposed. The presence of H2S in such environments catalyses hydrogen entry into
steels. HIC can be thought of as one form of hydrogen embrittlement, and is
differentiated from the other forms of hydrogen embrittlement phenomena by the 
appearance, morphology and sites of cracking and by the mode of crack propagation. 
Hydrogen induced cracking occurs at ambient or near-ambient conditions and in 
stressed as well as unstressed components. These cracks have a planar appearance, with 
different cracks linked in a stepwise fashion. HIC is often accompanied by surface
blisters. 

This article deals with the general occurrence of hydrogen entry and hydrogen 
trapping and with the theories of hydrogen embrittlement, to provide a background of 
the various kinetic processes causing material degradation by hydrogen ingress. 
Subsequently hydrogen induced cracking is dealt with by providing a brief introduction 
of the phenomenon, discussing the variables affecting HIC and providing case studies 
including our recent experiences with this failure mode. The various preventive 
measures to counter HIC are also discussed. 


2. Hydrogen entry 

The hydrogen evolution reaction was characterized by McBreen & Genshaw (1967). 
The evolution of hydrogen occurs frequently due to cathodic reduction of H+ ions in
aqueous corroding solutions, or hydrogen may be absorbed due to the use of high 
pressure hydrogen gas in the environment. In addition, the material itself may have 
hydrogen retained from the process of ore reduction (steelmaking for example). 
Hydrogen either absorbed from the environment or present in the material can lead to 
unpredictable catastrophic failure. This is evident in the various failures in the oil and 
gas industry, aircraft and automotive industries. 

The problem of hydrogen embrittlement of any material involves various mechanisms 
which finally lead to embrittlement. The steps can be broadly classified as follows. 

(1) Hydrogen adsorption at the metal/alloy surface. 

(2) Hydrogen evolution at the metal/alloy surface. 

(3) Absorption of some of the adsorbed hydrogen into the bulk of the metal. 

(4) Transport of absorbed hydrogen to normal lattice sites as well as potential sites for 
cracking, under the influence of a naturally existing or externally applied chemical 
potential gradient. 

(5) Accumulation of hydrogen at the crack sites leading to catastrophic failure. 

Various schematics have been proposed for the reaction sequence mentioned above. 
McBreen & Genshaw (1967) proposed a simple scheme for the evolution and entry 
reaction and over the years it has served as a satisfactory basis for iron and steels and is 
included here in figure 1. The overall reactions of hydrogen evolution in solution are as
follows:

2H_3O^+ + 2e^- = H_2 + 2H_2O (in acid solutions), (1)

2H_2O + 2e^- = H_2 + 2OH^- (in alkaline solutions). (2)

The overall reaction can proceed by two mechanisms, namely:

(a) The recombination mechanism, which involves chemical recombination of adsorbed
hydrogen atoms as follows:

i. M + H^+ + e^- = MH_ads, (3)

ii. MH_ads + MH_ads = H_2 + 2M. (4)

(b) The electrochemical desorption mechanism, which involves the following
electrochemical steps:

i. M + H^+ + e^- = MH_ads, (as above) (3)

ii. MH_ads + H^+ + e^- = H_2 + M. (5)

Figure 1. Hydrogen evolution and entry scheme (according to McBreen & Genshaw
1967).

As a result of the need for multiple steps, it is obvious that one of the reaction steps can 
become rate controlling in the overall reaction path. 

3. Hydrogen trapping 

Darken & Smith (1949) were the first to observe that hydrogen diffusion in steels was
affected by imperfections in the lattice. The first formalism for trapping in steels was
developed by McNabb & Foster (1972). They suggested a trapping and a detrapping
parameter and came up with a mathematical formulation based on the assumption
of local equilibrium between trapped and normal lattice sites. Their formalism
considered only the reversible traps; irreversible traps were considered not to influence the
diffusion as they represented sinks with hydrogen essentially removed from the permeation
system. The works of McNabb & Foster (1972), and Oriani & Josephic (1974) considered
a single population of reversible traps. Pressouyre & Bernstein (1978) characterized the
trapping behaviour of different kinds of traps, both reversible and irreversible, by
controlling the type and number of traps and by observing their effect on the permeation
transient. Johnson et al (1979) analysed saturable and non-saturable traps by considering
relationships between the trapped hydrogen concentration and the lattice hydrogen
content and redefining effective hydrogen diffusivity in terms of these relationships.
According to their theoretical development, one can predict an increase in apparent
hydrogen diffusivity with lattice hydrogen content when saturable traps are present,
while the reverse is true for non-saturable traps. To summarize, we can consider traps as
either reversible or irreversible depending on the depth of the potential well they present to
hydrogen diffusion. Trapping manifests itself by changing apparent diffusivity values. The
amount and direction of such a change from perfect lattice diffusion depend on the
amount and type of traps present in the lattice. Simultaneous creation of traps and their
influence on hydrogen ingress, also known as dynamic trapping, occurs during
straining of hydrogen charged specimens.
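
The dependence of apparent diffusivity on lattice hydrogen content for saturable traps
can be illustrated with a minimal sketch. It assumes a single population of saturable
traps in local equilibrium with the lattice, for which D_eff = D_L / (1 + dC_T/dC_L).
The function name, the trap isotherm and all numerical values are illustrative
assumptions and are not taken from Johnson et al (1979).

```python
def effective_diffusivity(d_lattice, c_lattice, n_lattice, n_trap, k_eq):
    """Apparent hydrogen diffusivity with one saturable trap population.

    Illustrative assumptions (none of these values come from the source):
      d_lattice : trap-free lattice diffusivity
      c_lattice : lattice hydrogen concentration (same units as n_lattice)
      n_lattice : density of normal lattice sites
      n_trap    : density of trap sites
      k_eq      : trap/lattice equilibrium constant (dimensionless)
    """
    theta_l = c_lattice / n_lattice  # lattice site occupancy
    # Saturable trap isotherm: theta_T / (1 - theta_T) = K * theta_L, so
    # C_T = N_T * K * theta_L / (1 + K * theta_L) and
    # dC_T/dC_L = (N_T / N_L) * K / (1 + K * theta_L)**2
    dct_dcl = (n_trap / n_lattice) * k_eq / (1.0 + k_eq * theta_l) ** 2
    return d_lattice / (1.0 + dct_dcl)


# As the traps fill up, they retard diffusion less and D_eff rises towards D_L.
for c in (1e-8, 1e-6, 1e-5):
    print(c, effective_diffusivity(1e-8, c, 1.0, 1e-3, 1e6))
```

With these assumed values the apparent diffusivity rises as the lattice concentration
increases, which is the trend described above for saturable traps.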

This brief introduction to hydrogen trapping was necessary, since the trapping of 
hydrogen at MnS inclusions is the factor most responsible for HIC. The trapped 
hydrogen causes decohesion of the matrix-inclusion interface, either by pressurization 
or by reducing the interatomic bond strength. The net effect is the generation of 
delaminations and cracks near inclusions. 


4. Hydrogen embrittlement theories 

Although many theories have been proposed to explain the hydrogen embrittlement 
phenomenon, two have gained substantial popularity. 

4.1 Decohesion theory 

This theory was developed by Troiano and others (Troiano 1960; Mortlet et al 1958; 
Johnson & Troiano 1957) to explain the synergistic effects of hydrogen on material 
degradation. Troiano refuted the pressure theory and various surface adsorption 
theories for embrittlement, as these would predict an initial loss in ductility, at variance
with experimental observations. Troiano and co-workers (Troiano 1960; Mortlet et al 
1958; Johnson & Troiano 1957) proposed that the degree of embrittlement would vary 
as the hydrogen concentration in the region of maximum triaxiality in the lattice. 
Whereas the other theories were concerned with hydrogen content in cracks and voids, 
this theory renders hydrogen in voids harmless entities, and only hydrogen in special 
lattice positions as significant in embrittlement. The role of voids and cracks is to act as 
stress raisers, so that the region of maximum triaxiality is generally near such 
imperfections. The disappearance of hydrogen embrittlement at low temperatures can 
be explained by the decrease of hydrogen diffusivity, and at high temperatures by the
conflicting effects of stress induced diffusion and increased homogenization. Troiano
(1960) made the observation that crack initiation and propagation in HE occur in
three distinct stages: (a) incubation, (b) slow crack growth and (c) catastrophic failure.
From fractographic observations, Troiano concluded that the third stage of catastrophic
failure was essentially overload failure and there was no effect of hydrogen in
this stage. The incubation period according to Troiano was the time required for
hydrogen to reach a critical concentration in the region of maximum triaxiality. The 
slow crack growth in the second stage is essentially the rate of hydrogen diffusion to 
regions in front of the crack tip, thus implying discontinuous cracking. Troiano 
suggested a decrease in cohesive strength of the lattice due to presence of hydrogen; this 
coupled with the observation that hydrogen accumulates at regions of high triaxial 
stresses near imperfections, explained the embrittlement phenomenon. Troiano’s 
original speculation was that hydrogen atoms in solution in transition metals donate 
their electrons to the d band of the metallic core, thus increasing the repulsion of 
metallic cores. This theory predicted that addition of copper to nickel would increase 
embrittlement initially and then decrease it as the 3d band in nickel is progressively 
filled; this prediction was substantiated by experimental findings. 
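
Troiano's incubation stage, described above as the time for hydrogen to reach the region
of maximum triaxiality, can be given an order-of-magnitude feel with the characteristic
diffusion time t ~ x^2 / D. This is a generic scaling estimate rather than Troiano's own
calculation; the function name and the numerical values are assumptions chosen purely
for illustration.

```python
def incubation_time_estimate(distance_m, diffusivity_m2_per_s):
    """Order-of-magnitude time for hydrogen to diffuse a distance x (t ~ x**2 / D)."""
    if distance_m < 0 or diffusivity_m2_per_s <= 0:
        raise ValueError("distance must be >= 0 and diffusivity must be > 0")
    return distance_m ** 2 / diffusivity_m2_per_s


# Assumed values: 0.1 mm diffusion distance, two hypothetical diffusivities.
# A lower diffusivity (e.g. at lower temperature) gives a longer incubation time.
for d in (1e-10, 1e-12):
    print(f"D = {d:g} m^2/s -> t ~ {incubation_time_estimate(1e-4, d):g} s")
```

The estimate is consistent with the qualitative point made above that reduced hydrogen
diffusivity at low temperatures suppresses embrittlement.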

Tetelman & Robertson (1963) observed that crack growth due to hydrogen charging 
occurred discontinuously. Oriani & Josephic (1974) demonstrated the existence of 
a threshold stress at a given hydrogen fugacity and conversely a threshold fugacity at
a given stress intensity, below which crack growth did not occur in steels. They
observed that the threshold values for deuterium were consistently higher than those
for hydrogen. This observation can only be reconciled with the decohesion theory;
deuterium has a lower molal volume than hydrogen in α-iron, thus it is less effective in
concentrating at triaxiality sites due to stress induced diffusion. According to Oriani,
decohesion due to hydrogen is manifested in high strength steels by an increase in
cleavage fracture and in low strength steels by an enhancement of microvoid nucleation.
Stress relaxation studies (Oriani & Josephic 1979) on AISI 1045 steels were
explained by suggesting that since hydrogen causes decohesion, it favours microvoid
generation over relaxation by dislocation activity. Oriani & Josephic (1980) also
proposed that the reduction of Peierls stress and the reduction of screw dislocation cross
slip in bcc iron are due to reduced cohesive strength in the presence of hydrogen.

Birnbaum’s objection to this concept of lowering of cohesive strength is that such 
a reduction should also manifest itself in the reduction of atomic force constants and 
elastic constants. In the case of fcc metals a decrease in these constants is indeed
observed in the presence of hydrogen. However in group Vb metals like niobium, even 
at very high hydrogen charging, there is significant increase in C u and bulk modulus 
values (Birnbaum & Mazzolai 1985). The other criticism of this theory is in the
appearance of fractures. Decohesion theory predicts that intergranular fracture surface 
should show the morphology of grain interface whereas actual observations with high 
resolution microscopes reveal fracture along slip planes near grain boundaries, often 
crossing over into adjacent grains along slip planes (Robertson et al 1984). In spite of 
these criticisms, decohesion theory is one of the more popular theories of HE. 
Calculations based on the embedded atom model and the cluster variation method 
(Briant & Messmer 1980) do suggest that hydrogen would indeed cause decohesion, 
thus vindicating this theory. 

4.2 Internal pressure theory 

A theory of hydrogen embrittlement based on hydrogen accumulation at voids was 
proposed by Zapffe & Sims (1941). According to these authors hydrogen atoms 
accumulate at voids until an equilibrium pressure is attained. When this pressure 
attains a value close to the elastic strength of the metal, the metal embrittles. de Kazinczy
(1954) elaborated on this concept by suggesting that crack growth is facilitated by the
energy released by expanding hydrogen gas in cracks. de Kazinczy utilized Griffith’s
theory of brittle fracture, which states that for a stable crack to grow, the system must 
undergo a total decrease in the energy content. He suggested that part of the total 
energy of the material charged with hydrogen is due to the energy content of hydrogen 
gas in the cracks and voids. On crack propagation, the gas expands releasing energy 
and thus the energy due to strain required to meet Griffith’s criterion is reduced, that is, 
the strain at which crack propagates is lower. This model however requires that the 
material have preexisting cracks in it. However Bilby & Hewitt (1962) and Garofalo et 
al (1960) proposed that crack initiation could be due to dislocation pile-up under the 
influence of external stress. Bastien & Azou (1951) suggested that hydrogen is concentrated
in Cottrell atmospheres of dislocations and is transported along with dislocations
to the cavities. This dislocation enhanced diffusion of hydrogen would account for
the incubation time and temperature effects of embrittlement. 

Bockris & Subramanyan (1971) developed relationships for the hydrogen fugacity in
microcracks and voids as a function of overpotential, in electrochemical charging of 
hydrogen. Based on theoretical calculations, they predicted which rate controlling 
mechanisms for hydrogen evolution would cause embrittlement when pressure theory 
was valid. Beck et al (1965) suggested that blister formation due to high internal 
pressure of hydrogen in cavities would lead to plastic deformation in the vicinity of 
blisters. This was supposed to increase the dislocation density near blisters, which 
would act as crack initiation sites. 
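
de Kazinczy's argument, given earlier in this section, that internal gas pressure lowers
the externally applied load needed for crack growth can be illustrated with a simplified
sketch. It uses the plane-stress Griffith stress for a through crack,
sigma_f = sqrt(2 E gamma / (pi a)), together with the linear-elastic superposition result
that a uniform pressure p on the crack faces acts like an additional remote tension p.
This is not de Kazinczy's energy balance for expanding gas; the function names and
numerical values are assumptions chosen for illustration.

```python
import math


def griffith_stress(youngs_modulus_pa, surface_energy_j_m2, half_length_m):
    """Plane-stress Griffith fracture stress for a through crack of half-length a."""
    return math.sqrt(
        2.0 * youngs_modulus_pa * surface_energy_j_m2 / (math.pi * half_length_m)
    )


def applied_stress_to_propagate(youngs_modulus_pa, surface_energy_j_m2,
                                half_length_m, gas_pressure_pa):
    """Remote tensile stress needed for growth of an internally pressurized crack.

    Assumption: uniform gas pressure on the crack faces is equivalent to an
    extra remote tension of the same magnitude (linear-elastic superposition).
    """
    required = griffith_stress(youngs_modulus_pa, surface_energy_j_m2, half_length_m)
    return max(0.0, required - gas_pressure_pa)


# Illustrative values only (assumed, not from the source):
# E = 200 GPa, surface energy 2 J/m^2, crack half-length 1 mm.
for pressure in (0.0, 5e6, 10e6):
    stress = applied_stress_to_propagate(200e9, 2.0, 1e-3, pressure)
    print(f"gas pressure {pressure / 1e6:5.1f} MPa -> applied stress {stress / 1e6:6.2f} MPa")
```

In this idealization a sufficiently high internal pressure drives the required applied
stress to zero, which corresponds to cracking of an unstressed body.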

There are two criticisms of this theory. Johnson & Hirth (1976) showed that
hydrogen supersaturations obtainable at dislocations were greatly overestimated in
the literature, and dislocation transport, even if it was effective, could not account for the large
internal pressure required by this theory. The finding of Hancock & Johnson (1965),
that crack propagation occurs even in very low hydrogen pressure environments, also
contradicts the internal pressure mechanism of embrittlement. Such a mechanism
might be effective in high fugacity environments but cannot be accepted as a general 
theory. 


5. Hydrogen-induced cracking (HIC) 

In the previous sections an introduction to the broad area of hydrogen embrittlement 
was provided. This was done to impress upon the readers that the general effect of 
hydrogen entry into alloys, especially steels, is a deterioration of their mechanical 
properties. However the external manifestation of such deterioration can take up 
various forms depending on the presence or absence of other variables in the system. 
The aim of this paper is to concentrate on HIC which is a specific type of hydrogen 
embrittlement, and is often encountered by components exposed to sour environments. 
The rest of this article will deal with the various parameters enhancing HIC, a few case 
studies of HIC observed in service, and measures adopted to counter HIC. 

Whereas the problem of stress cracking in a wet sour environment was identified as 
early as the 1950s, it was only a couple of decades later that hydrogen induced cracking 
as a mode of pipeline failure was appreciated (Moore & Warga 1976). This is different 
from sulphide stress cracking (SSC) in that it can occur in stressed as well as unstressed 
bodies. Burns (1976) pointed out the difference in the appearance of these two types of 
sour environment cracks. HIC was identified as cracking due to a mechanism involving
the formation, growth and linkage of internal blisters, which often gave it the appearance
of a series of steps and hence its alternate name, “stepwise cracking (SWC)”. In
this article I have used the terms HIC and SWC interchangeably, signifying the same
phenomenon. Ikeda et al (1977) provided a schematic of the SWC process, which has 
been included here as figure 2. 

6. Variables affecting HIC 

6.1 Environmental variables 

The important environmental variables are pH, chloride content, temperature, H2S
concentration, presence of dissolved oxygen and the exposure time. The findings of
Ikeda et al (1980) indicated that a lower pH environment accelerated corrosion as well as
stepwise cracking (SWC). They also demonstrated that a higher concentration of dissolved
chlorides increased the severity of the environment. The extent of hydrogen
damage was found (Kowaka et al 1975) to be maximum in the temperature range of 15
to 35°C. The sharp reduction in SWC at temperatures above 35°C is attributed to the
decreased concentration of H2S in the saturated solutions. A threshold concentration
of H2S is often reported (Nakasugi et al 1979) below which HIC does not occur.

Figure 2. Direct linkage of two closely spaced cracks producing steps. After Ikeda
et al (1977).

6.2 Metallurgical variables 

One of the most important considerations is the casting practice with particular 
reference to the segregation of impurities and alloying elements occurring during 
casting. Moore & Warga (1976) recommended the study of specimens from the pipe 
section of ingots (segregation zone), as these are most sensitive to SWC. They also 
found that the deoxidation practice during casting leads to increased SWC sensitivity 
so that fully killed steels are almost always more susceptible to HIC as compared to 
semi-killed steels of a similar composition. Moore & Warga pointed out that increased
deoxidation increases sulphur solubility as well as promotes the formation of detrimental
type II (stringers) MnS inclusions in comparison to the type I inclusions. Similarly,
the practice of controlled rolling is conducive to the formation of MnS stringers.
Coldren & Tither (1976) concluded that SWC arises because of voids at non-metallic
inclusion/steel interfaces. Wilde et al (1980) confirmed that SWC initiated at elongated
type II MnS inclusions. Herbsleb et al (1980) also concluded that inclusions with sharp
edges and large surfaces are preferred sites for initiation of SWC. After the realization
that MnS inclusion shape and type are of vital importance in HIC, various modifications
in steelmaking practices have been tried out to control these inclusions. Decreasing
the sulphur content was found beneficial by Herbsleb et al (1980) and by Brown
& Jones (1984). Inclusion shape control by calcium or rare earth metal additions
has been found to improve HIC resistance.

Miyoshi et al (1976) found that SWC propagated along anomalous microstructures 
composed of low temperature transformation products. Microstructural banding 
(pearlite or martensite) has also been found to increase crack propagation. The
formation of banded microstructures in pipeline steels is due to dendritic segregation of
manganese and phosphorus. According to Taira et al (1984) restricting the segregation
zone hardness to below 300 HV is needed to eliminate HIC, since hard “banded”
structures of bainite or martensite are susceptible to hydrogen embrittlement as well as 
HIC. Thus homogenization of microstructures by quenching and tempering would 
also reduce SSC. 

6.3 Stress variables 

A correlation has not yet been established between steel strength and SWC susceptibility. 
HIC has been reported for a wide range of tensile strengths (300 to 800 MPa). It seems
that nonmetallic inclusions and anomalous microstructures are more important
factors in determining HIC susceptibility as compared to the strength of the steel. The
SWC mechanism involves segregation of hydrogen atoms to internal interfaces followed
by decohesion at these interfaces, possibly by hydrogen pressurization. The
plastic regions generated at the blister crack tips are embrittled by hydrogen, and
transverse cracks propagate through the embrittled regions to join parallel cracks, thus
giving the appearance of steps. SWC occurs not only in unstressed components, but
also in steels under elastic tensile stresses. Very few studies have been conducted to 
investigate the effect of welding residual stresses on the occurrence of SWC. The study 
of Taira et al (1979) seems to suggest that there is no effect of residual stresses. 

6.4 Chemical variables 

The addition of copper has often been found (Inagaki et al 1978) beneficial to SWC
resistance of linepipe steels. This was attributed to the formation of a black protective
film of mackinawite, FeS_(1-x), enriched with Cu. This reduces the corrosion rate as well
as the amount of absorbed hydrogen. Thus Cu-alloying is beneficial only in environments
where a protective film can form. Similarly Cr additions were also found to
improve SWC resistance. Nishimura et al (1977), and Parrini & Devito (1978) had
observed the beneficial effects of Cr. Ni was also found to greatly reduce hydrogen
absorption from sour environments (Nishimura et al 1977). Researchers have found
that Co, Bi and Rh additions cause a decrease in hydrogen absorption by steels from
H2S-containing environments (Iino et al 1979).


7. Case studies 

7.1 First reported failures 

Parades & Mize (1954) were the first to report service failure due to stepwise cracking
(SWC). This occurred in 1951-52 on an API-5LX grade 359 transmission linepipe, after
several months of service. It was found that the imperfections necessary for blistering
were abundant in the material. Delaminations and blistering were evident, with some
blisters observed to have initiated at the delaminations. Analysis of the gas in the
blisters showed 94% hydrogen and 2% methane, with traces of carbon dioxide
and carbon monoxide. Class (1963) reported the failure of a pipeline in sour service.
The pipes were made from killed steels and were longitudinally welded by resistance
welding. Analysis revealed manganese sulphide and silicate
inclusions in the material. It was also observed that seamless pipes performed significantly
better in service.

7.2 Hydrogen damage in a failed pressure vessel 

McHenry et al (1982) reported their investigation of a pressure vessel rupture at
an oil refinery that resulted in the death of 17 people and extensive property damage.
The vessel was an 18.8 m high steel cylinder used to strip hydrogen sulphide from
a process stream of propane and butane. The vessel was observed to fracture circumferentially
adjacent to a repair weld made to replace a section damaged earlier by
hydrogen blisters and delaminations. McHenry et al (1982) observed five types of
hydrogen damage.

Hydrogen blisters and delaminations were observed in the bottom part of the 
cylinder. Blisters and delaminations were often close enough to the surface to 
cause visible bulging. The material was found to be quite susceptible to hydrogen 
induced cracking. These delaminations were indirectly responsible for failure as 
they led to the replacement of a section of the vessel. The steel matrix between 
the interfacial cracks was found to fail by cleavage because the hydrostatic stresses
generated by absorbed hydrogen at blisters inhibit plastic deformation. In addition,
hydrogen induced cold cracking was observed in the heat affected zone of the
repair weld. However McHenry et al (1982) did not believe that these modes
were primarily responsible for failure. According to them, hydrogen stress cracks
developed during operation of the vessel at the inside surface adjacent to the
repair welds and hydrogen pressure cracking caused these surface cracks to propagate
in a zigzag fashion through the vessel wall. These were facilitated by hydrogen
embrittlement of the material which reduced the fracture toughness of the material 
by more than half. 

7.3 Stepwise cracking in a spirally welded pipeline 

Moore & Warga (1976) reported that in 1974, three service ruptures occurred in
a spirally welded, API 5LX-Grade 42 sour gas transmission pipeline between four and
seven weeks after commissioning. It was found that extensive blistering and cracking
had occurred over 10 km of this 90 km pipeline. The failures could be attributed to 
stepwise cracking near elongated MnS inclusions. Although the failures occurred near 
the spiral weld, they could not be associated with any kind of weld defects. Moore 
& Warga used a test proposed by Cotton to study the HIC susceptibility of the 
material. They found that the most important factor affecting the hydrogen cracking 
sensitivity of steels was the MnS inclusion morphology. Elongated type II MnS
stringers were observed to render the steel very crack sensitive. Moreover, since all fully
killed steels except those treated with rare earth elements contain type II MnS 
inclusions, they would be more susceptible to HIC than semi-killed steels. Moore 
& Warga were of the opinion that although steels can be made insensitive to HIC, 
a completely immune steel was not feasible. 

7.4 Ongoing research at the National Metallurgical Laboratory, Jamshedpur

Under the aegis of The Component Integrity Evaluation Programme (CIEP), which is 
a concerted effort on life estimation of various components, one of the projects is 
focusing on the studies of HIC in API 5LX Grade 52 pipeline steel exposed to simulated 
sour environment. This investigation is an attempt at an ‘in toto’ observation of the 
mechanical property degradation brought about by sour environment exposure, and is 
one of the few attempts at correlating the extent of HIC with the loss in ductility of the 
material. One of the observations of this study was that the extent of HIC was
a monotonically increasing function of the time of exposure to the sour environment. It 
was found that even after twenty-eight days of exposure to the NACE prescribed sour 
environment, no saturation of the HIC damage occurred. This observation indicated 
the great potential of lattice imperfections to trap hydrogen. In this investigation the 
mechanical property deterioration caused by HIC was studied in some detail through 
slow straining after exposure to the sour environments. The investigators found that, 
surprisingly, hydrogen induced cracks, per se, did not cause a deterioration of ductility. 
Such deterioration was the result of the presence of mobile lattice hydrogen, which is
distinct from hydrogen trapped at matrix/inclusion interfaces, the latter being
responsible for HIC. This investigation has so far revealed the clearly distinct roles of 
mobile lattice hydrogen and trapped hydrogen, each causing different types of damage 
to the matrix, the former causing lattice embrittlement most observable on dynamic 
straining and the latter causing cracking under static conditions. 

8. Preventive measures 

The preventive measures which can be adopted during service include the injection of
inhibitors, and desulphurization and dehydration of the gas passing through the line.
The National Association of Corrosion Engineers (NACE) recommended a hardness limit
of HRC 22 for steels in use in sour environments. Various metallurgical parameters can
also be controlled to reduce HIC. 

There are three well-known measures to improve HIC resistance:

(1) reduction of hydrogen entering into steels by the formation of protective films;

(2) reduction of MnS and inclusion shape control;

(3) control of microstructures, especially segregation zones.

The first of these methods is often insufficient as the film formation is pH dependent and 
in aggressive media (low pH) protective films are often not formed. It was argued earlier 
that the beneficial effect of Cu alloying was due to the enrichment of the protective
surface film of mackinawite. The more effective preventive measures are the modification
of inclusion shape and amount, and the control of the microstructure.

Taira et al (1984) found that complete elimination of inclusions improves SSC
resistance as well as HIC resistance. It was mentioned earlier that fully killed steels were
more prone to HIC; however, due to their better mechanical properties, it is often
desirable to use them in spite of the HIC limitation. This calls for modifications in the
steelmaking practice. Sulphide shape control by rare earth additions is often used.
However, since the amount of rare earth addition has to be very precise, steelmakers are
often hesitant about following this practice. Moreover, rare earth sulphides tend to
produce ionization effects which interfere with arc stability in automatic welding
processes. Calcium treated steels were also found to have improved resistance to HIC 
(Brown & Jones 1984; Taira et al 1984). According to Taira et al (1984) calcium content 
must be delicately controlled so that neither manganese sulphide, nor calcium sulphide 
clusters are present. They believed that the HIC resistance of a calcium-treated steel 
depended on the ratio of the Ca content to the S content. Brown & Jones (1984) found 
that HIC resistance increased with Ca:S ratio. They observed that rare earth metal 
treated steels were less resistant than Ca-treated steels. 

HIC is often found to propagate along (pearlitic) microstructural bands. The 
formation of the latter is a result of dendritic segregation of Mn and P. The hardness of 
the centre segregation in steel plates is one of the most important factors affecting HIC 
susceptibility of high strength line pipe steels. Matsumoto et al (1986) found that 
lowering the carbon content to below 0.05% decreased the hardness of the segregation 
zones. They also found that homogenization treatments were effective in decreasing the 
same. However, such treatments are time consuming due to the slow diffusion of Mn 
and may not be practical, as they reduce industrial productivity. Brown & Jones 
(1984) advocated the use of quenched and tempered steels and reduction of manganese 
to reduce banding. They also suggested the use of modern casting techniques like 
superheat temperature control and electromagnetic stirring to reduce the centreline 
segregation in continuously cast steels. 

Weldability and weld joint failures 

Abstract. Welding plays a prominent role in the fabrication and erection 
of structures and components in the industry. Weld failures when they do 
occur, either during fabrication or service, involve considerable rework, 
apart from production, time and material losses. Hence, it is essential to 
adopt appropriate welding procedures depending on the material, weld 
configuration and the service requirement. In addition, the weldability of 
the material should be established with adequate laboratory tests in order 
to minimise the tendency towards hot and cold cracking susceptibilities. 
The mechanisms of such cracking and the tests to be undertaken to evaluate 
the cracking susceptibility are enumerated in this paper along with a brief 
report on weld failures and criteria to be adopted for their repair. 

Keywords. Weld failures; weld repair; weldability; cold cracking; hot 
cracking. 

1. Introduction 

Welding plays a significant role in the fabrication, erection and commissioning of 
plants and machinery for power, petroleum, chemical, steel and other industrial 
sectors. With the improvement in steel-making technology and the availability of better 
and cleaner steel grades with higher strength-to-weight ratios, the capacity, efficiency 
and availability of plants and machinery are continuously increasing. The advent of 
better design capabilities, using complex structural analysis with finite element and 
other methods, has enabled optimisation of section thickness with reduced factor of 
safety and considerable savings in material and fabrication costs. Manual metal arc 
welding is being replaced by better, automated and sophisticated welding technologies 
which reduce the fabrication and erection time considerably and also give better and 
consistent quality of welds. 

The effect of these technological advances is that the pressure parts and other 
components are now exposed to severe temperature, pressure and environmental 
conditions for prolonged periods of time which may, sometimes, lead to unforeseen and 
catastrophic failures. The weldments and heat affected zones (HAZ), which are structurally 
non-integral with the base metal, are more prone to such failures. Apart from the cost and 
time required for the weld repair of such components, the plant outage leads to considerable 
financial losses. Hence, it is important to understand the factors responsible for such 
weld failures and try to eliminate or at least minimise them to the extent possible. The 
present paper aims at discussing the various metallurgical and other welding related factors 
that can cause weld failures and methods of preventing them. Creep, fatigue and fracture 
mechanics are excluded from the scope of the present paper since they are covered elsewhere 
in the book. Similarly, corrosion aspects are also outside the purview of this paper. 

Table 1. Welding defects and their effects. 

Defect              Possible effects                          Action

Incomplete          Reduces strength and notch toughness      Complete removal necessary
penetration

Lack of fusion      Reduced fatigue strength and static       Remove and reweld
                    and impact strength

Undercut            Reduces fatigue strength, increases       Addition of weld metal
                    incidence of hot tears

Overlap             Reduces fatigue strength                  Removal by grinding

Slag entrapment     Irregular and large lenticular            Chipping, cleaning and
                    inclusions may reduce strength and        rewelding
                    act as sites for initiation of
                    brittle fracture

Porosity            Dense porosity reduces load carrying      Excessive amount necessitates
                    area and acts as stress raiser            removal and rewelding

Incorrect weld      Static and fatigue strength may get       May require rewelding
size and profile    affected; may often indicate presence
                    of other defects, e.g. lack of
                    penetration

Poor surface        Acts as stress raiser and may affect      Grinding and chipping followed
appearance          fatigue strength                          by welding may be necessary
                                                              where it is particularly bad

Weld or HAZ         May propagate and lead to brittle         Grinding and rewelding
cracking            failure


2. General considerations for welding 

A wide variety of metals and alloys are used to fabricate the structural and pressure 
parts in the industry and a number of welding technologies are available for the same. 
The welding processes commonly used can be broadly divided into three categories 
based on the heat input. 

(a) Low heat input (manual metal arc): 1.0 to 1.8 kJ/mm 

(b) Intermediate heat input (submerged arc, metal inert gas, cored wire etc.): 1.8 to 4.0 kJ/mm 

(c) High heat input (electroslag, electrogas etc.): 10 to 40 kJ/mm 
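
The three heat input categories above can be sketched as a small lookup. The arc energy estimate used here (voltage times current divided by travel speed) is the usual textbook expression and is not given in the text; the function names, the efficiency factor and the handling of the 1.8 kJ/mm boundary are assumptions made for illustration. The quoted ranges leave gaps (below 1.0, between 4.0 and 10, above 40 kJ/mm), which the sketch reports as unclassified rather than guessing.

```python
def heat_input_kj_per_mm(voltage_v, current_a, travel_speed_mm_per_s, efficiency=1.0):
    """Arc energy per unit weld length in kJ/mm (textbook estimate, an assumption here)."""
    if travel_speed_mm_per_s <= 0:
        raise ValueError("travel speed must be positive")
    return efficiency * voltage_v * current_a / (1000.0 * travel_speed_mm_per_s)


def heat_input_category(hi_kj_per_mm):
    """Map a heat input to the three categories listed in the text."""
    if 1.0 <= hi_kj_per_mm < 1.8:
        return "low"            # e.g. manual metal arc
    if 1.8 <= hi_kj_per_mm <= 4.0:
        return "intermediate"   # e.g. submerged arc, metal inert gas, cored wire
    if 10.0 <= hi_kj_per_mm <= 40.0:
        return "high"           # e.g. electroslag, electrogas
    return "unclassified"       # falls in a gap between the quoted ranges


if __name__ == "__main__":
    hi = heat_input_kj_per_mm(voltage_v=25, current_a=200, travel_speed_mm_per_s=4)
    print(hi, heat_input_category(hi))
```

A 25 V, 200 A arc travelling at 4 mm/s gives 1.25 kJ/mm with these assumptions, which falls in the low heat input band.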

While it is desirable to achieve the fastest possible deposition rate consistent with the 
properties required as per design, the welding process is generally dictated by several 
factors such as section thickness, type of joint, welding position, equipment and 
manpower availability and shop/site environments. Well established welding procedures 
are adopted in each case with welders trained for that material, joint configuration 
and welding position. 

Despite these precautions, weld failures do occur occasionally, either during the 
manufacturing stage itself or during service. These failures could be in the form of 
porosity, lack of fusion, cracks etc., which have a direct bearing on the performance 
of the component. Table 1 lists some of these defects, their possible effects and remedial 
actions to be taken on them. 


3. Welding of structural and pressure vessel steels 

A large number of steel grades are extensively used in welding fabrication. They can be 
classified into three broad categories (Doty 1970) for their usage in structural and 
pressure vessel welding fabrication (table 2). Apart from carbon and low alloy steels, 
several grades of high alloyed and microalloyed steels are also used for specialised 
fabrication. 

Table 2. Classification of structural and pressure vessel steels (Doty 1970). 

Composition          Thermal history          Minimum yield strength (N/mm²)

Carbon steels*       Non heat treated         207-345
                     Normalised               207-345
                     Quenched & tempered      345-689

Low alloy steels**   Non heat treated         276-482
                     Normalised               345-413
                     Quenched & tempered      413-689

Alloy steels***      Normalised               241-345
                     Normalised & tempered    345-552
                     Quenched & tempered      345-1241

*   0.33% max. C, C-Mn-Si and with certain alloying elements. 
**  0.26% max. C, C with up to about one percent total of Cu, Ni, Cr, V & Nb. 
*** 0.25% max. C, C with 0.5% or more of Ni, Cr or Mo. 

Generally all steel grades, either in the plate or pipe form, are amenable to all 
welding processes. The plate material composition, however, can affect the following 
mechanical and metallurgical properties of the welded joints. 

• Mechanical strength of the joint; 

• Fracture toughness of the base metal, HAZ and weld metal; 

• HAZ cracking sensitivity; 

• Weld metal cracking sensitivity; 

• Stress corrosion susceptibility. 

Adequate precautions are to be taken to eliminate the above mentioned problems. 
The relationship between heat input, chemical composition and thickness must be 
reasonably defined in a practical manner and the effects of general and local restraint 
taken care of in establishing adequate welding procedures. 


4. Welding of stainless steels 

Among the stainless steels, austenitic stainless steels are used for more than 90% of the 
fabrication because of their excellent weldability (Castner 1993). They have lower 
melting points, higher electrical resistance, lower thermal conductivity 
and higher coefficients of thermal expansion than carbon steels. Austenitic stainless 
steels are amenable to all welding processes provided the following factors are taken 
into account: 

• solidification cracking is the most frequently encountered problem; 

• autogenous welds may be subjected to variable penetration; 

• the welding procedure may affect corrosion resistance; 

• several factors must be considered if post weld heat treatment (PWHT) is required; 

• welding of austenitic stainless steels can result in greater distortion than carbon 
steels; 

• welding fumes present a potential hazard to workers; 

• dissimilar metal weld joints require proper selection of filler metals and procedures. 

Solidification cracking is a problem when the welds solidify primarily as austenite. 
If, instead, the weld metal solidifies primarily as delta ferrite, the hot cracking 
propensity is the least. The delta ferrite content can be estimated from the Schaeffler and 
DeLong diagrams. More recently, the Welding Research Council (WRC) published a 
modified diagram in 1992 (figure 1) that allows accurate prediction of the delta ferrite 
number (FN) in austenitic stainless steels. 

A number of sources recommend a minimum of 4 FN to prevent hot cracking in arc 
welds of austenitic stainless steels. However, in certain situations involving higher 
welding speeds and deep and narrow weld beads, this may not be adequate to prevent 
hot cracking and hence a higher range of 5-20 FN is recommended. Care must be taken 
not to go for higher FN where the possibility of sigma phase formation increases. 
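
The ferrite number guidance above can be written as a simple screening check. This is only a sketch: the function name and the returned messages are hypothetical, and applying the upper bound of 20 FN to ordinary welds as well as to the high speed or deep, narrow bead case is an assumption, since the text states the 5-20 range only for the latter.

```python
def ferrite_number_advice(fn, high_speed_or_deep_narrow_bead=False):
    """Screen a weld metal ferrite number (FN) against the guidance in the text."""
    # Minimum of 4 FN in general; 5 FN where welding speed is high or beads are deep and narrow.
    lower = 5.0 if high_speed_or_deep_narrow_bead else 4.0
    upper = 20.0  # assumption: the 20 FN ceiling is applied to both cases
    if fn < lower:
        return "hot cracking risk: FN below recommended minimum"
    if fn > upper:
        return "sigma phase risk: FN above recommended range"
    return "within recommended range"


if __name__ == "__main__":
    for fn, demanding in [(4.5, False), (4.5, True), (25.0, False)]:
        print(fn, demanding, ferrite_number_advice(fn, demanding))
```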

Minor variations in the chemical composition of austenitic stainless steels can 
sometimes cause problems of variable penetration for autogenous full penetration 
welds. These problems, referred to as ‘cast-to-cast’ or ‘heat-to-heat’ variation, are 
particularly evident in heats containing less than 0.003% sulphur. They are always 
associated with the effect of surface active elements on the fluid flow of the molten metal. 
Recommendations to avoid variable penetration include avoiding calcium and rare earth 
treated steels, and avoiding steels with less than 0.005% sulphur and with aluminium 
content higher than 0.02%. When such steels have to be used, careful control of process 
parameters has to be worked out. 

Figure 1. Welding Research Council (1992) constitutional diagram for stainless steel 
welds. 
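
The recommendations on variable penetration in autogenous welds can be turned into a short checklist. The sketch below reads the thresholds in the text as 0.005% sulphur and 0.02% aluminium and flags each condition separately; treating them as independent flags, and the function and argument names, are assumptions for illustration.

```python
def variable_penetration_flags(sulphur_pct, aluminium_pct, ca_or_re_treated):
    """Return the composition-related warnings for autogenous full penetration welds."""
    flags = []
    if ca_or_re_treated:
        flags.append("calcium or rare earth treated steel")
    if sulphur_pct < 0.005:
        flags.append("sulphur below 0.005%")
    if aluminium_pct > 0.02:
        flags.append("aluminium above 0.02%")
    return flags


if __name__ == "__main__":
    print(variable_penetration_flags(0.010, 0.010, False))  # no warnings expected
    print(variable_penetration_flags(0.002, 0.030, True))   # all three warnings
```

An empty list means none of the listed risk factors is present; it does not by itself guarantee consistent penetration.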


Welding can reduce corrosion resistance of HAZ regions that are exposed to 
temperatures between 425 and 900°C for long enough to precipitate chromium 
carbide at grain boundaries. The most widely used method to prevent sensitisation of 
the HAZ is to use low carbon stainless steel. Another approach is to use stabilized grades 
that contain titanium or columbium. Low carbon stainless steels have lower strengths 
than stabilized grades. Stabilized grades, on the other hand, may be more hot crack 
sensitive and may require careful ferrite control. They may also be susceptible to the 
localised precipitation of carbides in narrow regions of the HAZ that can lead to 
intergranular corrosion known as knifeline attack. 

Austenitic stainless steel weldments are normally used in service in the as-welded 
condition. However, PWHT may be required to minimise residual stresses, to prevent 
distortion and in environments that can cause stress corrosion cracking. 

The PWHT must be done in such a way as to avoid the following problems. 

• Heating in the range 425-900°C can result in sensitisation. 

• Heating weld between 540 and 925°C can transform ferrite to a hard brittle sigma 
phase. 

For these reasons, PWHT usually involves solution annealing at temperatures of 
1065-1120°C followed by rapid cooling or water quenching. Sigma phase can be 
minimised if the weld metal FN is limited to 10-12 FN. 
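
The temperature windows quoted above lend themselves to a quick check of a proposed PWHT holding temperature. This is a minimal sketch using only the ranges given in the text; the constant and function names are hypothetical, and holding time, which also matters in practice, is not modelled.

```python
# Temperature ranges in degrees Celsius, as quoted in the text.
SENSITISATION_RANGE_C = (425.0, 900.0)
SIGMA_PHASE_RANGE_C = (540.0, 925.0)
SOLUTION_ANNEAL_RANGE_C = (1065.0, 1120.0)


def _within(temperature_c, bounds):
    low, high = bounds
    return low <= temperature_c <= high


def pwht_hazards(temperature_c):
    """List the metallurgical hazards associated with holding at this temperature."""
    hazards = []
    if _within(temperature_c, SENSITISATION_RANGE_C):
        hazards.append("sensitisation")
    if _within(temperature_c, SIGMA_PHASE_RANGE_C):
        hazards.append("sigma phase")
    return hazards


def is_solution_anneal(temperature_c):
    """True if the temperature lies in the solution annealing range."""
    return _within(temperature_c, SOLUTION_ANNEAL_RANGE_C)


if __name__ == "__main__":
    for t in (500, 600, 910, 1100):
        print(t, pwht_hazards(t), is_solution_anneal(t))
```

The overlap of the two hazard ranges between 540 and 900°C is the reason the text moves the treatment up to the solution annealing range and follows it with rapid cooling.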


5. Weldability 

As mentioned earlier, depending on the heat input, the consumables used, the restraint 
on the weld and the cooling rate, a weld failure which manifests as porosity, lack of 
fusion or cracking may occur. To understand these behaviours, it is essential to study 
the weldability of the material. Weldability has been defined by the International 
Institute of Welding (IIW document no. IIW/IIS-22-59) as: ‘A metallic material 
is considered to be weldable to a certain degree by a given process and for a given 
purpose when a continuous metallic connection can be obtained by welding using 
a suitable procedure so that the joints comply with the requirements specified both in 
regard to their local properties and their influence on the construction of which they 
form a part’. 

This definition covers both the metallurgical and the structural standpoints. Thus, 
any test to evaluate the weldability of a material should take into account all aspects of 
welding: the heating cycle, the cooling cycle, the stresses induced during fusion 
and solidification, and also the structural variations. All these factors, individually or 
collectively, can cause cracking in the weldments. The weldment cracking is broadly 
classified into two categories: cracks which develop above the solidus of the metal are 
called ‘supersolidus cracks’ and those that develop below the solidus temperature are 
called ‘subsolidus cracks’. They are also commonly referred to as ‘hot cracks’ and ‘cold 
cracks’ depending on the temperature at which they develop. 

6. Cold cracking in weldments 

Cold cracking essentially depends on the underbead hardness, the hardness of the HAZ 
and the restraint imposed on the structure during cooling. The maximum possible 
hardness of the steel depends primarily on its carbon content. It is also influenced by the 
other chemical elements present in the steel and the weld thermal cycle. The methods of 
predicting the hardness have, by and large, focussed on defining and determining the 
carbon equivalent of the steel which takes into account the effect of various alloying 
elements. The effect of restraint history is evaluated separately by weldability tests such 
as implant and controlled thermal severity tests. The effect of thermal history is 
evaluated by determining/predicting the microstructure in weld simulation during 
cooling between 800 and 500°C. 

Cold cracking is also sometimes referred to as hydrogen-induced cracking (HIC) 
since hydrogen is known to play a major role in this mode of cracking. In ferritic 
steels, the phase changes which accompany welding may lead to hardened structures 
(like martensite formation) in either the weld metal or HAZ or both. These structures 
have low ductility which is further reduced by the presence of hydrogen, unavoidably 
introduced into the weld. This is because the solubility of hydrogen decreases as the 
temperature decreases. When the austenite contains hydrogen, the transformation 
during cooling results in the liberation of molecular hydrogen. In such cases, cracking 
or brittle fracture may occur following welding under the influence of residual stress 
and applied stress from the system (Baker et al 1966). The critical hardness levels of the 
weld will alter with type of joint, hydrogen potential of the welding process and the 
strength and composition of the parent metal. A typical cold crack is shown in figure 2. 

6.1 Carbon equivalent 

While assessing a plate material for its cold cracking susceptibility due to welding, 
the carbon equivalent (CE) should be taken into account. The 
limiting value of CE is, in essence, an index of the HAZ hardening that may be allowed 
without cracking, and this in turn governs the rate of cooling that can be tolerated during 
welding. Also, for a given value of carbon equivalent, there is a thermal conductivity balance 
which necessitates higher preheat temperatures to avoid HAZ cold cracks in thicker 
sections. 

Figure 2. Cold cracking in the root run of butt weld. 


A large number of carbon equivalent formulae have been proposed by different workers 
(de Meester 1991) which essentially predict the hardenability of the steel. Among these, the 
most widely used is the IIW carbon equivalent formula proposed by Dearden & O’Neil 
in 1940 and incorporated in the national standards of various countries. 

$$\mathrm{CE_{IIW}} = \mathrm{C} + \frac{\mathrm{Mn}}{6} + \frac{\mathrm{Ni} + \mathrm{Cu}}{15} + \frac{\mathrm{Cr} + \mathrm{Mo} + \mathrm{V}}{5}$$

This formula can be conveniently used for steels with carbon content of more than 0.18% 
or for steels with a $t_{800/500}$ cooling time of longer than about 12 s. 

The preheat temperatures suggested for several ranges of equivalent carbon content 
are: 

CE               Suggested preheat temperature (°C)

Up to 0.45%      Optional
0.45 to 0.60%    95 to 200
Above 0.60%      200 to 370

The above suggested preheat temperatures are for arc welding processes. They may 
be affected by the thickness of the workpiece. For gas welding processes, however, 
preheating is usually not required owing to the slow cooling rates associated with such 
processes. 
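
The carbon equivalent and the preheat bands above can be combined in a short calculation. The sketch assumes the IIW form of the formula with denominators 6, 15 and 5 (compositions in weight percent) and the three preheat bands quoted for arc welding; the function names are hypothetical, `None` stands for "preheat optional", and the thickness dependence mentioned in the text is not modelled.

```python
def carbon_equivalent_iiw(c, mn=0.0, ni=0.0, cu=0.0, cr=0.0, mo=0.0, v=0.0):
    """IIW carbon equivalent: C + Mn/6 + (Ni + Cu)/15 + (Cr + Mo + V)/5, all in wt%."""
    return c + mn / 6.0 + (ni + cu) / 15.0 + (cr + mo + v) / 5.0


def suggested_preheat_c(ce):
    """Suggested preheat range in deg C for arc welding, or None where preheat is optional."""
    if ce <= 0.45:
        return None
    if ce <= 0.60:
        return (95, 200)
    return (200, 370)


if __name__ == "__main__":
    # Illustrative compositions only; they are not taken from any specification.
    for steel in (
        {"c": 0.17, "mn": 1.36},
        {"c": 0.22, "mn": 1.50},
        {"c": 0.25, "mn": 1.20, "cr": 1.00, "mo": 0.50},
    ):
        ce = carbon_equivalent_iiw(**steel)
        print(round(ce, 3), suggested_preheat_c(ce))
```

For the 0.17% C, 1.36% Mn steel the carbon equivalent is about 0.40, so preheat is optional under this scheme; the Cr-Mo composition reaches 0.75 and falls in the highest band.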


6.2 Effect of hydrogen 


More than 50% of the in-service failures, to date, have been attributed to 
hydrogen induced cracking. The accident at the off-shore oil drill platform of North Sea in 
Ireland (Alexander 1981), the failure of the first stage rocket motor casing during 
hydrotest at NASA (Kula & Anetil 1977) and the brittle fracture of an ammonia 
pressure vessel (Rosenfield & Reid 1977) are some of the major weld failures that have 
been reported and attributed to HIC. 

Figure 3. Critical crack stress determined by implant tests on carbon steel vs total 
hydrogen (ppm) plotted on (a) linear and (b) logarithmic scale (Evans & Christensen 1971). 

Hydrogen is inadvertently introduced into the weld metal during welding. During 
cooling, this gas diffuses rapidly towards the HAZ while the solidification and phase 
transformation progress in the opposite direction. This explains the formation of 
‘underbead cracking’ in the HAZ close to the fusion boundary, in which hydrogen is 
known to play a major role. 

Among the techniques for diffusible hydrogen measurement, the glycerine method, 
mercury method and gas chromatography method are popular. The glycerine method, 
which was almost exclusively used in the 1950s and 1960s, is now considered not very 
accurate because some of the hydrogen dissolves in glycerine. The IIW and the American 
Welding Society (AWS) committees are trying to standardise the test conditions for 
mercury and gas chromatography methods and a detailed status report can be found in 
the literature (Kotecki 1992). 

There has also been considerable work on the effect of diffusible hydrogen on 
cracking. Many researchers (Evans & Christensen 1971) have examined the critical 
cracking stress by the implant test of a variety of steels as a function of hydrogen 
content in the weld metal. Figures 3a and 3b show the data for a 0.17% C and 
1.36% Mn steel and it is apparent that the critical cracking stress (implant rupture 
stress) is a linear function of the logarithm of the hydrogen content. Another method of 
representing the data is via the critical preheat temperature needed to avoid cracking as 
a function of diffusible hydrogen and hardenability, the latter expressed as carbon 
equivalent. The results are shown in figures 4a and 4b (Christensen & Simonsen 1981) 
and again the critical CE at a given preheat temperature is an approximately linear 
function of the logarithm of the weld metal diffusible hydrogen. Similar analysis has 
been conducted by other workers also (McParlan & Graville 1976). 
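
The log-linear relation described above, with critical cracking stress falling linearly with the logarithm of hydrogen content, can be fitted by ordinary least squares. The sketch below uses only the standard library; the function names are hypothetical and the data points are made-up round numbers chosen to lie on a straight line, not values read from Evans & Christensen (1971).

```python
import math


def fit_log_linear(hydrogen, stress):
    """Least squares fit of stress = intercept + slope * log10(hydrogen)."""
    if len(hydrogen) != len(stress) or len(hydrogen) < 2:
        raise ValueError("need at least two paired observations")
    xs = [math.log10(h) for h in hydrogen]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(stress) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("hydrogen values must not all be equal")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, stress))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def predict_critical_stress(intercept, slope, hydrogen):
    """Critical cracking stress predicted by the fitted line at a given hydrogen level."""
    return intercept + slope * math.log10(hydrogen)


if __name__ == "__main__":
    # Hypothetical observations: hydrogen content and implant rupture stress (N/mm^2).
    hydrogen = [1.0, 10.0, 100.0]
    stress = [800.0, 500.0, 200.0]
    a, b = fit_log_linear(hydrogen, stress)
    print(a, b, predict_critical_stress(a, b, 5.0))
```

A negative slope expresses the trend in the text: each tenfold increase in hydrogen lowers the critical stress by a fixed amount.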

Figure 4. Carbon equivalent adjustment plotted against diffusible hydrogen on (a) linear 
and (b) logarithmic scale (Christensen & Simonsen 1981). 

Attempts are being made to develop and introduce low and very low hydrogen 
electrodes in order to combat the above problem. The AWS specifications (A5.1 and 
A5.5) define a ‘low hydrogen coated electrode’ as having no more than 0.6% moisture 
content. The IIW defines ‘hydrogen controlled electrodes’ as those producing 
15 ml/100 g of hydrogen, with 10 ml and 5 ml/100 g defining progressively better 
consumables. The 15 ml/100 g compares well with the 0.6 percent coating moisture of the 
AWS specification, while 16 ml/100 g plotted on a logarithmic scale fits in exactly with 0.6% 
moisture. Hence, the AWS committee (Kotecki 1992) has decided to introduce three 
designations ‘H16’, ‘H8’ and ‘H4’ corresponding to hydrogen levels of 16, 8 and 
4 ml/100 g respectively. This system would be introduced for covered mild steel 
electrodes (ANSI/AWS A5.1), covered low alloy steel electrodes (A5.5), submerged arc 
mild steel wires and fluxes (A5.17), submerged arc low alloy steel wires and fluxes 
(A5.23), gas shielded mild steel wires (A5.18), gas shielded low alloy steel wires (A5.28), 
flux cored mild steel wires (A5.20) and flux cored low alloy steel wires (A5.29). Some of 
these eight specifications have since been introduced and others are in the process of 
being revised. For all critical applications these low hydrogen consumables can be used 
to minimise the problem of HIC. 
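
The three hydrogen designations can be illustrated with a small lookup. The sketch treats 4, 8 and 16 ml/100 g as upper limits for H4, H8 and H16, which is an interpretation of "corresponding to hydrogen levels" and should be checked against the relevant specification; the function name is hypothetical.

```python
def aws_hydrogen_designation(h_ml_per_100g):
    """Return the tightest designation (H4, H8 or H16) met by a measured hydrogen level."""
    if h_ml_per_100g < 0:
        raise ValueError("hydrogen level cannot be negative")
    # Assumption: each designation is an upper limit in ml of hydrogen per 100 g.
    for limit, name in ((4.0, "H4"), (8.0, "H8"), (16.0, "H16")):
        if h_ml_per_100g <= limit:
            return name
    return None  # above 16 ml/100 g: no low hydrogen designation applies


if __name__ == "__main__":
    for level in (3.5, 7.0, 15.0, 20.0):
        print(level, aws_hydrogen_designation(level))
```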


6.3 Weldability tests for cold cracking 

The susceptibility of a steel to cold cracking depends on such a multitude of factors that 
no single test employing a set of standard parameters can predict it quantitatively with 
any degree of certainty (Veeraraghavan et al 1988). Controlled thermal severity and 
implant tests have long been used to evaluate the cold cracking susceptibility. 

The controlled thermal severity (CTS) test (Granjon 1960) uses two plates of 
different thicknesses fastened to one another by a central bolt and by two lateral weld 
beads, called anchor runs C1 (figure 5). The test beads C2 and C3 are run in 
sequence with the assembly preheated to a selected temperature. Since the 
assembly is not symmetrical, the bead C3 cools more rapidly than the bead C2, which 
has a lower thermal severity. The cooling rate in the critical zone can be modified by 
varying the plate thickness and the preheating temperature. The combined thickness of the 
plates, expressed in quarters of an inch, is denoted as the thermal severity number 
(TSN). Fillet welding is done with increasing thickness until cold cracking occurs at 
a particular TSN. The demerit of this test is that the plate thickness determines both 
the HAZ hardening tendency and the restraint, which therefore cannot be varied 
independently. 
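
The thermal severity number as described above is simple arithmetic. The sketch follows the text's statement that TSN is the combined plate thickness in quarters of an inch; it does not distinguish the heat flow paths of the two test beads, and the function names and the trial data are hypothetical.

```python
def thermal_severity_number(plate_thicknesses_in):
    """Combined plate thickness expressed in quarters of an inch."""
    if any(t <= 0 for t in plate_thicknesses_in):
        raise ValueError("plate thicknesses must be positive")
    return sum(plate_thicknesses_in) / 0.25


def critical_tsn(trials):
    """Lowest TSN at which cracking was observed, or None if no trial cracked.

    trials is an iterable of (tsn, cracked) pairs from tests of increasing thickness.
    """
    cracked = [tsn for tsn, was_cracked in trials if was_cracked]
    return min(cracked) if cracked else None


if __name__ == "__main__":
    print(thermal_severity_number([0.5, 0.5]))
    print(critical_tsn([(2, False), (4, True), (6, True)]))
```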

The implant test, proposed by Granjon (1969), is also widely used to 
evaluate the cold cracking susceptibility. The standard procedure has been specified by 
the IIW and incorporated in the national standards of several countries. It is essentially 
a constant load rupture test of the real HAZ of a steel immediately after welding. Taking 
fracture of the specimen as the criterion, the critical stress for fracture under specific 
welding conditions can be evaluated and can serve as an index of the 
hydrogen induced cracking susceptibility of the steel. One can also establish the welding 
conditions under which the critical cracking stress ($\sigma_{cr}$) equals the yield strength, 
or a specified percentage of it, in order to select a welding procedure for 
the steel. 

Figure 5. Controlled thermal severity test. 

Many times cracks have been found to develop despite the above mentioned tests 
indicating satisfactory results. This is because, in most cases, the rigidity of the 
structure, or the restraint, which is not exactly quantified, is the contributory factor for 
the cold cracking in welds. To overcome this problem, Rigid Restraint Cracking (RRC) 
tests (Satoh & Matsui 1968) were developed. In RRC tests, specimens prepared for a butt 
weld are rigidly clamped during welding and the cooling down period, and the 
restraint intensity is measured. It is varied by varying the specimen length. The test 
enables one to assess the critical reactive stress ($\sigma_{cr}$) and the critical restraint intensity 
($K_{cr}$) for a given material, under a set of welding conditions. 

The disadvantages of the above test are that it requires sophisticated equipment for 
perfect clamping and maintaining constant gauge length during testing. Also long 
lengths of specimens are needed for obtaining low restraint intensity which increases 
the material requirement and cost. 

To obviate these disadvantages, alternative methods known as Elastic Restraint 
Cracking (ERC) tests were developed at CSSR and elsewhere. Of these, the test method 
developed by Povezan (1978) of the Bratislava Welding Research Institute has been 
adopted by us at WRI, Tiruchirapalli and is shown in figure 6. Window type elastic 
frames are used and the test plates are clamped by means of high strength bolts. The 
side arms of the clamping elements are provided with resistance strain gauges enabling 
recording of the reactive force during the tests. The effective restraint of the weld joint 
depends upon the restraint intensity of the frame, the restraint intensity of the specimen and 
the length of the weld bead deposited. The development of the weld reactive force is 
monitored with a time chart recorder. After allowing a day for development of cold 
cracks, the weldment is evaluated by NDT and metallography for the presence of cracks, 
which can easily be correlated with the weld reactive force data. The test method is 
economical and easy to conduct and can serve as a standard test for the evaluation of 
cold cracking susceptibility of structural steels. 


M N Chandrasekharaiah 


Figure 6. (a) Restraining frames for ERC test; (b) ERC test assembly 
(Veeraraghavan et al 1988). Labelled parts: clamping frames, specimens, friction faces, 
high strength bolts and strain gauges. Material: SA 517 Gr.F. 


7. Hot cracking in weldments 

Hot cracks in weldments develop during or after solidification and are generally 
associated with the phase transformation and shrinkage strains. Depending on the 
conditions of cracking, they are classified as solidification cracking, liquation cracking, 
lamellar tearing and reheat cracking. 

Many types of hot cracking tests have been proposed to evaluate the weldability of 
materials (Wilken & Schoenherr 1975). They are divided into two categories: self-restraint 
type and external restraint type. Typical examples of self-restraint tests are the 
Houldcroft test and the Lehigh restraint test. However, these tests give rise to 
considerable scatter in the results due to their inability to reproduce strain fields. These 
strain fields depend on the thermal pattern, which is largely uncontrollable. Typical 
methods which use external restraint forces are the Varestraint test and the tensile 
restraint cracking test, with bending and tensile restraint respectively. 


7.1 Solidification cracking 

During solidification of the weld metal, grains grow from the fusion boundary to the 
central region of the weld pool. Impurities and alloying elements are rejected ahead of 
the growing crystals and become more and more concentrated in the liquid that 
solidifies later. Their presence lowers the melting point of this liquid. As solidification 
proceeds, contraction strains will develop across the weld as a result of the cooling of 
the weld and the surrounding material. If the ductility is lowered sufficiently by the 
persistence of the liquid films, cracking may result longitudinal to the weld bead. 

Figure 7. Schematic illustration of (a) varestraint test, (b) spot varestraint test and (c) 
transvarestraint test. 

The incidence of this problem is most likely in high dilution processes such as the welding 
of the root run and submerged arc welding of plates. Another form of crack transverse 
to the weld is also possible. Thus, a satisfactory assessment procedure should provide for: 

(a) application of controlled strain both longitudinally and transversely about the 
test weld; 

(b) determination of a critical strain parameter as a function of dilution; 

(c) assessment of welding techniques like TIG, SMAW and SAW; 

(d) correlation between test results and practical joints. 

Such tests for assessing solidification cracking could be carried out using both the 
Varestraint (Savage & Lundin 1965) and the Transvarestraint tests (McKeown 1970). 
Both these tests satisfy the requirements ‘a’ to ‘c’ but further work is required to 
establish ‘d’. A third variant of the test called the Spot-Varestraint test (Tigama-Jig test) has 
also been developed (Savage et al 1977). All these tests are schematically illustrated in 
figure 7 (a to c). 

All these tests are similar in principle and differ only in the direction of application of 
strain with respect to weld direction. The test piece is a flat plate fixed at one end 
(figure 7a) or at both ends (figure 7b) or in the middle (figure 7c) on a rigid block the 
surface of which comprises a cylinder of radius R. After the weld is run to a certain 
length (figures 7a & c) or at a spot (figure 7b) the plate is rapidly bent against the 
cylindrical part of the block. The test severity is expressed in terms of the strain which 
varies with the plate thickness and with the radius R. It is relatively easy to evaluate the 
susceptibility to cracking by the strain above which the cracks appear or by the total 
length of the crack for a given strain. Several investigators have used different criteria
for the interpretation of the results of varestraint tests (figure 8) (Dixon 1983). The degree of
cracking can be quantified by the average crack length, the total number of cracks,
the maximum crack length or the cracked HAZ length, all as a function of strain.
Thus, considerable subjectivity exists in the interpretation of results.
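
The dependence of test severity on plate thickness and block radius, and the competing crack measures listed above, can be illustrated with a short sketch. The thin-plate relation strain = t/(2R) is the usual approximation for bending over a die block and is an assumption here, since the source gives no formula; the function names are hypothetical.

```python
def augmented_strain(thickness_mm: float, radius_mm: float) -> float:
    """Approximate outer-fibre strain for a plate of thickness t bent over
    a block of radius R, using the thin-plate assumption strain = t / (2R)."""
    if thickness_mm <= 0 or radius_mm <= 0:
        raise ValueError("thickness and radius must be positive")
    return thickness_mm / (2.0 * radius_mm)


def crack_metrics(crack_lengths_mm):
    """Summarise one test at a given strain by the measures named in the text:
    number of cracks, total, maximum and average crack length."""
    lengths = list(crack_lengths_mm)
    total = sum(lengths)
    return {
        "number": len(lengths),
        "total_length_mm": total,
        "max_length_mm": max(lengths, default=0.0),
        "average_length_mm": total / len(lengths) if lengths else 0.0,
    }


if __name__ == "__main__":
    # Illustrative values only: 10 mm plate bent over a 250 mm radius block.
    print(augmented_strain(10.0, 250.0))
    print(crack_metrics([0.5, 1.5, 1.0]))
```

Reporting all four measures side by side makes it easier to see how the ranking of materials may change with the chosen criterion.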

In addition to the varestraint tests, hot ductility tests can also be used to quantify the 
HAZ cracking susceptibility. The hot-ductility test characterises the ductility of the
material at elevated temperatures and relates these data to the cracking susceptibility.
Basically, small tensile specimens are fractured rapidly at some specific temperatures 
during either heating or cooling cycles of a thermal cycle in a thermomechanical 
simulator. The transverse reduction in area of the fractured sample is subsequently 
determined providing a measure of ductility. Both the on-heating and on-cooling 
curves in the vicinity of the solidus temperature can be obtained as shown in figure 9 
and the nil ductility temperature (NDT), nil strength temperature (NST) and the 
ductility recovery temperatures (DRT) determined. In addition to the hot ductility, the 
hot strength of the material can also be evaluated. 

In order to obviate the subjectivity in the interpretation of data in the above tests, Lin
et al (1993) have evolved a new methodology for quantifying HAZ liquation cracking
susceptibility by defining the thermal crack-susceptible region of the material,
which is a material-specific parameter. This region can be determined using Gleeble hot
ductility, spot or longitudinal varestraint tests. However, the authors also acknowledge
that these data alone cannot easily accommodate the variations in weld restraint that
often dictate the occurrence and severity of cracking.

Figure 8. Shape of graphs obtained by the different testing procedures: (a) average crack
length vs strain (Farrar & Garland 1973), crack area vs strain rate (Konishchev et al 1977),
number of cracks vs strain (Nakagawa et al 1974); (b) maximum crack length vs strain
(Savage & Lundin 1965); (c) modification to (a) by Bailey (1978).

Figure 9. Representation of results obtained from hot ductility test. DRT - ductility
recovery temperature, NDT - nil ductility temperature, NST - nil strength temperature,
T_L - liquation temperature.

A similar comparative study of the hot ductility rate test and the modified varestraint test
was attempted in Germany over a wide range of materials obtained from different steel
manufacturers (Rawlings & Wilken 1992). In the hot ductility test, the maximum
transverse deformation rate which can be endured without causing hot cracking on the
surface of the weld deposit was used as the criterion. In the varestraint test, the total
length of all the hot cracks found under the stereomicroscope at a magnification of 25×
was used as the criterion. Remarkably good correlations were obtained between the two
methods on all the materials tested by SMAW, TIG and MIG techniques. The results
are under discussion for standardisation by the German Welding Society.

7.2 Burning or liquation cracking 

At the high temperatures reached during welding in the HAZ adjacent to the fusion 
boundary, local melting may occur as a result of the presence of non-metallic impurities 
giving rise to the phenomenon known as ‘burning’ (Boniszewski & Watkinson 1964). 
These cracks are intergranular with respect to prior austenite grains and occur in the
HAZ normal to the fusion line of the weld (figure 10a). Hemsworth et al (1969) classified
such cracks, also sometimes known as ‘hot tears’, according to whether they occurred
because of segregation leading to grain boundary liquation, precipitation of
brittle constituents, or a ductility dip and consequent tearing.

In ferritic steels the problem is principally caused by carbon, sulphur, phosphorus 
and manganese. An increase in the concentration of the first three elements and 
a decrease in the concentration of the fourth one leads to an increase in the incidence of 
cracking. Control of this problem lies in the ability to avoid susceptible compositions 
with due care to the welding process, trace elements and other deleterious factors. 
Carbide formers are helpful in that they remove the carbon from the solution. An increase
in manganese content and, in general, a higher Mn/S ratio are considered helpful
(Meitzner & Stout 1966).

Figure 10. (a) Hot tear across HAZ and weld metal and (b) lamellar tearing in
T-butt weld.

A possible test for detecting liquation cracking is by mechanical testing of specimens 
at elevated temperatures during a simulated HAZ thermal cycle. A cylindrical test 
specimen is heated by resistance or induction heating and the strength and ductility are 
measured either at the peak temperature or immediately after, and these properties are 
correlated with the occurrence of burning and liquation cracking in the HAZ of the 
welds made with specific levels of heat input (Widgery 1971). 

7.3 Lamellar tearing 

The problem of weld failures has become more common with increased use of higher 
strength and thicker steels, particularly where the plate material is welded normal to its 
surface with high restraint being imposed, as in the case of full penetration T-butt welds. 
The problem may arise where high strain rates may develop in the short transverse 
direction of a plate. Cracking may result which is primarily due to the presence of 
inclusions lying on planes parallel to the plate surface which reduces the short 
transverse ductility. Substantial parts of the fracture path lie in planes parallel to the 
surface of the plate. In contrast to HIC, lamellar tearing may propagate both within 
and outside the visible HAZ (figure 10b). A tensile test is the simplest of the destructive 
tests for examining the short transverse properties. Measurement of reduction in area 
has been shown to correlate well with fabrication susceptibility to lamellar tearing 
(Farrar et al 1969). Some anomalies can be expected where plates of low inclusion
content show low values of reduction in area; this can arise when individual
inclusions are large compared with the specimen cross-sectional area.
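
As a minimal sketch of the measurement referred to above, the reduction in area of a round short transverse tensile specimen can be computed from the initial and final diameters. The function name and the assumption of a circular cross-section are illustrative and do not come from the source.

```python
import math


def reduction_in_area_percent(initial_diameter_mm: float,
                              final_diameter_mm: float) -> float:
    """Percentage reduction in area of a round tensile specimen,
    100 * (A0 - Af) / A0, with areas computed from the diameters."""
    if initial_diameter_mm <= 0 or final_diameter_mm < 0:
        raise ValueError("diameters must be non-negative and d0 positive")
    a0 = math.pi * initial_diameter_mm ** 2 / 4.0
    af = math.pi * final_diameter_mm ** 2 / 4.0
    return 100.0 * (a0 - af) / a0


if __name__ == "__main__":
    # Illustrative values: a 10 mm specimen necking to 8 mm at fracture.
    print(reduction_in_area_percent(10.0, 8.0))
```

A small specimen diameter relative to the inclusion size is exactly the situation in which the anomalies mentioned above would be expected, so the diameter should be reported with the result.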

7.4 Reheat cracking 

When a weld joint is reheated after welding, whether for stress relief or during service, 
the residual stress remaining after welding will be relaxed by high temperature 
deformation. This relaxation is accompanied by plastic deformation in the weld metal 
and HAZ and, at points of stress concentration such as the toes of fillet welds, the degree
of local strain may be considerable. In certain steels with poor high temperature
ductility, the deformation may be too great to be accommodated and cracking may
result. Such cracks, known as reheat cracks or stress relief cracks, are intergranular in
nature and appear in the HAZ and occasionally in the weld metal.

Table 3. Weld cracks and methods of testing.

Potential problem           Specification tests                   Batch quality control tests
--------------------------  ------------------------------------  ---------------------------
Hydrogen induced cracking   Continuous cooling transformation     Chemical analysis
                            assessment                            Taper hardness test
                            Fracture mechanics susceptibility
                            tests
                            Granjon implant test

Solidification cracking     Varestraint and transvarestraint
                            test
                            Hot deformation rate test

Burning or liquation        High temperature test on thermally    Side or face bend tests
cracking                    simulated specimen

Lamellar tearing            Short transverse tensile test         Ultrasonics
                            Crack notched bend test               S T tensiles
                            Automated inclusion counting          Automated inclusion counting

Reheat cracking             Fracture mechanics stress
                            relaxation and constant load tests
                            at elevated temperatures

Historically, reheat cracks were observed in austenitic stainless steels SS 347 and 321 and
then in other grades of low alloy steels used in superheaters, turbine rotors, forgings etc.
(Dhooge & Vinckier 1992). After extensive investigations, Lundin & Khan (1990)
concluded that chemical composition is the most significant factor in PWHT cracking.
The elements found to be deleterious are either those that promote formation of
carbides of the M$_2$C or M$_4$C$_3$ types or those which are known to have grain boundary
embrittling effects.

A multitude of tests has been employed by different researchers to evaluate reheat
cracking, and the test results are difficult to compare and sometimes confusing (Lundin
& Khan 1990). The remedy against reheat cracking is careful dressing of the surface of
the weld. Temper beading, if correctly performed, combined with weld dressing, should
provide a microstructure and surface favourable for avoiding initiation and propagation
of cracks.

The various forms of cracking discussed above and the possible tests for evaluating 
the cracking susceptibility are tabulated in table 3. 


8. Weld failures and their repair 

Weld failures can be broadly divided into two categories. The first covers failures that occur
during fabrication, when a weldment fails to satisfy appropriate inspection criteria, e.g.
undercut or porosity which may exceed the permitted limits or, more seriously,
flaws such as hydrogen cracking. These are normally repair welded with adequate
precautions and using well-established repair procedures. The second category
covers weldments that experience cracking or are damaged during service. These
include fatigue cracking, stress corrosion cracking and fracture. Their repair may be
difficult and time consuming and, in some cases, the failure may be in inaccessible
locations without adequate data on the previous history of the components involved,
which makes weld repair very difficult. All the factors to be considered during weld
repair are enumerated by IIW Commission XV in their document No. IIS/IIW-956-87.

In this context, the ‘fitness-for-purpose’ approach using fracture mechanics concepts,
together with a thorough knowledge of the stress, imperfection size and material properties, helps
in taking crucial decisions on whether to repair a weld crack which may not propagate
under the given service conditions. In fact, there is considerable evidence that the repair
of an innocuous imperfection may be positively harmful when working to arbitrary
acceptance criteria. Three case studies from the literature will be briefly described to
illustrate the three conditions enumerated above.

What is considered the most successful and difficult weld repair was carried out
on a steam turbine casing which had developed 21 cracks totalling approximately
508 cm in length (Yosh & Captoon 1992). The failure was due to a combination of low
cycle fatigue and creep. After extensive trials covering weldability, carbon
equivalent, possible distortion and the necessary heat treatments, the repair
was carried out on a 1Cr-1Mo-1/4V material with a preheat of 343 ± 2TC using AWS
8018 B2L iron powder, low hydrogen electrodes, with sufficient clamping to prevent
distortion and sufficient padding to maintain adequate preheat. The job is reported to
have been functioning satisfactorily since 1987.

Perhaps the most striking example of the effect of inappropriate weld repair
procedures was the explosion at the Union Oil Refinery near Chicago in 1984 which
caused the deaths of 17 people (McHenry et al 1986). The failure was associated with
a circumferential weld used to join a replacement section in a monoethanolamine
contactor vessel to the original shell. The replacement was necessary due to the
extensive damage to the original casing caused by HIC. All materials were A516 Gr.70.
The vessel was originally fabricated using submerged arc welding with very high heat
input, which led to low HAZ hardness values. The repair weld done in the field in the
2G position had to be made using SMAW electrodes at a much lower heat input. This
inevitably led to much higher hardness levels in the repair weld HAZ such that, during
service, hydrogen from the process entered the steel and initiated cracking in the
hardened HAZ. This crack propagated by a mechanism similar to hydrogen induced
cracking until full penetration of the vessel wall was reached, and then a catastrophic fast
fracture ensued.

Another example is a decision made not to repair a crack in view of the possible
differences in toughness, in the case of the North Cormorant jacket (Harrison 1981).
The original welds were made by the SAW process and had excellent toughness. At a late
stage of the fabrication of the jacket, cracks were discovered in the longitudinal seams
of the tubular members forming the legs of the structure. Repairs were considered, but
trials carried out prior to repair indicated that a manual welding repair would have much
lower toughness than the original welds. This factor led to the decision not to repair the
pre-existing discontinuities in the seam welds.

Weld joints are particularly tolerant to some types of discontinuity, and fracture
mechanics techniques are now available which allow rational decisions on the significance
of weld discontinuities to be made. If these techniques are more widely applied,
the number of weld repairs carried out could be reduced significantly.

9. Conclusion 

Weld failures and their repairs affect the productivity and safety of any industrial
organisation. While weld failures that occur during fabrication can be repaired rather
easily, those that occur during service pose serious problems during repair. Inaccessibility
of the area, inability to use shop practices with appropriate preheat and
post-weld heat treatments, and lack of adequate data on the previous history of the
components are some of the problems. Hence, during welding of carbon steels, low
alloy high strength steels or stainless steels, adequate data on the weldability of the material,
including carbon equivalent, hydrogen induced cracking, solidification cracking,
lamellar tearing etc., should be established. Several testing methods are available to
evaluate the cracking susceptibility. In addition, fracture mechanics and fitness-for-purpose
criteria should be adopted so that an appropriate weld repair technique can be
judiciously selected.


The author wishes to thank the management of the Bharat Heavy Electricals Limited
for all their encouragement.


Application of fracture mechanics for weld integrity assessment

V R RANGANATH and S TARAFDER 

Fracture & Fatigue Group, Materials Evaluation Division, National Metallurgical
Laboratory, Jamshedpur 831007, India

Abstract. The structural integrity assessment of a weld joint by conventional
techniques is inadequate, because of unavoidable defects in the weld
composite. The stress situation in a component having a defect is quite
different from that of a homogeneous material. The significance of fracture
mechanics to deal with such integrity assessments is brought out. A brief
review on the basic formulations in the application of fracture mechanics is
followed by established guidelines for evaluating the integrity of engineering
components containing crack-like defects.

Keywords. Fracture mechanics; integrity assessment; welding; crack tip
opening displacement; J-integral; fracture toughness; stress intensity factor.


1. Introduction 

The advent of high toughness structural materials necessitated a corresponding 
research in the area of welding - the process which is extensively used in fabrication. 
The specific properties that characterize a good weld are the homogeneity of the 
weldment and matching strength of the weld joint to that of the base metal. While these 
properties are not difficult to achieve in the weld metal zone, in the zone lying between 
the weld zone and the base metal, termed as the heat affected zone (HAZ), control of 
property is often difficult to achieve. The successive heating and cooling of the HAZ 
during multipass welding could result in a microstructure which is prone to brittle 
fracture (Lancaster 1980). In fact, a number of catastrophic failures, resulting in huge
losses in both human and financial terms, have been reported to have originated from
the HAZ of weld joints (Williams & Ellinger 1953; Bennett & Mindlin 1973; Ghosh et al
1990).

In order to evaluate the health of a weld joint, a number of test procedures have been
evolved (ASW 1990). Depending on convenience and mutual agreement between
the manufacturer and the consumer, conventional tests, like the ‘U’ bend test for
assessing ductility and the controlled thermal severity (CTS) test for assessing the tendency
towards cold cracking, are generally carried out. These tests, however, serve only to
assess the weld joint qualitatively as a whole. For a more objective and scientific
assessment, the three zones that constitute the
weld joint, viz., the base metal, the HAZ and the weld metal, have to be individually
assessed with regard to their tendency to cracking (Ranganath et al 1991). It is now
widely recognized that any weld joint will invariably contain defects. Defects could
arise due to improper welding, e.g., voids, slag entrapments or under-cutting. An
incipiently embrittled HAZ microstructure may also be considered as a defect (Barsom
& Rolfe 1987a). During service, all the defects in a weld composite are candidate
sites for cracks to initiate. Once initiated, the cracks grow through a component,
following a path of least resistance, leading to failure. A recent tool for analysing
and characterizing the behaviour of cracks is fracture mechanics. The theoretical and
functional basis of fracture mechanics has been sufficiently well developed for
it to be applied to the prediction and prevention of service failures in engineering
components. The present paper explores the use of fracture mechanics in failure
analysis, life estimation and integrity assessments, with particular reference to
weldments.


2. Origin of fracture mechanics 

Many engineering components fail much below their design stress level, even though 
they are designed and inspected as per the relevant codes. This is because cracks or 
crack-like defects are always present in the material/component and failure is a result of 
such cracks extending to their critical length. The incipient flaws, which act as cracks or 
crack initiators, severely affect the local stress field and the severity of the stress near 
and around these flaws could be significantly higher than the design stress (Kanninen 
& Popelar 1985). Usually, general design practices do not take such stress intensities
into account while calculating the design stresses. Fracture mechanics originates from
this lacuna and provides the basis by which the effects of intensified stresses on the
integrity of components can be accounted for.

2.1 Modes of failure in a cracked body 

The modes of failure are schematically shown in figure 1. The three modes are
represented as I for the opening mode (tensile stress in the y direction), II for the edge sliding mode
(shear stress in the z direction) and III for the screw-sliding mode (shear stress in the x direction). In
metals, most of the crack-related failures occur in mode I. Hence, mode I fracture is
widely discussed (Knott 1973).




Figure 1. Various modes of loading: mode I (opening), mode II (sliding) and
mode III (tearing).

2.2 Stress intensity factor

The stress intensity factor (SIF) is a term coined to characterize the stress magnitude around
the crack tip in a cracked body, and is denoted as $K$. Although in-service components
encounter a mixture of mode I, II and III stresses, the contribution of mode II and III
stresses to the fracture process is negligible. For this reason, the SIF is indicated as
$K_I$, implying opening type of loading (mode I, tensile) on the crack faces. When the
stress intensity approaches a critical value beyond which the stressed component fails
in a catastrophic manner, the applied stress intensity, $K_I$, is termed $K_{Ic}$. This $K_{Ic}$ is
called the critical stress intensity factor. Analogous to the yield stress, $\sigma_y$, $K_{Ic}$ is a material
property. Customarily, it is known as the plane-strain fracture toughness and indicates the
material’s resistance to crack propagation. Mathematically, the SIF is represented as
(Broek 1986):

$$K_I = \sigma\, a^{0.5} f(g), \tag{1}$$

where $\sigma$ is the applied stress, $a$ is the crack depth and $f(g)$ is a factor depending on the
geometry of the stressed component.

As can be seen from (1), $K_I$ is a function of load, geometry of the component and size
of the crack. A number of relationships between the SIF and various body configurations,
for varying crack sizes, orientations, shapes and loading conditions, are available
from handbooks (Tada et al 1973; Rooke & Cartwright 1976).
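
Equation (1) can be evaluated directly once a geometry factor has been taken from a handbook. The sketch below is illustrative only: the function name is hypothetical, and the example value f(g) = sqrt(pi), corresponding to a through crack in a wide plate, is an assumption rather than a value given in the text.

```python
import math


def stress_intensity(stress_mpa: float, crack_depth_m: float,
                     geometry_factor: float) -> float:
    """Mode I stress intensity factor from equation (1),
    K_I = sigma * a**0.5 * f(g), returned in MPa m^0.5."""
    if crack_depth_m < 0:
        raise ValueError("crack depth must be non-negative")
    return stress_mpa * math.sqrt(crack_depth_m) * geometry_factor


if __name__ == "__main__":
    # Assumed example: 100 MPa applied stress, 10 mm crack depth,
    # f(g) = sqrt(pi) for a through crack in a wide plate.
    print(stress_intensity(100.0, 0.010, math.sqrt(math.pi)))
```

Keeping stress in MPa and crack depth in metres gives the result in MPa m^0.5, the unit used for fracture toughness later in the section.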

2.3 Determination of SIF 

Stress intensity factors can be calculated by using numerical methods such as weight
function, boundary collocation and finite element methods (Kanninen & Popelar
1985). Whereas in these techniques most of the conditions are idealized, the experimental
techniques narrow the gap between a service component stress field history and the
laboratory calibration.

A variety of test specimen geometries can be used to determine the fracture toughness,
$K_{Ic}$, of metallic materials as described in ASTM (1992a). Two of the most popular specimen
geometries are the compact tension and the three-point bend specimen. Figure 2 shows
a schematic of these specimens.

2.3a Test procedure: In order to produce a sharp crack, so that the situation in
a service component may be simulated, the test specimen is first subjected to fatigue
pre-cracking under predominantly linear elastic loading conditions. The crack length,
$a$, thus generated should be about half of the specimen width, $W$. The specimen is then
loaded monotonically till it fractures completely. An autographic load vs. displacement
plot obtained during loading is then analyzed for $K_{Ic}$. A typical plot of such a test is
shown in figure 3.

The load point identified as $P_q$ in the plot is considered as the critical load at which crack
extension starts. From $P_q$ onwards, up to the maximum load, the crack extends in
a stable manner, beyond which it becomes unstable. A tentative fracture toughness $K_q$
is calculated as:

$$K_q = [P_q / (B(W-a))]\, a^{0.5} F(a/W). \tag{2}$$

Figure 2. Popular fracture mechanics test specimen geometries. (a) Three point bend
specimen; (b) compact tension specimen.

If the specimen thickness, $B$, and the crack length, $a$, are greater than $2.5 (K_q/\sigma_y)^2$, the $K_q$
is validated as $K_{Ic}$, which has units of MPa m$^{0.5}$ (MN m$^{-1.5}$). $K_{Ic}$ is a material property
and is independent of size and geometry of the specimen.
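
The provisional toughness of equation (2) and the size requirement that follows it can be sketched as below. The function names and the numerical inputs are hypothetical; in practice the geometry factor F(a/W) would be taken from the relevant test standard for the specimen type, which is not reproduced here.

```python
import math


def provisional_toughness(p_q_mn: float, thickness_m: float, width_m: float,
                          crack_length_m: float, geometry_factor: float) -> float:
    """Tentative toughness K_q from equation (2):
    K_q = [P_q / (B (W - a))] * a**0.5 * F(a/W), in MPa m^0.5
    when P_q is in MN and lengths are in metres."""
    ligament = width_m - crack_length_m
    if ligament <= 0:
        raise ValueError("crack length must be smaller than specimen width")
    net_stress = p_q_mn / (thickness_m * ligament)
    return net_stress * math.sqrt(crack_length_m) * geometry_factor


def is_valid_kic(k_q: float, yield_stress_mpa: float,
                 thickness_m: float, crack_length_m: float) -> bool:
    """Size check from the text: B and a must both exceed 2.5 (K_q / sigma_y)^2."""
    limit = 2.5 * (k_q / yield_stress_mpa) ** 2
    return thickness_m > limit and crack_length_m > limit


if __name__ == "__main__":
    # Assumed inputs: 0.05 MN load, 25 mm thick, 50 mm wide, a/W = 0.5, F = 2.0.
    k_q = provisional_toughness(0.05, 0.025, 0.050, 0.025, 2.0)
    print(k_q, is_valid_kic(k_q, 500.0, 0.025, 0.025))
```

If the check fails, the measured value remains a specimen-dependent K_q and a thicker specimen, or one of the elastic-plastic parameters discussed later, would be needed.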

Once the $K_{Ic}$ of a material is known, it can be readily applied to evaluate the criticality of
a crack detected in a service component during routine inspection by employing the
equation:

$$a_{cr} = [K_{Ic} / (\sigma F(a/W))]^2, \tag{3}$$

where $a_{cr}$ is the critical crack length, $K_{Ic}$ is the plane-strain fracture toughness, $\sigma$ is the
net section stress and $F(a/W)$ is the specimen geometry factor, based on the crack length to
width ratio. Depending on the difference between the detected crack length and the
calculated critical crack length, as per (3), decisions can be taken on immediate
replacement or on the time interval for the next inspection.
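
A minimal sketch of how equation (3) might be used after an inspection is given below. It treats the geometry factor as a constant, which is a simplifying assumption because F depends on a/W; the function names and example numbers are hypothetical.

```python
import math


def critical_crack_length(k_ic: float, stress_mpa: float,
                          geometry_factor: float) -> float:
    """Critical crack length from equation (3),
    a_cr = [K_Ic / (sigma * F)]**2, in metres for K_Ic in MPa m^0.5."""
    if stress_mpa <= 0 or geometry_factor <= 0:
        raise ValueError("stress and geometry factor must be positive")
    return (k_ic / (stress_mpa * geometry_factor)) ** 2


def crack_margin(detected_m: float, k_ic: float, stress_mpa: float,
                 geometry_factor: float) -> float:
    """Difference between critical and detected crack length (metres).
    A value at or below zero indicates that the crack is already critical."""
    return critical_crack_length(k_ic, stress_mpa, geometry_factor) - detected_m


if __name__ == "__main__":
    # Assumed example: K_Ic = 50 MPa m^0.5, 100 MPa stress, F = sqrt(pi).
    f = math.sqrt(math.pi)
    print(critical_crack_length(50.0, 100.0, f))
    print(crack_margin(0.010, 50.0, 100.0, f))
```

Because F changes as the crack grows, a fuller treatment would re-evaluate F at each trial crack length and iterate until (3) is satisfied.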



Figure 3. Schematic load displacement curve and the terminology related to a $K_{Ic}$ test plot.

With the advent of high toughness materials, $K_{Ic}$ was found to be inadequate to
represent fracture toughness because linear elastic conditions are not preserved at the
tip of cracks, so that a crack in these materials may not start growing even at the
maximum load. To characterize such high toughness materials, the concepts of
J-integral and crack opening displacements have been introduced (Knott 1973).

3. Other fracture toughness parameters 

3.1 J-Integral 

In its simplest definition, the J-integral is a measure of the energy absorbed by
a cracked body. The $J$ value, in a laboratory specimen, is calculated by measuring the
area under the load-displacement curve, the test procedure being similar to that of
a $K_{Ic}$ test. The equation used is of the form:

$$J = \alpha A / (B(W-a)), \tag{4}$$

where $A$ is the area under the load-displacement curve and $\alpha$ is a factor depending on
the specimen geometry and is obtainable from handbooks. Like $K_{Ic}$, $J_{Ic}$ indicates the
critical fracture toughness value at which a fresh crack surface starts forming in a body
and is a material property, having units of kJ m$^{-2}$. The appropriate standard for $J_{Ic}$
testing is ASTM (1992b).
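
Equation (4) can be sketched numerically by integrating a recorded load-displacement trace with the trapezoidal rule. The function names, the units chosen (kN and mm) and the example value of 2 for the geometry factor are assumptions made for illustration, not values prescribed in the text.

```python
def area_under_curve(displacement_mm, load_kn):
    """Trapezoidal area under a load-displacement record.
    With load in kN and displacement in mm the result is in joules."""
    if len(displacement_mm) != len(load_kn) or len(load_kn) < 2:
        raise ValueError("need two equally long sequences of at least 2 points")
    area = 0.0
    for i in range(1, len(load_kn)):
        step = displacement_mm[i] - displacement_mm[i - 1]
        area += 0.5 * (load_kn[i] + load_kn[i - 1]) * step
    return area


def j_integral_kj_per_m2(area_j: float, geometry_factor: float,
                         thickness_mm: float, width_mm: float,
                         crack_length_mm: float) -> float:
    """J from equation (4), J = alpha * A / (B (W - a)).
    A in joules and lengths in mm give J/mm^2; 1 J/mm^2 = 1000 kJ/m^2."""
    ligament = width_mm - crack_length_mm
    if ligament <= 0:
        raise ValueError("crack length must be smaller than specimen width")
    return 1000.0 * geometry_factor * area_j / (thickness_mm * ligament)


if __name__ == "__main__":
    # Illustrative linear record and an assumed geometry factor of 2.
    area = area_under_curve([0.0, 1.0, 2.0], [0.0, 10.0, 20.0])
    print(area, j_integral_kj_per_m2(area, 2.0, 25.0, 50.0, 25.0))
```

Returning J in kJ m^-2 keeps the result in the unit quoted for J_Ic above.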

3.2 Crack tip opening displacement 

For a cracked body under stress, the crack tip opening displacement (CTOD, $\delta$) is the
extent of opening at the crack tip. On the application of stress to the component, the
crack tip undergoes elastic opening and blunting, and then the crack extends. The amount
of opening at the start of crack propagation is known as the critical crack tip opening
displacement, $\delta_{Ic}$. Experimentally, the CTOD is calculated from the opening measured
at the crack mouth in a loaded specimen (BSI 1979; ASTM 1992c). The mouth opening
displacement (MOD) is measured by a clip-on gauge. The equation for calculating the
CTOD is given in figure 4, which is a schematic of a three-point bend specimen under load.



Figure 4. Three point bend specimen under load.

Figure 5. A schematic crack growth resistance curve.


There exist a number of inter-relations between $K$, $J$ and $\delta$; knowing any one, the
other parameters can be calculated:

$$\delta_{Ic} = J_{Ic}/(2 m \sigma_y) = K_{Ic}^2/(m E \sigma_y). \tag{5}$$

In the above equation, $m$ is a constraint factor and depends on the strain hardening
behaviour of the material.

The CTOD concept has gained popularity, particularly for assessment of weldments, 
because it is believed that the local crack tip environment is better represented by 
CTOD and in the heterogeneity associated with a weldment, the local environment 
may vary considerably. 

3.3 Crack growth resistance curves 

The parameter $J$ is very useful for developing crack growth resistance curves, generally
known as R-curves. A schematic of the R-curve is presented in figure 5.

These curves represent the complete characterization, i.e., initiation and propagation,
of the flaw. The slope of the stable tearing regime is termed the tearing modulus, $T$,
having units of MPa, and indicates the material’s resistance to crack propagation.
Once these curves are obtained from laboratory tests for a given material, $J$ values
could be computed for an existing crack in a component under service, and the
behaviour of the crack, as to whether the flaw is stable or approaching instability, can be
predicted from the R-curve (Turner 1983).
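
The slope of the stable tearing regime can be estimated from laboratory J versus crack extension data by a least-squares straight line. This is only a sketch under the assumption that the selected points already lie in the stable tearing regime; the function name and data are hypothetical. With J in kJ m^-2 and crack extension in mm, the slope comes out in MPa.

```python
def tearing_slope_mpa(crack_extension_mm, j_kj_per_m2):
    """Least-squares slope dJ/da of an R-curve segment.
    J in kJ/m^2 divided by crack extension in mm gives MPa."""
    n = len(crack_extension_mm)
    if n != len(j_kj_per_m2) or n < 2:
        raise ValueError("need two equally long sequences of at least 2 points")
    mean_x = sum(crack_extension_mm) / n
    mean_y = sum(j_kj_per_m2) / n
    sxx = sum((x - mean_x) ** 2 for x in crack_extension_mm)
    if sxx == 0.0:
        raise ValueError("crack extension values must not all be equal")
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(crack_extension_mm, j_kj_per_m2))
    return sxy / sxx


if __name__ == "__main__":
    # Illustrative data points taken from an assumed stable tearing regime.
    print(tearing_slope_mpa([0.5, 1.0, 1.5], [300.0, 400.0, 500.0]))
```

A steeper slope means that more energy is needed per unit of crack extension, which is the sense in which the slope reflects resistance to crack propagation.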

4. Guidelines for fracture assessment 

Several leading organizations have developed guidelines for assessment of defects in
components, based on one of the fracture parameters discussed above. Some of these
are: An engineering approach for elastic-plastic fracture analysis (Kumar et al 1984) by
the Electric Power Research Institute (EPRI) and Guidance on methods for assessing the
acceptability of flaws in fusion welded structures (Garwood 1985) by the Central Electricity
Generating Board (CEGB). The latter has been the most popular, with particular
reference to weldments, and the British Standards Institution has published a document,
PD 6493 (BSI 1991), to streamline procedures for fracture assessment. Salient
features of the document are discussed here.

The PD 6493 guidelines are provided for three levels of assessment and are based on
CTOD design curves. The levels are arranged in order of increasing complexity of treatment.
Level 1 is an initial screening level and provides a simplified method of checking 
whether particular flaws present a fracture problem, or whether the flaws can be 
considered safe without having to use the more complex assessment procedures. 
Level 2 is full treatment and is based on the use of an assessment diagram which 
combines consideration of fracture and local plastic collapse. It gives the user an 
opportunity to assess the safety margins required. The level 3 approach will permit 
greater accuracy but involves greater complexity and more material data. It is 
particularly appropriate where failure occurs by ductile tearing (high work hardening). 

4.1 Information required for assessment 

Before the treatment of three levels of assessment, the essential data required for this 
purpose can be listed as: 

(a) position and orientation of flaw (NDE); 

(b) structural and weld geometry, welding procedure (fabrication); 

(c) stresses and temperatures including transients (all loadings including those
arising during transport, erection and testing, local loads, misalignments, structural
discontinuities, membrane stress, bending stress, thermal and residual
stress, peak stress);

(d) material properties (yield and tensile strength, Young’s modulus);

(e) fatigue data (S-N curves, Paris constants);

(f) fracture toughness ($K_{Ic}$, CTOD, impact toughness);

(g) high temperature behaviour of material (creep rupture, creep fatigue, creep crack
propagation data);

(h) corrosion data (bulk corrosion, stress corrosion cracking, i.e., $K_{ISCC}$).

Once the information as above is obtained (from laboratory tests, stress analysis, 
assumptions based on experience, service conditions and NDE), the assessment is 
carried out, at different levels, as discussed below. 

4.2 Level 1, preliminary assessment 

4.2a Estimation of the acceptability of a known flaw: Applied CTOD is calculated
as:

$$\delta_I = K_I^2/(\sigma_y E) = (\sigma_1^2 \pi a)/(\sigma_y E), \tag{6}$$

where $a$ is the flaw length and $\sigma_1$ is the applied maximum tensile stress.

The flaw is acceptable if $\delta_r^{0.5}$, where $\delta_r$ is the ratio of the applied CTOD, $\delta_I$, to $\delta_{Ic}$ of the material,
is less than 0.707 and $S_r$, the plastic collapse ratio (ratio of the net section stress, $\sigma_n$, to the
flow stress of the material), is less than 0.8. Since this is a level 1 approach, no
additional safety factor is required.

4.2b Estimation of acceptable flaw size: The equivalent tolerable flaw parameter $a_m$
is given as:

$$a_m = (\delta_{Ic} E) / [2\pi (\sigma_1/\sigma_y)^2 \sigma_y]. \tag{7}$$

The relationship already includes a variable safety factor, averaging about 2. The flaw
parameter, $a_m$, may represent a variety of flaw shapes and dimensions of equivalent
severity. Having obtained the flaw parameter, equivalent part-thickness embedded
flaw and surface flaw dimensions can be derived from figure 6.
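
The level 1 checks of equations (6) and (7) can be sketched as follows. The function names, the units (MPa and metres) and the example inputs are assumptions made for illustration; the 0.707 and 0.8 limits are those quoted in the text.

```python
import math


def applied_ctod(sigma_1: float, flaw_length: float,
                 sigma_y: float, youngs_modulus: float) -> float:
    """Applied CTOD from equation (6): sigma_1^2 * pi * a / (sigma_y * E).
    Stresses and E in MPa, a in metres, result in metres."""
    return sigma_1 ** 2 * math.pi * flaw_length / (sigma_y * youngs_modulus)


def level1_acceptable(delta_i: float, delta_ic: float,
                      net_section_stress: float, flow_stress: float) -> bool:
    """Level 1 criteria: sqrt(delta_I / delta_Ic) < 0.707 and S_r < 0.8."""
    sqrt_delta_r = math.sqrt(delta_i / delta_ic)
    s_r = net_section_stress / flow_stress
    return sqrt_delta_r < 0.707 and s_r < 0.8


def tolerable_flaw_parameter(delta_ic: float, youngs_modulus: float,
                             sigma_1: float, sigma_y: float) -> float:
    """Tolerable flaw parameter from equation (7), in metres."""
    return (delta_ic * youngs_modulus
            / (2.0 * math.pi * (sigma_1 / sigma_y) ** 2 * sigma_y))


if __name__ == "__main__":
    # Assumed example: 200 MPa applied stress, 400 MPa yield stress,
    # E = 200 GPa, 5 mm flaw, critical CTOD of 0.1 mm, flow stress 450 MPa.
    d = applied_ctod(200.0, 0.005, 400.0, 200000.0)
    print(d, level1_acceptable(d, 1.0e-4, 220.0, 450.0))
    print(tolerable_flaw_parameter(1.0e-4, 200000.0, 200.0, 400.0))
```

Substituting the flaw parameter from (7) back into (6) returns half of the critical CTOD, which corresponds to the safety factor of about 2 and to the 0.707 limit on the square root of the CTOD ratio.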




Figure 6. Relationship between actual flaw dimensions and the parameter a for (a)
surface and (b) embedded flaws.

4.3 Level 2, normal assessment

4.3a Estimation of the acceptability of a known flaw: In the level 1 assessment only
primary stresses are considered. At level 2, secondary stress is also taken into account
while calculating $\delta_I$ as per (6). Further, the parameter $\delta_r$ is calculated as:

$$\delta_r^{0.5} = (\delta_I/\delta_{Ic})^{0.5} + \rho, \tag{8}$$

where $\rho$ is a correction factor for plasticity interactions of the primary and secondary
stresses.

The plastic collapse ratio, $S_r$, is calculated as previously. Using the level 2 assessment
diagram shown in figure 7, the point represented by ($\delta_r^{0.5}$, $S_r$) is marked on the plot. If the
marked point lies within the assessment curve, then the flaw is acceptable. If the point
lies on or outside the line, the flaw is not acceptable.

4.3b Estimation of limiting flaw size: The limiting flaw size $a_m$ is defined by 
a computed point which lies on the failure assessment line given in figure 7. It may be 
obtained by plotting a series of calculated points for various sizes, and noting the flaw 
size at which the appraisal point falls on the assessment line. 
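The search for the limiting flaw size can also be automated. The sketch below replaces the graphical procedure with a bisection, which is a substitution made here for illustration and assumes the assessment point moves monotonically outward as the flaw grows. The assessment curve of figure 7 is not reproduced in the text, so a placeholder box built from the level 1 limits (0.707 and 0.8) stands in for it; the real level 2 curve would have to be supplied in its place. All names and numbers are assumptions.

```python
import math

def limiting_flaw_size(point_for_size, curve, a_inside, a_outside, tol=1e-6):
    """Bisect for the flaw size whose assessment point falls on the curve.

    point_for_size(a) -> (ordinate, s_r); curve(s_r) -> limiting ordinate.
    a_inside must plot inside the curve and a_outside (larger) outside it.
    """
    def margin(a):
        ordinate, s_r = point_for_size(a)
        return curve(s_r) - ordinate  # positive while the point is inside

    if margin(a_inside) <= 0.0 or margin(a_outside) > 0.0:
        raise ValueError("bracket must start inside and end outside the curve")
    while a_outside - a_inside > tol:
        a_mid = 0.5 * (a_inside + a_outside)
        if margin(a_mid) > 0.0:
            a_inside = a_mid
        else:
            a_outside = a_mid
    return 0.5 * (a_inside + a_outside)

def placeholder_curve(s_r):
    # Stand-in for figure 7: the level 1 box (0.707 up to S_r = 0.8)
    return 0.707 if s_r < 0.8 else 0.0

def example_point(a, sigma_1=100.0, sigma_y=350.0, E=207000.0,
                  delta_c=0.1, rho=0.0, s_r=0.3):
    # Ordinate as in equation (8); S_r is held constant purely for illustration
    delta_applied = sigma_1 ** 2 * math.pi * a / (sigma_y * E)
    return math.sqrt(delta_applied / delta_c) + rho, s_r

a_limit = limiting_flaw_size(example_point, placeholder_curve, 1.0, 500.0)
```

In a real assessment `s_r` would also vary with flaw size through the net section stress, and `rho` would be non-zero whenever secondary stresses are present.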

4.4 Level 3, advanced assessment 

The level 3 method makes use of an assessment diagram derived from the stress/strain 
curve of the material. This allows advantage to be taken of any high work hardening 
characteristics of the material which may prevent failure. A partial safety factor may be 
applied depending on the consequence of failure. For example, the safety factor may be 
taken to be moderate if local failure does not cause complete structural collapse or it 
may be taken to be severe if there is risk of complete structural collapse resulting in 
severe hazard. 



Figure 7. Estimation of acceptability of a known flaw (level 2). 

4.4a Assessment diagram: The assessment diagram is constructed by plotting values 
of $\delta_r^{0.5}$ calculated from: 

$\delta_r^{0.5} = \left[ \frac{E \ln(1+\varepsilon)}{\sigma(1+\varepsilon)} + \frac{\sigma^3 (1+\varepsilon)^3}{2\sigma_y^2 E \ln(1+\varepsilon)} \right]^{-0.5}$ (9) 

against $L_r$, defined as the ratio $\sigma(1+\varepsilon)/\sigma_y$, where $\sigma$ and $\varepsilon$ are stress and strain, respectively, 
of the structure or the component, and $\sigma_y$ is the yield stress. The assessment diagram for 
level 3 is shown in figure 8. This failure analysis diagram can be adopted where the 
stress/strain curve is uncertain (e.g., HAZ regions) and for all materials which do not 
exhibit a yield discontinuity in the stress/strain curve. 
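A minimal sketch of how the level 3 curve might be tabulated from stress/strain pairs is given below. It assumes the bracketed expression in (9) is raised to the power of minus one half, so that the ordinate tends to unity at low load, and it treats the inputs as engineering stress and strain. The function name and the sample point are illustrative assumptions.

```python
import math

def level3_assessment_curve(stress_strain, sigma_y, E):
    """Return (L_r, ordinate) pairs from engineering (stress, strain) points."""
    points = []
    for sigma, eps in stress_strain:
        true_stress = sigma * (1.0 + eps)   # sigma(1 + eps)
        true_strain = math.log(1.0 + eps)   # ln(1 + eps)
        term1 = E * true_strain / true_stress
        term2 = true_stress ** 3 / (2.0 * sigma_y ** 2 * E * true_strain)
        ordinate = (term1 + term2) ** -0.5
        points.append((true_stress / sigma_y, ordinate))
    return points

# One elastic point of an assumed steel (E = 207000 N/mm^2, yield 350 N/mm^2)
curve_points = level3_assessment_curve([(100.0, 100.0 / 207000.0)], 350.0, 207000.0)
```

For an elastic point such as this one the ordinate is close to $(1 + L_r^2/2)^{-0.5}$, which gives a quick sanity check on a tabulated curve.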


4.4b Estimation of acceptability of a known flaw: When $\delta_{Ic}$, a single value of 
toughness of the material, is known, the parameter $\delta_r^{0.5}$ is calculated as per (8) as 
a component of the assessment point. Equation (9) is used for calculating $\delta_r^{0.5}$ while 
constructing the failure assessment diagram. The value of $L_r$ is the ratio of net section 
stress to the yield stress. Note that the denominator is the yield stress, as compared 
to the flow stress used while calculating $S_r$ in the level 1 and 2 approaches. The resulting 
point ($\delta_r^{0.5}$, $L_r$) is plotted on the failure assessment diagram. If the point lies within the 
assessment curve, then the flaw is acceptable; if the point lies on or outside the line, then 
the flaw is not acceptable. 


4.4c Estimation of limiting initial flaw size: The limiting flaw size, $a_{cr}$, is that flaw size 
which touches the failure assessment line tangentially when a locus of all possible 
assessment points is drawn. All the other points on the locus lie outside the area 
bounded by the axes and the assessment line. This corresponds to the limiting load 
factor, F (ratio of limit load to applied load), equal to unity, and is most easily 
obtained from a plot of F as a function of initial flaw size. 

Figure 8. Level 3 failure assessment diagram when complete stress/strain data are 
available. 


5. Assessment for fatigue cracks 

In components subjected to fatigue loading, flaws can originate from any of the existing 
defects, e.g., stress concentrators, inclusions, misalignments, machining/welding defects 
etc. Most of the life of such components is spent at the crack initiation stage itself. Once 
the crack has initiated, it grows steadily and the crack growth in this stage can be 
represented by the Paris Law, which can be mathematically stated as: 

$da/dN = A(\Delta K)^n$ (10) 

where $da/dN$ is the crack growth per cycle, $\Delta K$ is the operating stress intensity range 
(difference between maximum and minimum operating stress intensity) and $A$ and $n$ are 
material constants obtained from laboratory tests. Schematically, the Paris Law is 
shown in figure 9. 
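Because the Paris Law is a straight line on logarithmic axes, the constants can be recovered from test data by a least-squares line through log(da/dN) against log(ΔK). The sketch below is one hypothetical way to do this in plain Python. The data points are synthetic, generated from assumed constants, and are not measurements.

```python
import math

def fit_paris_constants(delta_k, dadn):
    """Least-squares fit of log10(da/dN) = log10(A) + n * log10(delta K)."""
    xs = [math.log10(k) for k in delta_k]
    ys = [math.log10(rate) for rate in dadn]
    x_mean = sum(xs) / len(xs)
    y_mean = sum(ys) / len(ys)
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    n = sxy / sxx                       # slope gives the exponent n
    A = 10.0 ** (y_mean - n * x_mean)   # intercept gives the coefficient A
    return A, n

# Synthetic regime II data built from assumed constants A = 3e-13, n = 3
ranges = [300.0, 500.0, 800.0, 1200.0]       # N mm^-3/2
rates = [3e-13 * k ** 3 for k in ranges]     # mm/cycle
A_fit, n_fit = fit_paris_constants(ranges, rates)
```

Only points from the linear (regime II) portion of the curve should be passed to such a fit, since the threshold and fast-fracture regimes depart from the power law.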

Figure 9. Schematic representation of fatigue crack growth rate data showing Paris regime. 

In a laboratory test (ASTM 1992d), standard specimens from the material of interest 
are loaded in fatigue. During the test, the crack increment is monitored along with the 
number of cycles. $\Delta K$ is calculated from the specimen size, load levels and the crack 
length at a given instant. The Paris constants, $A$ and $n$, are calculated respectively from 
the intercept and slope of the linear portion of the $da/dN$ vs $\Delta K$ plot. From the plot in 
figure 9, it is seen that the fatigue crack growth rate curve has three distinct regimes. If 
the operating stress levels are below $\Delta K_{th}$, the threshold stress intensity factor range, 
the component has infinite life. If the operating stresses are approaching $K_c$, as in 
regime III, the component has negligible life and needs immediate replacement. In 
regime II, which is also known as the Paris regime, the crack grows in a steady state manner 
and the remaining life of the component can be predicted by separating the variables 
and integrating for $N$: 

$N_f = \int_{a_i}^{a_{cr}} da/(A(\Delta K)^n)$. (11) 

In the above equation, $N_f$ gives the total life (in terms of cycles), $a_i$ is obtained from 
non-destructive examination of the component, and the critical crack length, $a_{cr}$, is 
obtained from (3). 
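The life integral can be evaluated numerically once ΔK is known as a function of crack length. The sketch below uses a simple midpoint rule. The form ΔK = YΔσ√(πa) with a constant geometry factor Y, the stress range and the crack sizes are all assumptions made for illustration; the ferritic steel constants A = 3 × 10⁻¹³ and n = 3 (mm/cycle, N mm⁻³ᐟ² units) are the default values quoted in this section.

```python
import math

def paris_life(a_i, a_cr, A, n, delta_k, steps=20000):
    """Cycles to grow a crack from a_i to a_cr: integral of da / (A * dK(a)^n)."""
    da = (a_cr - a_i) / steps
    cycles = 0.0
    for j in range(steps):
        a_mid = a_i + (j + 0.5) * da   # midpoint of the j-th crack increment
        cycles += da / (A * delta_k(a_mid) ** n)
    return cycles

def delta_k_through_crack(a, stress_range=100.0, Y=1.0):
    # Assumed form: dK = Y * d(sigma) * sqrt(pi * a), in N mm^-3/2
    return Y * stress_range * math.sqrt(math.pi * a)

# Assumed inspection result a_i = 2 mm and critical size a_cr = 20 mm
life = paris_life(2.0, 20.0, 3e-13, 3.0, delta_k_through_crack)
```

For a constant Y and n = 3 the integral also has a closed form, which makes a convenient cross-check on the numerical result. A component-specific ΔK(a) can be passed in through the `delta_k` argument.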

Once the flaw dimensions and orientations are known using standard non-destructive 
examination methods, the flaws are resolved as per the direction of the acting 
stresses. Various guidelines on flaw resolutions and assumptions are described in PD 
6493 (BSI 1991) and some stress analysis handbooks. The flaw thus identified, the 
operating stress cycles and the limiting flaw dimension are the data required for fatigue 
assessment. Fatigue life is then calculated as per (11). Where the constants $A$ and $n$ are 
not available for a particular material, they may be taken as $A = 3 \times 10^{-13}$ and $n = 3$ 
(for $da/dN$ units of mm/cycle; $\Delta K$ units of N mm$^{-3/2}$) for all ferritic steels operating in 
air or other non-aggressive environments at temperatures up to 100°C. For structural 
ferritic steels operating in a marine environment, in the absence of specific 
corrosion-fatigue data, the crack growth constants may be taken as $A = 2.3 \times 10^{-12}$ and $n = 3$. 
These values correspond to the upper bound values of all the published data. For 
carbon and carbon manganese steels, analysis of published data shows that $\Delta K_{th}$ (stress 
intensity range below which a crack does not grow) is about 63 N mm$^{-3/2}$. In welded joints, 
if the residual stress ($\sigma_R$) is not available, it is assumed to be of yield strength magnitude. In 
the absence of data on a particular material, $\Delta K_{th}$ of 63 N mm$^{-3/2}$ may be assumed for 
weldments. Where the materials are non-ferrous and the specific data are not available, 
as a first approximation, the constants can be calculated as: 


$A = 3 \times 10^{-13} (E_{steel}/E)^3$. (12) 

Similarly the threshold stress intensity factor can be obtained as: 

$\Delta K_{th} = 63 (E/E_{steel})$. (13) 


6. Special requirements for assessing weldments 

Fabrication by welding induces complex three-dimensional residual stresses that are 
caused by the heating and cooling effects on the parent metal adjacent to the weld pool. 
Proper fit-up of adjacent pieces, sequencing and positioning of the weld and preheating 
or postheating (stress relieving) of the weld assembly would minimize the residual 
stress. In weldments, the potential crack starters are internal imperfections, such as 
porosity, lack of fusion, trapped slag, weld toes, weld terminators (Signes et al 1967; 
Fisher 1970; Watkinson et al 1970). However, the existence of tolerable imperfections 
cannot be avoided and in some cases, attempts to remove these imperfections may 
result in a condition worse than the original condition (Barsom & Rolfe 1987b). 

When conducting fracture toughness tests on heat affected zone (HAZ) regions, considerable 
variability can occur due to location of the crack tip in different microstructural 
regions. For tests on HAZ regions, it is necessary to section the specimens 
after testing to determine the precise location of the crack tip. In cases where the HAZ 
toughness cannot be assessed, it can be assumed to be the lower of the toughness values 
of the adjacent weld metal and parent metal. 

For assessing as-welded assemblies, the tensile residual stresses may be assumed to 
be equal to the yield strength of the material, for flaws lying in a plane transverse to the 
welding direction. For structures subjected to postweld heat treatment, the level of 
residual stress may be assumed to be 30% of the yield strength. The desirability of 
having appropriate accurate fracture toughness data cannot be emphasised too 
strongly, and it is vital that toughness data be obtained from weld procedure samples at 
the time of construction and recorded if there is any likelihood of flaw assessment being 
required at any later stage. 


7. Concluding remarks 

The stress situation in a flawed component under stress is quite complex as compared 
to a homogeneous component under identical stress. Lack of knowledge on behaviour 
of cracks in weldments has led to severe catastrophic failures in the past. Optimum 
decisions on retiring a component with a crack, or continuing it in operation, can be 
based on fracture mechanics concepts. These concepts are particularly effective in 
dealing with integrity assessment of weldments, wherein defects are most common. 
Fracture toughness evaluation has now been standardized by ASTM and BSI. The 
developments in this area are now at a stage where they are being used routinely in 
space and nuclear engineering components. Guidelines and a few specific software 
codes have also been evolved for fracture assessment by shop-floor inspection engineers 
with minimum exposure to principles of fracture mechanics. 


Studies on fatigue crack growth for airframe structural 
integrity applications 

Abstract. A review is made of efforts at the National Aerospace Laboratories 
in the development of fatigue crack growth prediction technology for 
airframe applications. The research was focused on extension of rainflow 
techniques for crack growth analysis and development of accelerated crack 
growth calculation methods for spectrum loading. Fatigue crack closure 
forms a crucial element of modelling and fractographic techniques were 
developed for its study. These, combined with binary coded event registration, 
enabled crack growth and closure mapping for part-through cracks in 
metallic materials. Experimental research on short cracks at notches led to 
discovery of the hysteretic nature of crack closure, which explains well- 
known history-sensitive local mean stress effects in notch root fatigue. 
Optical fractography of failures obtained under simulated service conditions 
revealed that short cracks do not exhibit any more scatter than long 
cracks at comparable growth rates. The nature of multi-site crack initiation 
and growth of small cracks at notches was investigated and the effort 
extended to lug joints that are widely used in airframe applications. Results 
from this work suggest the possibility of modelling crack growth from a size 
smaller than 50 microns through to failure, thereby accounting for a major 
fraction of total life. 

Keywords. Fatigue crack growth; rainflow technique; airframe structural 
integrity; fractographic techniques. 


1. Introduction 

For many years S-N (stress-life) curves served as the major input in design to meet 
endurance requirements. The curves were established from experimental data obtained 
predominantly on rotating cantilever machines. These work-horses of early experimental 
research on metal fatigue were essentially restricted in application to constant 
amplitude high cycle fatigue. While the application value of such data is considerable in 
the case of some engineering components including axles and weldments in reversed 
bending, the appearance of high performance structures such as airframes and automobiles 
imposed new requirements. The service load environment of many fatigue critical 
structures is often characterized by a mix of an extremely large number of small load 
cycles, which can cause the bulk of damage and a few overloads that can induce 
inelastic response and associated stress-strain hysteresis and residual stresses. The 
combination of Miner’s rule (Miner 1945) and S-N curves can fail to account for the 
complex nature of fatigue damage under such conditions. 

The advent of servohydraulic test technology and computer control introduced the 
capability to test materials and components under conditions that closely relate to 
actual service environment. The concept of servo feedback control made it possible to 
use displacement transducers for testing under axial strain control, an essential 
requirement to determine the relationship between strain amplitude and fatigue life in 
Low-Cycle Fatigue (LCF). Computer control made incremental strain testing easy - 
making it possible to determine the cyclic stress-strain curve from a single test. This 
curve is essential if one is to calculate fatigue life for a notched component under service 
loading. Finally, computer control also made it possible to validate life predictions 
through testing under simulated service load sequences. This was the framework for 
JoDean Morrow’s pioneering work (Morrow 1968) on the notch root mean stress effect 
in LCF and the subsequent development of the Manson-Coffin-Smith-Watson- 
Topper relationships for fatigue life (Manson et al 1966; Topper et al 1970). Wetzel’s 
modelling of the Masing relationship (Wetzel 1971), combined with the Neuber 
equation (Neuber 1961) relating applied elastic stress excursions to notch root inelastic 
stress-strain response provided the analytical foundation for extension of laboratory 
data obtained on smooth coupons in LCF to notched components under service 
loading (Dowling et al 1977). Today’s industry standard procedures for fatigue analysis 
are based on these inputs: general engineering design for endurance, including automotive 
design applications, relies on fatigue analysis using the local stress-strain (LSS) approach. 


2. Why is fatigue crack propagation significant in fatigue analysis? 

It has been long recognized that fatigue crack growth is the final stage leading to failure 
of the material through gradual and irreversible reduction in residual strength. The 
fraction of total fatigue life exhausted on crack propagation (FCP life) is known to 
depend on a number of variables (Schijve 1967). The assumed initial crack length plays 
an obvious role - if crack growth is tracked from a smaller crack length, one would 
account for a larger FCP life interval. High strength materials as a rule show a smaller 
fraction of FCP life as opposed to high toughness materials - while the former may be 
highly resistant to crack initiation, once a crack is formed, failure follows early. Briefly, 
this dichotomy is related to ductility: a given alloy system will exhibit reduction in 
ductility with increase in yield and ultimate strength; crack initiation life reduces with 
increased cyclic inelastic strain (irreversible slip-band formation is accelerated), while 
conversely, crack growth is restrained by the formation of a large plastic zone (whose 
size increases with reduced yield stress). This enhances resistance to fracture in two 
ways: energy dissipation occurs over a larger volume of material (Raju 1980) and crack 
tip blunting associated with local yield reduces local stress concentration (Christensen 
1959). With increase in stress level, a larger fraction of total life is spent on FCP (when 
viewed from a given initial crack length to failure). In fact, recent studies on short crack 
growth at notches revealed that crack growth at high stress level can commence 
without any incubation period (Newman et al 1992; Prakash et al 1994). 

As finite geometry affects critical crack size, a large structural component will permit 
a larger crack than a smaller component, particularly in a thin section promoting 
plane-stress conditions with a proportional increase in critical crack size. It would 
follow that in many large built-up, thin-walled structures like airframes, a crack may 
initiate early at a stress raiser under conditions of plane strain, then propagate into 
a through crack that may grow for a substantial time, particularly if local stiffeners 
inhibit rapid increase of stress intensity with crack length (Poe 1971). One may note 
that airframe design requirements recommend the implementation of multiple-load 
path configuration, which in a deformation controlled situation will always lead to 
stress relaxation in a cracked element. S-N curves obtained on small round coupons are 
unlikely to provide useful inputs for estimating the endurance of such structural 
components. 

Operational environment provides compelling reasons to address endurance from 
a crack growth viewpoint. Current accent in technology development is on the ability 
to monitor the condition of a structure in service. New design standards call for repair, 
overhaul and eventual retirement for cause, rather than on the basis of calendar years 
or hours of service. There is no practical way to track exhaustion of life to crack 
initiation. Indeed, there is no parameter that can be said to represent damage in the 
stage of crack initiation. The one exception to this rule may be the observation that 
residual compressive stresses induced as part of the manufacturing process tend to 
relax prior to crack initiation. Crack size on the other hand serves as a direct indication 
of damage. By defining an inspection interval based on conservative estimates of FCP 
life to failure and after defining initial crack size based on inspection capability, modern 
airframes are operated safely for over thirty calendar years or 60 to 120 thousand hours 
of useful service. 

The origin of design, operation and retirement based on assessment of residual 
strength in the presence of cracks goes back to the sixties. The United States Air Force 
which happens to be the world’s largest operator of aircraft was the first to recognize 
the significance of residual strength and life in the presence of cracks. Plagued by 
a series of catastrophic wing failures on B-47 aircraft in 1958, a programme was 
initiated to formulate new design standards for airframes (Lincoln 1994). Within ten 
years, MilSpec 1651A was evolved as the “USAF Aircraft Structural Integrity Program” 
(ASIP). This standard relies on fracture mechanics criteria for design purposes 
and is widely used in contemporary aircraft design. The success of ASIP has led to the 
evolution of similar standards for aeroengines (ENSIP). 

The past 30 years have seen many advances in understanding the mechanics and 
mechanisms of fatigue crack growth. Over this period, the discipline of Fracture 
Mechanics has developed with analytical and numerical tools that handle cracks of 
virtually any size and shape in both isotropic as well as anisotropic materials including 
composites and ceramics. The appearance of Transmission and Scanning Electron 
Microscopy (TEM and SEM) together with computer-controlled servohydraulic load 
frames contributed greatly to better understanding of mechanisms that control the 
growth of a fatigue crack under both simple as well as complex load sequences. 

This paper is an attempt to review research effort on FCP at National Aerospace 
Laboratories (NAL) over the past 15 years in a few selected areas which appear to be of 
practical significance: 

• Nature of cumulative fatigue damage in FCP under random loading, representative 
of service conditions, including the contribution of individual cycles to overall 
fatigue damage, 

• Phenomenon of fatigue crack closure, which effectively renders a fatigue crack closed 
even under tensile loading and is responsible for stress ratio effects and also for load 
interaction effects, and 

• Growth of small, part-through cracks at notches that occupies a sizable portion of 
total fatigue life. 


3. Rainflow cycle counting in cumulative fatigue damage analysis 

Laboratory test data to characterize material resistance to fatigue are obtained 
under constant amplitude loading. Service load environment on the other hand is 
random in nature. Fatigue analysis for an arbitrary load sequence is complicated 
by the absence of complete fatigue cycles that characterized laboratory test conditions 
during data generation, thereby violating an important similarity criterion. Consider 
the sequence of loads shown in figure 1 along with the simulated notch root inelastic 
stress-strain response. We find that the small load excursions 1-3, 5-7 and 10-12 
constitute closed fatigue cycles - as indicated by the closed hysteresis loops in the 
stress-strain plot. Fatigue damage for these cycles can be computed both in terms 
of exhaustion of fatigue life in crack initiation and in terms of crack extension using 
available expressions based on laboratory test data for constant amplitude loading. 
While these cycles are of the same applied magnitude, the stress-strain simulation 
clearly indicates different local mean stress for each of these cycles due to the effect 
of superposed overload and underload. Local stress/strain amplitude however will 
always be identical and load history insensitive, given a stable and symmetric cyclic 
stress-strain curve. Had the notch response been totally elastic, 
local mean stress due to the three cycles would also have been identical and load history 
insensitive. Let us now consider excursions 3-5 and 8-10. While they appear to 
be complete fatigue cycles when viewed as applied load, clearly, the associated 
stress-strain excursions indicate they are not. Obviously, one cannot compute fatigue 
damage for these load excursions. However, should there be one more load excursion 
starting from 13 and meeting or crossing the load level at 4, the hysteresis loop 
spanning the points 9 and 4 would close, permitting damage estimates due to the load 
cycle 4-9-4. 

Figure 1. Arbitrary sequence of loads and simulated notch root stress-strain 

Figure 2. Effect of minor stress range and stress ratio on envelope of notch root 
mean stress variation (Sunder et al 1984). 

The association of closed stress-strain hysteresis loops with closed fatigue cycles 
forms the basis for the rainflow cycle counting method invented by Endo (Endo et al 
1967). It provides the framework for extending constant amplitude test data to 
random service load sequences by reducing any given arbitrary load sequence into 
a table of closed fatigue cycles. ASTM Standard E-1049-85 incorporates rainflow as 
a recommended procedure for analysis of arbitrary load-time histories. 
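The reduction of a load sequence into closed cycles can be sketched with the three-point counting rule commonly associated with ASTM E 1049. The implementation below is an illustrative assumption, not the code used in the work reviewed here, and the short load sequence in the example is hypothetical. Ranges that never close are reported as half cycles.

```python
def turning_points(series):
    """Keep only the peaks and valleys (plus the end points) of a load series."""
    pts = []
    for x in series:
        if pts and x == pts[-1]:
            continue                      # skip repeated values
        if len(pts) >= 2 and (pts[-1] - pts[-2]) * (x - pts[-1]) > 0:
            pts[-1] = x                   # same direction: extend the excursion
        else:
            pts.append(x)
    return pts

def rainflow(series):
    """Return a list of (range, mean, count) with count 1.0 or 0.5."""
    cycles = []
    stack = []
    for point in turning_points(series):
        stack.append(point)
        while len(stack) >= 3:
            x = abs(stack[-1] - stack[-2])   # most recent range
            y = abs(stack[-2] - stack[-3])   # previous adjacent range
            if x < y:
                break
            if len(stack) == 3:
                # Range Y contains the starting point: count a half cycle
                cycles.append((y, 0.5 * (stack[0] + stack[1]), 0.5))
                stack.pop(0)
            else:
                # Range Y is a closed loop: count one cycle, drop its two points
                cycles.append((y, 0.5 * (stack[-2] + stack[-3]), 1.0))
                del stack[-3:-1]
    # Whatever remains did not close and is counted as half cycles
    for i in range(len(stack) - 1):
        cycles.append((abs(stack[i + 1] - stack[i]),
                       0.5 * (stack[i] + stack[i + 1]), 0.5))
    return cycles

counted = rainflow([-2, 1, -3, 5, -1, 3, -4, 4, -2])
```

Each closed cycle found this way corresponds to a closed stress-strain hysteresis loop, so damage can be assigned to it from constant amplitude data. The cycle mean is retained because the damage analysis needs mean stress or stress ratio information.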

The rainflow method is essentially based on LCF/LSS concepts. It may be noted that 
given stable cyclic stress-strain properties and absence of creep, rainflow counted cycles 
will be notch response insensitive. For this reason, cycle counting performed even on 
a nondimensionalised load sequence will be valid. As shown in figure 1, the notch root 
stress and strain amplitude corresponding to counted applied load cycles can be 
directly computed through Neuber analysis, without recourse to cycle-by-cycle simulation 
of notch root stress-strain response. Further, Dowling & Khosrovaneh (1989) 
have shown that history-dependent variation in local mean stress-strain will occur 
within a pre-defined envelope that is controlled by the largest load cycle in a given 
sequence. Another study showed that the extent of this variation for individual cycles 
in the rainflow table will be inversely proportional to their amplitude: small cycles will 
see greater history dependence than large cycles (see figure 2) (Sunder 1992). It follows 
that, given a rainflow table, upper and lower bound fatigue life can be determined 
without recourse to cycle-by-cycle simulation of notch root stress-strain. For many 
spectra of practical interest, fatigue life estimates can be accelerated without losing the 
effect of load history. This illustrates the potential of the rainflow method in ‘digesting’ 
complex load histories. 
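To illustrate how a counted load cycle might be converted to notch root stress and strain amplitude, the sketch below solves the Neuber relation σε = (K_t S)²/E together with a cyclic stress-strain curve. The Ramberg-Osgood form of that curve, the function name, and every material constant are assumptions introduced here; the text specifies neither a curve form nor values.

```python
def neuber_amplitude(s_a, kt, E, K_cyc, n_cyc):
    """Notch root stress and strain amplitude for nominal stress amplitude s_a.

    Assumed cyclic curve: eps = sigma/E + (sigma/K_cyc)**(1/n_cyc).
    """
    target = (kt * s_a) ** 2 / E          # Neuber product sigma * eps
    lo, hi = 0.0, kt * s_a                # the elastic solution is an upper bound
    for _ in range(200):                  # bisection on the local stress amplitude
        mid = 0.5 * (lo + hi)
        eps = mid / E + (mid / K_cyc) ** (1.0 / n_cyc)
        if mid * eps < target:
            lo = mid
        else:
            hi = mid
    sigma = 0.5 * (lo + hi)
    eps = sigma / E + (sigma / K_cyc) ** (1.0 / n_cyc)
    return sigma, eps

# Hypothetical aluminium-like constants (N/mm^2) and a notch with Kt = 3
sigma_a, eps_a = neuber_amplitude(s_a=100.0, kt=3.0, E=70000.0,
                                  K_cyc=650.0, n_cyc=0.1)
```

Applying such a function to each entry of the rainflow table gives local amplitudes without simulating the notch response reversal by reversal.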


4. Validity of rainflow for crack growth analysis 

The nature of fatigue damage accumulation by way of fatigue crack extension is not 
analogous to notch root damage accumulation as handled by the LCF/LSS approach. 
Therefore, the concept of rainflow cycle counting cannot be automatically extended to 
fracture mechanics handling of fatigue crack growth. The fatigue crack tip moves as the 
crack extends. It leaves behind a wake that causes crack closure (a detailed consideration 
of the closure phenomenon is forthcoming), effectively truncating the applied 
load sequence below a certain level. Finally, crack extension is assumed to occur only in 
the rising portion of a load cycle, while notch root reversed slip accumulation occurs 
equally on both the rising as well as falling half of an applied load cycle. 

An experimental study was performed at NAL to examine the validity of rainflow for 
crack growth analysis (Sunder et al 1984). Typical results appear in figure 3. Figure 3a 
shows one of the load sequences used in the study. It consists of a number of small cycles 
embedded within a large cycle on the rising half (segment A) and falling half (segment 
B). When rainflow counted, segments A and B are identical (though when viewed as 
rising excursions only, the two are very different). Further, the rainflow counted cycles 
in the two segments are applied as separate blocks of cycles (segments C and D). As 
shown by the fractograph in figure 3b, each of these segments leaves behind a discernible 
striation whose quantitative analysis in figures 3c and 3d clearly demonstrates that: 

• segments A and B result in identical crack extension, and 

• crack extension from individual counted cycles in a segment adds up to match the 
overall crack extension in the segment. 

In contrast, conventional crack growth computation on the premise that the crack 
grows during any rising load interval (rainflow is neglected) would have indicated that: 

• crack extension in B will be greater than in A, and 

• cumulative crack extension from individual rainflow counted cycles will be greater 
than estimates for A and B based on rising load excursions. 

Figure 3. Typical results from fractographic validation of cycle counting techniques 
(Sunder et al 1984). (a) One block of load sequence. This sequence was 
repeated to failure. (b) Typical fractograph, showing crack extension in individual 
segments of load sequence. Repetition of segments removes ambiguity in striation 
measurement. (c) Digitized fractograph and (d) equivalent fractograph for quantification 
of striation spacing, which correlates with estimates from rainflow cycle 
counting. 

Thus, fractographic evidence appears to back up the validity of rainflow cycle 
counting for fracture mechanics analysis. A similar conclusion would follow if fatigue 
crack growth is viewed from energy balance considerations (Raju 1980). Figure 4 shows 
a schematic of crack tip response to cyclic loading, with the associated stress-strain 
hysteresis loops. Note that the effect of fatigue crack closure is also shown. It was 
pointed out earlier that for rainflow on the applied load sequence to be valid for notch 
root inelastic stress-strain, the cyclic stress-strain curve has to be stable. In a similar 
fashion, for rainflow analysis on applied load sequence to be valid for the associated 
stress-intensity sequence, crack extension over the sequence should be negligible when 
compared to crack tip plastic zone size. This condition may not be satisfied at higher 
growth rates, particularly under transport aircraft spectrum loading (Sunder et al 
1984). The restriction is illustrated by figure 5 with a schematic of loads in a full flight 
(ground-air-ground cycle). The overload seen in the middle of the flight will close into 
a full rainflow counted cycle only when combined with the rising load excursion before 
the first small cycle and the falling excursion after the last small cycle. The crack 
extension during the intervening small cycles can be considerable, requiring rainflow to 
be carried out on the stress intensity, rather than load sequence. Such an exercise would 
have to be iterative in nature: to determine stress intensity sequence, crack growth has 
to be tracked, which strictly speaking, requires prior rainflow analysis. 

5. Applications of rainflow to crack growth analysis and testing 

As shown in the preceding sections, fatigue analysis can be performed exclusively on the 
rainflow cycle count table, without recourse to cycle-by-cycle estimates. The advantages 
of this option are obvious from the viewpoint of computation speed. Even more 

Figure 5. (a) Load excursions in a typical flight, (b) Major cycle. 


cycle count from the following adaptation of the Elber equation for crack growth rate 
(Elber 1971): 

$(da/dN)_{av} = (C/n) \sum_{i=1}^{n} (\Delta K_{eff,i})^m$ (1) 

where $\Delta K_{eff,i}$ is the effective stress intensity range computed for the ith rainflow counted 
cycle. In the summation, crack length is assumed to be constant. 
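A minimal sketch of this calculation on a rainflow table is given below. It assumes an Elber-type power law with coefficient C and exponent m, and a single constant crack opening level below which a cycle contributes nothing. The function names, the opening level and all numbers are hypothetical and serve only to show the bookkeeping.

```python
def effective_range(k_max, k_min, k_op):
    """Part of a cycle's stress intensity range that lies above the opening level."""
    return max(k_max - max(k_min, k_op), 0.0)

def average_growth_rate(cycles, k_op, C, m):
    """Average crack growth per cycle over the counted (k_max, k_min) cycles.

    Crack length is taken as constant over the summation.
    """
    ranges = [effective_range(k_max, k_min, k_op) for k_max, k_min in cycles]
    return (C / len(ranges)) * sum(r ** m for r in ranges)

# Three hypothetical counted cycles and an assumed opening level of 5
table = [(20.0, 0.0), (12.0, 8.0), (6.0, 2.0)]
rate = average_growth_rate(table, k_op=5.0, C=1e-10, m=3.0)
```

Since the sum runs over the rainflow table rather than the raw load sequence, the cost of one evaluation depends only on the number of counted cycles.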

6. Cumulative damage in fatigue crack growth 

Equation (1) was used as the basis for an accelerated fatigue crack growth prediction 
method that is described in Sunder (1992). By operating on the table of rainflow 
counted cycles, rather than on the source load sequence, cycle-by-cycle estimates are 
replaced by calculations on the rainflow table, which is considerably shorter. Crack 
growth analysis is thereby highly accelerated. Even so, the quality of predictions was 
found to be comparable or superior to the best available methods as indicated by the 
results in table 1. The predictions listed in the table are for data used in a Round-Robin 
exercise involving a variety of aircraft wing load spectra. The results are for a 6 mm 
thick Al-Cu alloy sheet material. As a rule, spectrum load fatigue crack growth 
predictions are better for larger material thickness. Spectrum load crack growth rate 
estimates are shown in figure 6 along with experimental data obtained at NAL on 
7.5 mm 8090-T8771 Al-Li alloy plate stock and 1.6 mm thick 8090C-T81 sheet 
material. 

It is likely that for thinner sections down to 1 mm, characteristic of airframe skin 
applications, prediction quality would have been worse. This may be attributed to the 
influence of other load interaction mechanisms, notably crack-front incompatibility 
(Schijve 1974; Zuidema & Blaauw 1988), which are known to occur at smaller crack 
lengths in thinner material, thereby affecting growth in its early stages. Nevertheless, the 
sound analytical background of predictions based on rainflow, their compatibility with the 
LSS/LCF approach and the accelerated computations, thanks to the assumption of 
a constant closure stress, render them suitable for engineering application. The 
discovery of hysteresis in crack closure (whose description is forthcoming) and its 
implementation in the prediction model extended the scope of the analysis to short 
cracks at notches, thereby spanning a much larger fraction of total fatigue life. 

Figure 6. Estimated crack growth rates under Light Combat Aircraft (LCA) 
spectrum loading (curve) compared with experimental data (points). (a) 7.5 mm 
thick 8090-T8771 Al-Li plate material, (b) 1.6 mm thick 8090C-T81 sheet material. 

When fatigue crack growth estimates are performed directly on the rainflow cycle 
count, it is possible to study the cumulative nature of fatigue damage, including the 
contribution of individual fatigue cycles to crack growth. Such an exercise was 
performed for a variety of load spectra (Sunder 1992). Figure 7 shows typical results 
from a cumulative damage analysis for crack growth under the FALSTAFF and 
TWIST load spectra. FALSTAFF is a fighter aircraft wing loading standard, 
dominated by manoeuvre loading, while TWIST applies to transport aircraft and is 
characterized by a large number of gust loads. 

The nondimensionalised Range-Damage versus Exceedance (RDE) diagrams in 
figure 7 demonstrate qualitative differences between the two spectra. Each curve is 
identified by an encircled number. Curve 1 is the applied load range exceedance. It 
shows what percentage of the load cycles in the load spectrum exceeds a given applied 
stress range. Note that information on stress ratio of individual load cycles is lost in this 
curve. However, the concerned data are accounted for in damage analysis. Curve 
2 shows exceedance of effective stress range, assuming linear cumulative damage (load 
interaction is ignored). It accounts for crack closure in individual load cycles as 
experienced under constant amplitude conditions. Curve 3 represents exceedance of 
effective stress range, after accounting for load interaction - it is assumed that closure 
in the largest cycle of the rainflow count will dominate. Curve 2 is usually above curve 3, 
indicating that damage under spectrum loading is retarded due to load interaction. 
Note, however, that in the case of the TWIST spectrum there are locations where 
curve 3 goes over curve 2, indicating the reverse. Finally, curve 4 represents damage 
exceedance. 

Figure 7. Range-damage versus exceedance (RDE) diagrams for L73 Al-Cu sheet 
material under TWIST (a) and FALSTAFF (b) load spectra. Shown in the figures is the 
cumulative damage content of the MiniTWIST and ShortFALSTAFF load spectra 
compared to the full sequences. 

In the case of the FALSTAFF spectrum, curve 4 shows that just 10% of the (largest) 
load cycles account for about 98% of all the damage. 90% of the (smaller) cycles could 
have been excluded from analysis without any impact on crack growth estimates. This 
explains why the Short-FALSTAFF spectrum widely used in tests gives the same 
results as the full spectrum (Schuetz 1989). The reverse holds true for the TWIST 
spectrum. We find that 10% of the cycles account for about 50% of total damage. While 
the FALSTAFF damage curve asymptotically approaches the 100% mark, the TWIST 
curve approaches peak value in a log-linear fashion. It would follow, that for the 
TWIST spectrum (and many other transport aircraft spectra as well), the large number 
of small cycles contributes significantly to damage. Figure 7a explains why the 
Mini-TWIST spectrum will provide retarded crack growth rates, as observed in the literature 
(Schuetz 1989). 

The RDE diagram provides a simple format to assess the fatigue damage potential of 
a load spectrum, without cumbersome cycle-by-cycle analysis and shows potential for 
application in endurance/durability studies as well as in testing. 
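
A damage exceedance curve of the kind described above can be sketched in a few lines. The sketch assumes that damage per cycle is proportional to the effective stress range raised to the power law exponent `m` at constant crack length; the function name and the input format (a flat list of effective ranges, one per counted cycle) are hypothetical.

```python
def damage_exceedance(effective_ranges, m):
    """Return [(cycle_fraction, damage_fraction), ...], largest cycles first.

    effective_ranges : effective stress range of each rainflow-counted cycle
                       (zero or negative values mean the crack stays closed)
    m                : power law exponent (assumed damage ~ range ** m)
    """
    ordered = sorted(effective_ranges, reverse=True)
    damage = [max(r, 0.0) ** m for r in ordered]
    total = sum(damage)
    if total == 0.0:
        raise ValueError("no cycle contributes damage")
    n = len(ordered)
    curve = []
    running = 0.0
    for i, d in enumerate(damage, start=1):
        running += d
        curve.append((i / n, running / total))
    return curve
```

Reading the damage fraction at a cycle fraction of 0.1 gives the share of total damage carried by the largest 10% of cycles, which is the quantity compared above for the FALSTAFF and TWIST spectra.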


7. Realtime application of rainflow 

Experience shows that the fatigue process is load-history sensitive. In other words, 
should there be a need to evaluate endurance of a structural component, fatigue testing 
must be performed under a load sequence closely simulating the service environment. 
Computer controlled servohydraulic test systems permit such testing. The cost of such 
testing, however, increases with test duration and an important question arises as to 
whether the load spectrum can be truncated to omit load cycles that may not anyway 
contribute much by way of fatigue damage. The RDE diagrams shown in figure 
7 provide the analytical framework for omitting cycles from a load spectrum. This was 
the basis for the derivation of the Mini-LCA spectrum from the full LCA spectrum. As 
shown by the RDE diagram and crack growth rates in figure 8, while the Mini-LCA 
spectrum contains only 10% of the load cycles, it causes the same fatigue damage as the 
full LCA spectrum: crack growth rate data from the two fall into a single scatter band. 

The Mini-LCA spectrum is an example of rainflow and RDE diagram based 
spectrum editing prior to a test. If cycle omission criteria can be specified prior to or 
even in the course of a test, an on-line rainflow analysis can be performed by the test 
application program on load sequence segments before their application on the test 
specimen (Sunder 1989). An example of a ‘variable’ criterion is omission stress intensity 
range, which will cause a different number of cycles to be omitted depending on crack 
size. Figure 9 shows a schematic of how a source load sequence may be edited. Note 
that the sequence in figure 9e cannot be condensed any further because the rainflow 
algorithm will not see any more closed cycles. Thus, rainflow when implemented on 
a flight-by-flight basis will guarantee that at least the most severe cycle will be applied 
on the test coupon - irrespective of the specified omission criteria. 
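
Range-based editing of a load segment can be sketched as below. The source does not specify the counting algorithm used in the test software, so this sketch assumes the common four-point rainflow criterion: an inner range is a closed cycle when it is no larger than the ranges on either side of it. The function name and the fixed `omit_range` threshold are hypothetical.

```python
def omit_small_cycles(turning_points, omit_range):
    """Remove closed rainflow cycles whose range is below omit_range.

    turning_points : alternating peaks and valleys of one load segment
    omit_range     : cycles with a smaller range are dropped
    """
    pts = list(turning_points)
    i = 0
    while i + 3 < len(pts):
        a, b, c, d = pts[i:i + 4]
        inner = abs(c - b)
        closed = inner <= abs(b - a) and inner <= abs(d - c)
        if closed and inner < omit_range:
            # Drop the inner pair; the excursion now runs directly from a to d.
            del pts[i + 1:i + 3]
            # Earlier points may form a closed cycle with the new neighbour.
            i = max(i - 2, 0)
        else:
            i += 1
    return pts
```

With fewer than four turning points left, no closed cycle can be identified, so the largest excursion of the segment survives however large `omit_range` is made. This mirrors the remark above that the most severe cycle is always applied.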

The application program ‘pipelines’ test code into the microprocessor-based test 
controller (Sunder 1985; Sunder & Venkatesh 1990). Thus, while the test controller is 
busy applying a segment of the load sequence, the host computer performs on-line 
rainflow to prepare the next segment. Cycles are omitted much faster than the load 
frame can apply the remaining load cycles. To the observer, the test appears to 
proceed uninterrupted. 

Figure 8. (a) RDE diagram for crack growth in LCA spectrum and L73 Al-Cu 
sheet material. Line shows the basis for derivation of MiniLCA spectrum. (b) 
Fatigue crack growth rates under full LCA and MiniLCA spectrum loading. 

Figure 9. Consequences of on-line rainflow-based spectrum editing (Sunder 1989): 
(a) initial load sequence, (b) cycles below crack closure load omitted, and (c, d, e) 
cycles of progressively larger range omitted. The sequence in (e) cannot be condensed 
any further. 

The scheme for on-line spectrum editing has been standard since 1984 in all spectrum 
load tests at NAL. It is used to accelerate pre-cracking (all minor cycles are deleted) and 
for accelerated data generation. It has also been used in iterative tests to determine 
effective crack closure level and threshold stress intensity valid for spectrum loading as 
shown in figure 10 (Sunder 1989). The principle behind closure stress determination is 
that a fatigue crack will not extend due to load cycles applied while it is fully closed. 
Their omission will therefore not retard crack growth. Starting from a high level, by 
selecting progressively decreasing omission levels in successive tests, a particular level 
will be identified associated with no further acceleration in crack growth rate. This is 
the effective closure level for the given load spectrum. Effective threshold stress 
intensity range is determined in a similar fashion. Progressively smaller omission 
ranges are specified for stress intensity. While the range is above a certain threshold, 
crack growth rate will accelerate. Below this point, crack growth rates will not see any 
change. This value is treated as the effective threshold stress intensity for the spectrum. 

As shown in figure 10 (top), the crack closure value determined from iterative testing is 
2.5 g or about 28% of maximum stress, which compares well with estimates using other 
techniques for the same material. On the other hand, it was surprising to find that threshold 
stress intensity range was effectively near zero. This is a significant observation and may 
explain why even very small load cycles are damaging in transport spectra, by their 
sheer number. It would follow that threshold studies under constant amplitude loading 
may yield unconservative results when used in estimates of spectrum load crack growth. 

Figure 10. Results from tests on aluminium-alloy sheet specimens with on-line spectrum 
editing (Sunder 1987): (top) tests with different omission level, and (bottom) tests with 
different omission ΔK. All tests used the same linearly decreasing 1-g related K-function 
listed in the figures. Spectrum clipped at 6 g. 


8. Fatigue crack closure 

Linear elastic fracture mechanics (LEFM) suggests that the slope of applied load versus 
crack opening displacement (COD) depends only on Young’s modulus and crack 
length. This implies a linear load versus COD relationship. In experiments on 2024-T3 
Al-alloy sheet centre cracked specimens, Elber (1971) discovered that the slope in the 
load versus COD relationship remains linear only over a certain upper fraction of load 
level. Below this load level, the slope increases, implying a diminishing crack length. 
This effect was attributed to fatigue crack closure, whereby, the stretched wake formed 
by the advancing crack tip closes even under tensile loading, with the near tip fracture 
surfaces transferring load in bearing. The area of closure increases with unloading, as 
indicated by the increasing slope in the load versus COD curve. A closed crack 
obviously does not see damage due to fatigue cycling. Elber (1971) thereby proposed 
the effective stress range concept, whereby it is assumed that crack growth occurs only 
due to the portion of the applied load cycle when the crack is open. While Elber’s 
(1971) observations were on closure due to plasticity, more forms of closure were 
reported by subsequent research, such as oxide induced closure, roughness effects, etc. 
(Suresh et al 1981; Ritchie & Suresh 1982; Suresh & Ritchie 1982). It may be fair to say 
that fatigue crack closure has seen more publications than research on any other 
phenomenon in FCP, indicating the attention it has attracted from researchers the 
world over. 

For a given material and thickness, crack closure stress ratio, S_cl/S_max, increases with 
stress ratio. Elber’s discovery thereby explained for the first time, how mean stress 
affects sub-critical fatigue crack growth rate. Up to this point, mean stress effects were 
correlated by a suitable approximating equation. In principle, it could be argued that 
existing approximations were adequate considering that crack closure stress required 
experimental determination anyway. The same does not hold true however for random 
load conditions, typical of the service environment. It was observed that crack opening 
stress, S_op, remains more or less constant under random loading (Elber 1976; Schijve 
1979), effectively eliminating the effect of a large component of fatigue loading. S_op 
would be controlled by the crack tip deformations due to the larger loads, explaining 
the effect of overloads on FCP under variable amplitude loading. The ability of fatigue 
crack closure to explain load interaction effects and the increasing power of analytical 
tools to model crack closure under variable amplitude loading have made the 
phenomenon a popular choice in life prediction models (de Koning 1981; Newman 
1981; Dougherty et al 1992; Vormwald et al 1992). 

The significance of fatigue crack closure in practical applications makes S_op an 
important variable in experimental and analytical studies. Yet, even after about 25 
years of research and hundreds of publications on the subject, the engineering 
community does not have a standard procedure for its experimental determination, 
unlike other mechanical properties such as yield stress, K_Ic, J_Ic, etc. It may be noted 
that even a universally accepted definition of S_cl is lacking. The measured value of S_cl 
using Elber’s COD gauge depends on the location of the gauge with respect to the crack 
tip. More precise laser interferometry based measurements using indents near the crack 
tip have also shown the same effect. Many other techniques have been proposed for 
measuring crack closure. These include ultrasound, DC potential drop (DCPD), 
acoustic emission, etc. Being diagnostic in nature, these techniques cannot avoid the 
subjective element of data interpretation. This introduces inaccuracies and discrepancies 
in estimates, which are rather unacceptable, considering the power relationship 
between crack growth rate, da/dN, and effective range, ΔK_eff. 

Figure 11. Load sequences used for fractographic assessment of crack closure: (a) 
constant S_min and varying S_max (Pelloux et al 1980), (b) constant maximum stress 
and varying minimum stress (Sunder & Dash 1982), (c) striation pattern obtained 
from load sequence in (b). 

Proceeding on the assumption that da/dN is in fact controlled exclusively by ΔK_eff, 
it would follow that the crack closure phenomenon must also manifest itself in striation 
patterns on the fracture surface. In many materials, individual load cycles form 
a striation on the fracture surface. The striation spacing indicates crack extension per 
cycle. Consider the simple load sequence in figure 11a in the framework of the Elber 
equation for crack growth rate: 

$$\frac{da}{dN} = C\,\Delta K_{\mathrm{eff}}^{\,m} \qquad (2)$$

Cycles 1 and 2 have the same S_min, but S_max,2 exceeds S_max,1. In the absence of 
crack closure, the striation spacing ratio, s_r, for the two cycles will be given by: 

$$s_r = \left(\frac{S_{\max,1} - S_{\min,1}}{S_{\max,2} - S_{\min,2}}\right)^{m} \qquad (3)$$

In the presence of crack closure, (3) is rewritten as: 

$$s_r = \left(\frac{S_{\max,1} - S_{cl}}{S_{\max,2} - S_{cl}}\right)^{m} \qquad (4)$$

Given s_r from electron fractography, S_cl can be determined from (4). This idea formed 
the basis for an experimental study by Pelloux et al (1980). The technique proposed by 
them relies on the validity of the exponent, m. This material constant is determined 
from experimental macro crack growth rate data, which need not coincide with 
striation spacings if other crack-extension mechanisms are involved along with 
striation formation. In fact, Pelloux et al obtained S_cl estimates assuming different 
values for the exponent. 
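
Equation (4) can be inverted for S_cl in closed form: writing rho = s_r^(1/m), it gives S_cl = (S_max,1 − rho·S_max,2)/(1 − rho). The short sketch below applies this inversion and makes the sensitivity to the assumed exponent visible. The function name and all numerical values are hypothetical and chosen only for illustration.

```python
def closure_stress_from_spacing_ratio(s_r, s_max1, s_max2, m):
    """Solve s_r = ((s_max1 - s_cl) / (s_max2 - s_cl)) ** m for s_cl."""
    rho = s_r ** (1.0 / m)
    if rho == 1.0:
        raise ValueError("equal spacings: s_cl cannot be resolved")
    return (s_max1 - rho * s_max2) / (1.0 - rho)


# Hypothetical example: spacing ratio generated with s_cl = 30 and m = 3.
s_r = ((60.0 - 30.0) / (100.0 - 30.0)) ** 3
print(closure_stress_from_spacing_ratio(s_r, 60.0, 100.0, m=3.0))
# The same measured ratio interpreted with m = 2 yields a different estimate.
print(closure_stress_from_spacing_ratio(s_r, 60.0, 100.0, m=2.0))
```

The second call shows why the estimate depends on the exponent: the measured ratio is unchanged, but the inferred closure stress moves when m is changed.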

The shortcoming in the above technique was overcome by Sunder & Dash (1982), 
who proposed a different type of load sequence that is schematically described in 
figure 11b. A programmed load sequence with constant S_max and S_min varying gradually 
from cycle to cycle is used in the test. Assuming S_cl is constant over this block of 
cycles, all the cycles with S_min less than S_cl will leave behind equally spaced striations. 
By simply counting the number of equally spaced striations in a load block, one 
can directly determine S_cl (see typical striation pattern in figure 11c). The technique 
is based on the hypothesis that only fatigue crack closure can explain equally spaced 
striations from the type of load sequence shown in figure 11b. It is not sensitive 
to the exponent, m, or in fact, even the nature of the growth rate equation. Besides, 
the method offers the possibility of determining S_cl with a known degree of error, 
equal to the difference in S_min between successive cycles. On the other hand, it must 
be noted that in actual practice, there is an element of subjectivity in counting off 
equally spaced striations. Another disadvantage with the technique is that it requires 
a specially designed load sequence and cannot be easily adapted to a particular 
sequence of interest to the user. The material must be conducive to striation formation 
and finally, the technique provides best results with TEM, which is a rather cumbersome 
exercise. 
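
The counting step can be expressed as a simple bracketing rule. In this sketch, which assumes that every cycle of the block shares the same S_max and that S_cl is constant over the block, the number of equally spaced striations identifies the two neighbouring S_min levels between which S_cl must lie. The function name and the example levels are hypothetical.

```python
def bracket_closure_stress(s_min_levels, n_equal):
    """Bracket s_cl from the count of equally spaced striations.

    s_min_levels : S_min of every cycle in the programmed block
    n_equal      : number of equally spaced striations counted in the block
    Returns (lower, upper) such that lower < s_cl <= upper.
    """
    levels = sorted(s_min_levels)
    if not 1 <= n_equal < len(levels):
        raise ValueError("count must leave at least one cycle above closure")
    # The n_equal lowest S_min levels all lie below s_cl; the next one does not.
    return levels[n_equal - 1], levels[n_equal]


# Hypothetical block: S_min stepped from 0 to 90 in increments of 10.
print(bracket_closure_stress(range(0, 100, 10), n_equal=4))
```

The width of the returned interval equals the step in S_min between successive cycles, which is the known degree of error mentioned above.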

The above method for crack closure assessment has been used in a number of studies 
on the crack closure phenomenon. It confirmed for example, that crack closure close to 
the surface of the specimen is greater than in the mid-thickness region, where plane 
strain conditions prevail (see figure 12) (Sunder & Dash 1982). 




9. Crack closure development in notch root corner cracks 

One of the major advantages of the fractographic technique for closure measurement is 
that estimates are local in nature. This feature was exploited in a study which involved 
application of a ‘closure block’ along with a binary coded load block that ‘punches’ the 
cycle count onto the fracture surface (Anandan & Sunder 1987). The technique of 
binary coding was demonstrated in earlier work, wherein alphanumeric information 
was fractographically encoded into a striation pattern (see figure 13) (Sunder 1983). 
This opened up the possibility of introducing event markers on the fracture surface, 
which can be identified during subsequent fractography, thus permitting crack front 
and crack closure mapping for part-through cracks. 

Figure 12. More equally spaced striations are found near the surface than at the 
mid-thickness area, indicating less closure at mid-thickness (Sunder & Dash 1982). 

A fractographic study on closure development was performed on a keyhole type of 
SENT specimen, with a corner crack emanating out of a 0.25 mm deep crack initiator 
(Anandan & Sunder 1987). The tests were conducted after application of various levels 
of prior static overload to examine how notch root residual stresses will affect the 
evolution of crack front and associated S_cl values. Some results appear in figure 14. 
Note the higher levels of S_cl close to the notch surface. S_cl varied by almost 50% 
depending on location. The binary coded striation patterns were used to decode the 
cycle count and thereby build up the crack front shape and map the growth of the 
corner crack at the notch root. It was found that residual compressive stresses caused 
by prior static tensile overload reduce the ‘local mean stress’ along the notch surface 
(see figure 15). This retards local crack front progress because of reduced effective stress 
as indicated by increased closure stress levels. 

The Jo Dean Morrow and Smith-Watson-Topper equations for LCF based fatigue 
damage (Morrow 1968; Topper et al 1970) account for residual stress effects by 
introducing a local mean stress correction factor. While this correction is empirical in 
nature, the notch root stress-strain response to applied elastic loading is simulated by 
the Local Stress-Strain (LSS) method, which is based on Neuber conversion (Neuber 
1961) and the Masing model (Wetzel 1971). As demonstrated by the data in figure 15, the 
same Neuber conversion can be used to determine notch root stress ratio and 
associated S_cl values to provide a fracture mechanics based explanation for reduced 
fatigue damage in the presence of compressive residual stresses: local stress ratio is 
lower than applied stress ratio, the fatigue crack remains closed over a larger portion of 
the load cycle and crack growth is retarded by the reduced effective stress intensity 
range. These results are for fairly large cracks, at least a fraction of a millimeter in size. 
Subsequent research, whose description is forthcoming, showed that the same conclusion 
holds true for much smaller cracks that can form at the very commencement of 
fatigue cycling. 

Figure 13. Binary coded load sequence (a) and striation pattern (b) representing 
the character string “SEECO'83” (Sunder 1983). The same technique was used to 
punch the load cycle count onto the fracture surface for event registration purposes. 
Fractograph magnification ×60 000. 
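
The way a tensile overload leaves compressive residual stress at the notch root can be illustrated with a minimal Neuber and Masing calculation. The sketch assumes Neuber's rule in the form stress × strain = (K_t·S)²/E, a Ramberg-Osgood cyclic stress-strain curve, and Masing doubling of that curve for the unloading branch. It is not the LSS program referred to in the text, and the function names and all material constants are hypothetical.

```python
def _bisect(f, lo, hi, tol=1e-9):
    """Root of an increasing function f on [lo, hi] by bisection."""
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if f(mid) > 0.0:
            hi = mid
        else:
            lo = mid
        if hi - lo < tol:
            break
    return 0.5 * (lo + hi)


def neuber_stress(nominal, kt, E, K_cyc, n_cyc, masing=False):
    """Notch root stress for a nominal stress, or stress range if masing=True."""
    target = (kt * nominal) ** 2 / E  # Neuber product
    scale = 2.0 if masing else 1.0    # Masing: hysteresis branch = doubled curve

    def strain(sig):
        return sig / E + scale * (sig / (scale * K_cyc)) ** (1.0 / n_cyc)

    return _bisect(lambda s: s * strain(s) - target, 0.0, kt * nominal)


def residual_stress_after_overload(s_overload, kt, E, K_cyc, n_cyc):
    """Notch root stress left after loading to s_overload and unloading to zero."""
    peak = neuber_stress(s_overload, kt, E, K_cyc, n_cyc)
    drop = neuber_stress(s_overload, kt, E, K_cyc, n_cyc, masing=True)
    return peak - drop


# Hypothetical aluminium-like constants in MPa; not taken from the source.
print(residual_stress_after_overload(150.0, kt=3.0, E=70000.0, K_cyc=600.0, n_cyc=0.1))
```

A negative result indicates compressive residual stress at the notch root, which lowers the local stress ratio of the cycles that follow the overload.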

Figure 14. Fractographically determined crack opening stress levels for a keyhole 
notch specimen without prior overload (a) and after 100% prior tensile overload (b) 
(Anandan & Sunder 1987). 

Figure 15. Tensile overloads of different magnitude (E, H, K) leave behind progressively 
greater compressive residual stresses that reduce the local stress ratio. As 
a consequence, the crack is closed over a larger fraction of the load cycle, leading to 
retarded growth. Crosses are fractographically estimated notch root S_op, circles are 
from literature data for given stress ratio (Anandan & Sunder 1987). 


10. Hysteresis in fatigue crack closure 

Consider the arbitrary load sequence and associated notch root stress-strain response 
shown in figure 16. Cycles A, B and cycles C, D are of the same applied magnitude. Yet, 
because of the notch root stress strain hysteresis, larger loads applied between these 
similar cycles induce residual stresses that change their local mean stress and strain. As 
a consequence of this load interaction, cycles B and D have a lower mean stress than 
cycles A and C respectively. When interpreted from a fracture mechanics viewpoint, 
crack extension in cycles B, D should be less than in A, C respectively. 

Experiments were conducted to investigate crack closure behaviour under conditions 
of varying local mean stress (Sunder 1991). Testing was on L165 Al-Cu alloy 
coupons with a central hole, with natural crack initiation under the test sequence 
shown in figure 17. This sequence contains a number of smaller cycles superposed on 
a major cycle. The simulated notch root stress-strain response to this sequence is also 
shown in the figure. The cycles on the rising and falling half of the major cycle are equal 
in magnitude and applied stress ratio. An attempt was made to determine crack closure 
stress, S_cl, for each of these cycles. This was done by introducing into each of them 
a suitable crack closure block of the type described earlier, to give the rather complex 
load sequence which appears in a typical fractograph shown in figure 18. These 
experiments revealed that under variable amplitude inelastic notch root stress-strain, 
the crack will not open at a constant applied stress level in individual cycles. This 
phenomenon can be explained on the premise that crack closure stress, S_cl, is distinct 
from and always greater than crack opening stress, S_op. S_cl and S_op for the major cycle in 
a sequence will form an interval over which S_op,i in individual embedded cycles will fall. 
Fatigue crack extension is controlled by the rising segment of the load excursion 
between S_op and S_max. S_cl, while being of academic interest under constant amplitude 
loading, acts as an upper bound for S_op in smaller embedded cycles. 

Hysteretic crack closure behaviour is schematically explained in figure 19. In the case 
of a long crack under elastic loading, S_op and S_cl are similar (figure 19a). S_op,i in 
individual cycles under complex loading will not therefore see substantial variation. In 
contrast, consider the case in figure 19b, showing closure response for a notch root 
crack under conditions of local cyclic inelasticity. The onset of closure (S_cl) occurs at 
a stress level much higher than S_op. This is due to the stress excursion from S_cl down to 
S_min, which lowers the stress at which the crack will open. The significance of this 
difference can be understood if we consider the possibility of applied stress not falling 
below S_cl, but rather, going up once again towards S_max. In this event, the crack would 
open immediately (see figure 19b). Figure 19c shows the interplay between closure and 
opening stress under a complex load sequence, which can be described by a ‘linear 
hysteretic’ model (Sunder 1991). 

Figure 17. Load sequence with many embedded cycles on rising and falling half. 
This sequence was modified suitably for fractographic estimates of S_op in the 
embedded cycles (also see figure 18). 

Figure 18. Typical fractograph from notch root area obtained from load sequence 
shown in the picture. This sequence was built around the root sequence shown in 
figure 17. 

Figure 19. Schematic of hysteretic crack opening and closure under constant 
amplitude elastic (a) and inelastic (b) loading, and under arbitrary loading (c). 

The hysteretic nature of S op variation is qualitatively similar (in terms of its influence 
on damage due to crack extension) to that expected from load history effects on notch 
root local mean stress variation (in terms of damage estimates from LCF/LSS 
considerations). This provides a fracture mechanics foundation to explain well-known 
history sensitive local mean stress effects that were so far modelled empirically. It may 
also be noted that in the absence of hysteresis in crack closure, one would not be able to 
explain (from a fracture mechanics viewpoint), why notch root fatigue damage is load 
history sensitive. 


11. Fractographic characterization of crack closure hysteresis 

The work described in the preceding sections highlights the qualitative similarity 
between the hysteretic nature of notch root mean stress and notch root crack closure 
stress variation under complex loading. In an effort to develop a procedure for crack 
growth calculations under spectrum loading, a study was conducted to estimate S_op 
and S_cl values for Al-Cu alloy 2014-T6511 (L168 or HE15AST) under a variety of test 
conditions (Sunder et al 1993a). 

Figure 20 provides a summary of experimental results at R = −0.3. This is representative 
of the stress ratio of the major cycle in most aircraft wing lower surface load spectra. 
Clearly, S_op and S_cl are barely different for long cracks under elastic loading. But as 
crack size diminishes and stress level increases, the difference between the two increases 
substantially. While S_cl varies only marginally, S_op appears to fall rapidly below zero 
stress when the notch root sees cyclic inelastic loading. It follows that a fatigue crack 
can in principle be open in compression, which implies that an assumption that negative 
stress intensity can be ignored may be unconservative. Indeed, experiments at 
R = −3.3 confirmed that a notch root crack can be open over a large portion of 
a compressive load cycle (Sunder et al 1993a). All these experiments used the specially 
designed load sequence and estimates of S_op and S_cl based on the number of equally 
spaced striations. 

Figure 20. Schematic summary of measured closure and opening loads versus stress 
range at R = −0.3 in 6 mm thick L168 extruded bar stock (Sunder et al 1993a). 

The S_op and S_cl values obtained in the above study, along with laboratory data on 
crack growth rate in coupons under constant amplitude loading and the 
proposed hysteretic model for crack closure variation, formed the basis for the 
framework for notch root short crack growth calculations (see figure 21). The influence 
of notch root cyclic inelasticity on crack driving force was accounted for by introducing 
a correction factor on stress intensity. In this manner, the cyclic stress-strain response of 
the material forms an essential input for crack growth computations. Calculations were 
made for notch root crack growth under various conditions of the FALSTAFF load 
spectrum, starting from an initial crack size of under 30 microns. These results are 
summarized in figure 22 and appear to support the application of fracture mechanics in 
fatigue crack growth analysis starting from extremely small and barely detectable crack 
sizes through to failure. 

Figure 21. Framework of notch root fatigue crack growth rate analysis (Sunder 
et al 1993b). 

Figure 22. Schematic of load interaction effects in crack initiation, short and long 
crack growth (Sunder 1991). 

Figure 23 provides a schematic of how load interaction effects can be interpreted in 
the light of experimental observations and modelling described in the previous 
sections. Figure 23a shows a programmed load sequence with periodic overloads and 
underloads superposed on a baseline sequence of small load cycles. If the notch root 
does not see inelastic strain under these stresses, notch root stress strain will see 
a scaled, but linear response, indicating total absence of load interaction effects. Figure 
23b shows notch root inelastic stress-strain response. The beneficial effect of overload is 
seen by way of reduced mean stress in the following small cycles, while the underloads 
are followed by increased local mean stress. This is the basis for the load history effects 
as simulated by the Manson-Coffin-Topper equations in the LCF/LSS method. 
Figure 23c shows crack closure response for a notch root crack. We find that as 
a consequence of crack closure hysteresis, fatigue damage due to crack growth is 
affected in much the same way as fatigue damage from LCF/LSS considerations in 
figure 23b. Finally, figure 23d shows long crack elastic response, indicating little 
cycle-by-cycle sensitivity to load sequence. This observation may appear to contradict 
observations of retarded crack growth after an overload. It must be noted that the 
present discussion is restricted to cycle-by-cycle effects, while periodic overloads do 
introduce transient phenomena, with crack closure varying as the crack grows into and 
out of the overload plastic zone. Such transients are related to crack increment, rather 
than cycling and require due consideration in the event of large duration load spectra 
and high crack growth rates. 

Considering that short cracks can initiate very early in life, it is indeed possible that 
much of what has hitherto been modelled effectively through the LCF/LSS approach 
may in fact fall within the framework of a fracture mechanics approach. The associated 
‘damage mechanics’ concept was proposed by Newman et al (1992). However, the 
question of hysteresis in crack closure was not addressed in their work. 

The crack closure concepts presented in the present paper and their extension to 
random load conditions through rainflow considerations has recently been confirmed 
using high resolution laser interferometry studies of crack tip displacements under specially 
designed load sequences (Ashbaugh et al 1994). The study included a comprehensive finite 
element analysis of the problem which provided data that closely tie up with previous 
fractographic observations as well as the laser interferometry measurements. These 
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da/dH (Experiment), mm/Hour 

Figure 23. Summary of crack growth rate calculations for long and short cracks. 
Short crack data extend down to about 40 micron crack size (Sunder et al 1993b). 


measurements were performed with reference to microscopic indents about 150 
microns apart and about 50 to 100 microns ahead of the crack tip. Displacement over 
this gauge length versus applied load excursion is shown in figure 24. Note that 
displacement is shown in relative terms as a deviation from the unloading load versus 
COD slope. The load sequence contains five rainflow counted cycles, two each on the 
rising and falling half of the major cycle. Clearly, S_op in the embedded cycles is greater
than in the case of the major cycle, as would also follow from the hysteretic closure
model. The closure of the data loops also confirms earlier fractographic observations 
(Sunder et al 1984) on the validity of rainflow for fatigue crack growth. Finally, it 
appears to underline the relevance of energy balance considerations to fatigue damage 
through crack tip stress-strain hysteresis as explained in figure 4 (Raju 1980). 

The preceding sections provide an indication about the value of fractography in 
understanding the nature of fatigue crack growth under complex loading. Described 
below is another application of quantitative fractography. 


12. Notch root short crack growth under spectrum loading 

The evaluation of short crack growth rate predictions under spectrum loading 
(figure 23) posed the requirement of obtaining reliable experimental data (down to 
Figure 24. Relative crack opening displacement versus load in a complex
sequence shown at top right. Data from laser interferometry over entire sequence
shown at left. Small circles indicate S_op in individual counted cycles (Ashbaugh et al
1994).


crack size of under 50 microns) that could be used as a reference. As such cracks have
to initiate naturally (artificially initiated cracks would have an unacceptably large
initial size), they cannot be tracked by conventional instrumentation, particularly
when multi-site initiation is possible. Available data from the literature have largely
been obtained using the surface replica technique (Newman & Edwards 1988). In our
studies, an attempt was made to characterize short crack growth rates through optical 
fractography. 

The basis for fractographic crack growth rate estimation is the correlation between 
crack extension per cycle and striation spacing. Striations in turn can be due to a single 
cycle or due to a group of cycles. Figure 3 showed examples of both. The FALSTAFF 
load spectrum which was used in the study is a randomized load sequence. It was 
adapted for fractography by conversion into a programmed load sequence (Sunder et 
al 1993a). To ensure equivalence of the two, the sequence was first reduced to a rainflow 
cycle count. A total of 18 large cycles of different magnitude were identified in the 
spectrum. The remaining cycles were broken into 18 similar blocks. One each was 
attached to each one of the major cycles. Care was taken to space the two largest cycles 
apart in such a manner that they could be easily identified on the fracture surface. The 
resultant programmed load sequence (shown in figure 25) was checked for equivalence 
with the original FALSTAFF sequence, first analytically, then, after minor adjustments,
experimentally, to ensure that the two sequences provided the same growth
rates on a long crack that could be monitored visually, through instrumentation (from
unloading compliance) and fractographically (to ensure that fractographic estimates
under programmed loading conform to visual observations). These results appear in
figure 26.
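
The reduction of a load sequence to a rainflow cycle count, as used above to check the programmed sequence against FALSTAFF, can be illustrated with a minimal sketch. It assumes the common three-point rainflow rule (as described in ASTM E1049) and an input that has already been reduced to turning points; the function names are hypothetical and this is not the authors' own implementation.

```python
from collections import defaultdict, deque


def rainflow_ranges(turning_points):
    """Yield (load_range, count) pairs from a sequence of peaks and valleys.

    count is 1.0 for a closed cycle and 0.5 for a half cycle.
    Assumes the input contains turning points only (no intermediate samples).
    """
    stack = deque()
    for point in turning_points:
        stack.append(point)
        while len(stack) >= 3:
            newest = abs(stack[-1] - stack[-2])    # most recent range
            previous = abs(stack[-2] - stack[-3])  # range before it
            if newest < previous:
                break  # previous range is not yet closed
            if len(stack) == 3:
                # previous range contains the start of the history: half cycle
                yield previous, 0.5
                stack.popleft()
            else:
                # previous range is embedded in larger excursions: closed cycle
                yield previous, 1.0
                last = stack.pop()
                stack.pop()
                stack.pop()
                stack.append(last)
    # whatever remains is counted as half cycles
    while len(stack) > 1:
        yield abs(stack[1] - stack[0]), 0.5
        stack.popleft()


def count_cycles(turning_points):
    """Return {load_range: number_of_cycles} for the whole sequence."""
    totals = defaultdict(float)
    for load_range, count in rainflow_ranges(turning_points):
        totals[load_range] += count
    return dict(totals)
```

Two sequences can then be compared by checking that `count_cycles` returns the same range histogram for both, which is the sense in which a reordered (programmed) sequence is equivalent to the randomized original at the cycle-count level.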

The fractographically determined crack growth rate curve was integrated ‘backwards
from the failure point’ down to the smallest possible crack size (ranging from 10
Figure 25. Programmed FALSTAFF load sequence used in fractographic studies
on notch root crack growth (Sunder et al 1993a).


to 50 microns, depending on the clarity of pictures). This gave the crack propagation 
life. Life to formation of the small crack was then estimated as the difference between 
endurance and crack propagation life. As a consequence, the fractographic technique 
provided the opportunity to examine aspects of early crack growth that could not be 
addressed using conventional methods. This procedure was used in experiments on 
a number of notched coupons at different stress levels. Later, the same procedure was 
used to study crack growth in a lug joint, whose description is forthcoming. 
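
The backward integration described above can be sketched numerically. The example below is an illustration under stated assumptions: growth rates are given at discrete crack sizes, integration uses the trapezoidal rule on the reciprocal growth rate, and the function names and units (per cycle or per block) are hypothetical rather than taken from the study.

```python
def propagation_life(crack_sizes, growth_rates):
    """Integrate dN = da / (da/dN) between the smallest and largest crack size.

    crack_sizes  : crack sizes in ascending order (e.g. mm)
    growth_rates : growth rate at each size (e.g. mm per cycle or per block)
    Returns the number of cycles (or blocks) spent growing the crack.
    """
    if len(crack_sizes) != len(growth_rates) or len(crack_sizes) < 2:
        raise ValueError("need at least two matching (size, rate) points")
    life = 0.0
    for i in range(1, len(crack_sizes)):
        increment = crack_sizes[i] - crack_sizes[i - 1]
        if increment <= 0:
            raise ValueError("crack sizes must be strictly ascending")
        # trapezoidal rule on 1/(da/dN)
        inverse_mean = 0.5 * (1.0 / growth_rates[i - 1] + 1.0 / growth_rates[i])
        life += inverse_mean * increment
    return life


def formation_life(endurance, crack_sizes, growth_rates):
    """Life to formation of the smallest measured crack.

    Taken as total endurance minus crack propagation life, as in the text.
    """
    return endurance - propagation_life(crack_sizes, growth_rates)
```

The order of integration does not affect the result; "backwards from the failure point" simply reflects that the fracture surface is read from the final crack front towards the origin.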
Figure 26. Long crack growth rates under randomized MiniFALSTAFF and
programmed FALSTAFF (left); fractographic and visual observations of crack
growth rate under programmed FALSTAFF (right) (Sunder et al 1993a).

The study provided many significant observations (Prakash et al 1994; Sunder et al
1994):

• Surface replica based measurements provide an exaggerated measure of scatter in
growth rate. Short crack growth rates do not exhibit more scatter than long cracks at 
comparable growth rates. This is illustrated by the data in figure 27. The scatter seen 
in replica based data is obviously related to shortcomings in the technique and also 
to the possibility that there indeed may be some discontinuity in crack extension as 
registered on the specimen surface. 

• Number of naturally initiating cracks increases with applied stress level. Figure 28 
summarizes this effect from tests on coupons of two thicknesses. These data are for 
the FALSTAFF spectrum. It is possible that the relationship will depend on the load 
spectrum under consideration as crack formation is likely to be accelerated by the 
larger cycles in the spectrum. In case of the TWIST spectrum, the larger cycle 
accounts for less damage as indicated by the RDE diagram in figure 7. It would 



Figure 27. Replica and fractography based crack growth rate estimates for
naturally initiating multiple cracks at two different stress levels (Sunder et al 1994).
[Axis label: crack depth c, mm]

Figure 28. Effect of stress level on number of crack initiating sites. Data for two
different specimen thicknesses (Sunder & Prakash 1994).
[Panel titles: Modified FALSTAFF, Modified TWIST; axis label: No. of crack initiators]


follow that in case of TWIST, fewer initiation sites will be associated with the same 
stress level when compared to FALSTAFF. 

• Artificially initiated fatigue cracks behave differently from naturally initiating ones 
(see figure 29). Experiments on edge-notched coupons showed that naturally initiating
cracks grow faster than artificially initiated ones. This was related to the crack
initiator behaving as a blunt secondary notch inside the primary notch, when 
compared to a much sharper crack tip. It would follow that to be of practical value, 
short crack experiments must involve natural crack initiation. 

• Multiple natural cracks do not initiate simultaneously. On the other hand, cracks 
that form early will accelerate the formation and growth of new cracks due to stress 
redistribution along the notch surface. This is illustrated by the results in figure 29. 
Thus, the slowest crack on the fracture surface was possibly the dominant one, 
having formed earlier than the others. It was noted that for the purpose of life
calculations, consideration of the dominant crack alone is likely to provide satisfactory
estimates.


13. Fatigue crack growth in lugs under spectrum loading 

Advances in fracture mechanics and in our understanding of fatigue crack growth 
mechanisms have largely been based on laboratory testing of coupons with standard 
stress intensity solutions. Many fatigue failures on airframes occur at joints and 
fasteners, which represent a fairly complex geometry. 

In a recent application of fractography, lug joints with different degrees of
interference fit were tested to failure under both constant amplitude and
programmed FALSTAFF loading conditions (Sunder & Prakash 1994). The fracture 
surface was then studied to determine growth rates down to crack size below 50 
microns. The measurements were performed with an optical microscope equipped with 
digital imaging technology. The constant amplitude growth rates were then compared 
with data from standard laboratory coupons to derive a stress intensity calibration. 
These data appear in figure 30. They appear to indicate shortcomings in available 
Figure 29. Reconstructed intermediate crack front mapping from multiple sites
A-F (a), and (b) crack growth rates for individual cracks. ‘A’ was the dominant
crack having appeared first, while B-F exhibit accelerated growth rates. Two discontinuous
lines in (b) show the lower growth rates for artificially initiated cracks
(Sunder & Prakash 1994).


analytical solutions for stress intensity in lugs, particularly when small cracks are 
involved (about 150 microns and less). On the other hand, the data also indicated that 
irrespective of the degree and nature of the interference fit, crack growth rates appear to
fall into a single scatter band above a crack size of about 150 microns. While this 
observation requires verification for other stress levels, one may conclude that given 
a suitable stress intensity calibration, fracture mechanics can handle the durability as 
well as damage tolerance of joints and fasteners. 


14. Concluding remarks 

The research described in this paper was aimed at improving fatigue life prediction 
technology in engineering application. While the materials and load spectra studied are 
Figure 30. Calibrated stress intensity versus crack depth for a part-through crack
growing out of the lug notch root. Calibration obtained under constant amplitude
loading (Sunder & Prakash 1994). [Axis label: crack depth, mm]


specific to airframe structural application, they may be relevant to many other fatigue 
critical structures and machinery. The complexity of metal fatigue in engineering 
application may be attributed to a number of factors, including: 

• stress distribution in a built-up structure, particularly at stress raisers, and its
response to dynamic loading conditions,

• cumulative damage under random load conditions, typical of service environment,
and,

• effects of environment, creep and residual stress variation that may change initially
assumed mechanical properties and response.

The work described in this paper relates to the second aspect, which centres around 
the extrapolation of test data obtained on laboratory coupons under constant amplitude
loading to real components subject to a random load environment:

• Difference between an arbitrary load sequence and constant amplitude loading from 
the viewpoint of cumulative damage. 

The rainflow cycle counting technique bridges this difference. While it was 
discovered in a low-cycle fatigue framework, its validity for crack growth analysis 
has been demonstrated through electron fractography of failures obtained under
specially designed programmed load sequences. In view of its mechanistic basis, the 
rainflow technique could be developed for accelerated fatigue life analysis and 
testing. 

• Sensitivity of both crack initiation and crack propagation to load history (load
interaction effects). Experience shows these can account for more than an order of 
magnitude variation in fatigue life. 

Most life prediction models account for history effects by simulating the effect of 
mean stress. A hypothetical material, whose fatigue response is insensitive to mean 
stress, is also likely to be insensitive to load history effects. Fatigue crack closure has 
been identified as the means through which mean stress affects crack growth. The 
experimental research described in this paper provided a better understanding of
crack closure distribution across the crack front in through as well as short, 
part-through cracks. 

A significant outcome of recent research has been the discovery of hysteresis in 
fatigue crack closure. This discovery provides the link between fracture mechanics 
and low-cycle fatigue considerations of fatigue damage under complex loading and 
may lead to a unified consideration of the fatigue problem. 

A central feature of the research described in the paper is its interdisciplinary 
approach. All the experiments involved specially designed loading that required 
extensive use of custom-built test automation. Computer controlled testing at NAL 
saw much progress over the past 15 years and the technology is currently being 
commercialized. Most experiments were followed up by electron and optical fractography,
which in recent years has gained from the appearance of cost effective PC-based
image enhancement and processing techniques. 

Further work is recommended in the area of natural crack formation under service 
loading. While fracture mechanics appears to be able to handle cracks as small as 20 
microns, high endurance applications (particularly engine components and automobile
suspension) require better understanding of how a crack of that size forms, of
course under a randomized load environment. The problem holds exciting avenues of
research in the area of fasteners, which appear to often determine the quality of
structural integrity assurance. Finally, load monitoring technology for built-up structures
combined with NDE and life prediction technology has potential application in
life extension schemes in view of aging problems and growing cost of new installations. 

15. Scope of analytical effort 

Analytical modeling in solid and fluid mechanics including fracture mechanics proceeds
from first principles and basic material constants. Fatigue life prediction in
contrast is essentially restricted to extrapolation of laboratory data obtained either 
under constant amplitude or random loading to other sequences, stress levels and crack 
geometries. Laboratory coupon level testing for data generation is therefore required 
for every new material and its variation including processing. National Aerospace
Laboratories have been actively involved in material test data generation for design as 
well as type certification purposes. A few thousand tests have been performed over the 
past six years to determine the LCF and FCP constants for a variety of Al-Cu, Al-Li, 
steel and titanium alloys used in airframe applications. Such testing contributed to type 
approval of a number of indigenous materials for airframe use and to the qualification
of imported candidate materials for use in aircraft development programmes.

Stringent requirements on air worthiness and structural integrity require laboratory 
level full-scale testing to demonstrate the quality of a new design or modification of 
operating aircraft. All the preceding analyses including residual life predictions are 
intended to ensure a high degree of probability that the design as a total system will 
survive certification testing under conditions that closely simulate real operation over 
the stipulated service life. This concept is now followed by the highly competitive auto 
industry in view of the high costs of retrofits under warranty and liability clauses. In the 
years to come, it is likely that other industries in the country, including energy, chemical 
industries, rail transportation and consumer durable sector will follow the same trend. 
Thus, in the years to come, testing will continue to play a major role in qualifying the 
durability and damage tolerance of materials and structures, while analytical effort will 
be expected to optimize design and make testing more cost effective. 


The work described in this paper enjoyed the strong support of Dr S R Valluri, Prof 
R Narasimha and Dr K N Raju. Financial support for the effort was provided by 
Aeronautical Research & Development Board, Aeronautical Development Agency 
and Department of Science and Technology. The author would like to place on record 
the considerable effort of professional colleagues from many countries, who contributed
by way of peer review and encouragement to the documentation in international
literature of the reported research. The support over the years from colleagues from the 
Materials Science Division, Structures Division, Engineering Services and Graphic 
Arts at NAL is gratefully acknowledged. Finally, the author is indebted to colleagues in 
the Structural Integrity Division for their active involvement and unstinting support in 

Mechanisms and modelling of creep in superalloys

Abstract. The characteristics of creep deformation of nickel-base superalloys
are reviewed and the implications for the micromechanisms controlling
the behaviour are considered. The development of a model of the creep
deformation that is compatible with the physical mechanisms is traced, first 
in an isotropic form and later incorporating full crystallographic anisotropy. 
The validity of the model and its ability to be extrapolated to more complex 
loading conditions are evaluated against a wide range of experimental 
measurements. 

Keywords. Crystallographic anisotropy; nickel base superalloy; crystal 
rotation; shape change; low cycle fatigue; isotropic/anisotropic model; 
single crystal turbine blade. 


1. Introduction 

The development of nickel-base superalloys and the evolution of the gas turbine engine 
have been closely related, each having a history that stretches only about fifty years 
(Betteridge & Shaw 1987). Indeed, the gas turbine is still by far the largest engine 
making use of these materials and the continual drive to improve engine efficiency has 
led to very rapid changes in alloy composition and microstructure. More importantly,
the technology pull aspect of the development has meant that new variants of 
superalloys have been very rapidly introduced into service. 

A wide range of different alloy compositions and processing technologies have been 
developed for specific requirements in both different parts of gas turbines and for engines 
used for different applications. Thus, for example, in aero engines there is a requirement for 
materials with high creep strength capable of operating at high temperatures for use as 
turbine blades, whereas other materials with high yield and fatigue strengths capable of 
operating at high stresses and low temperatures are required for turbine disks. 
Similarly there are important differences in the requirements for materials for use in 
marine environments, for electricity generation and for civilian and military aircraft. 
However, alloys have been developed that suit all of these applications giving a wide 
range of compositions and processing routes. This has largely been achieved in an 
empirical manner and it is arguable that the development of new nickel-based alloys is 
becoming increasingly difficult because of the maturity of the technology. 
Concentrating on the blade applications where creep behaviour is of particular
importance we note that alloy microstructures have been controlled in addition to
alloy composition in order to achieve optimum properties. The current state of the art
for the most advanced turbine blade alloys is to produce these materials in single crystal
form, which leads to a high level of anisotropy of properties (Duhl 1987). Although the
alloy strengths have probably been optimised to a level which is going to be difficult to 
exceed, it is unlikely that the full potential of these materials is yet being achieved. This 
is because there is as yet an inadequate understanding of the variation in relevant 
mechanical behaviour as a function of operating conditions and crystallographic 
anisotropy. Even if such an understanding were available, the design procedures currently available cannot yet
incorporate full time-dependent, anisotropic plasticity. Computer-aided design is 
developing very rapidly as computer power becomes cheaper and more readily 
available. However, these engineering developments can only be effective if there is 
a parallel improvement in the understanding and representation of material behaviour. 
Although the technology of alloy production is mature, engineering aspects of the use 
of single crystal superalloys are at a very early stage of development. 

This paper will review current understanding of the factors influencing the creep 
behaviour of superalloys with particular emphasis on single crystal materials. It will 
describe the evolution of a model of anisotropic creep in single crystal superalloys that 
is expressed as constitutive laws which may be incorporated into finite element 
calculations of component behaviour. The paper will also indicate how the model has 
been validated and point to possible future directions. 


2. Creep behaviour of solid solutions and superalloys 

There is an extensive literature reporting studies of creep behaviour of a wide range of 
materials and of theoretical interpretation and modelling relating to these results 
(Wilshire & Evans 1987, 1990, 1993). Concepts of the mechanisms of creep deformation
have largely been derived from studies on simple metals and solid solution alloys; there
is now a very good understanding of the micromechanisms controlling creep in these 
materials (Langdon 1985). However, there are some very significant differences in the 
creep behaviour of nickel-base superalloys compared to that of simpler metallic 
materials. To highlight a few of these differences: 

• Plastic pre-strain of superalloys accelerates the creep rate whereas it hardens most 
solid solution alloys (Dyson et al 1976). 

• There is very little difference in creep curves of superalloys produced under constant 
load and constant stress conditions whereas these tests give large differences in most 
solid solution alloys (Loveday & Dyson 1990). 

• Nickel-based superalloys exhibit very little, indeed probably no, steady state creep 
but are dominated by a tertiary creep regime of increasing creep rate. Solid solution 
alloys generally have an extensive steady state regime. 

• Tertiary creep in nickel-base superalloys is exhibited in conditions of both tensile and 
compressive loading indicating that it cannot be due to cracking or cavitation. In 
solid solution alloys tertiary creep is generally associated with cavitation damage
and is only apparent when there is a significant component of tensile loading (Tilley 
& Harrison 1973). 
In view of these important differences it is essential to reconsider the mechanisms
controlling creep deformation in multi-phase systems such as superalloys. The conventional
models of recovery-controlled creep leading to a steady state behaviour that
evolves into a tertiary regime when cavitation develops describe simple metals very
well. However, they are not appropriate for superalloys, or indeed for many other
microstructurally complex alloys. Dyson & McLean (1983), Henderson & McLean
(1983), Dyson (1988) and Ion et al (1986) have, over a number of years, developed 
a model which attributes the characteristic softening of nickel-based superalloys to 
a progressive increase in the density of mobile dislocations that accumulate with 
increasing plastic strain. This can be thought of as being the result of inhibition of 
extensive glide of dislocations that encounter particles of the gamma prime precipitate. 
Insufficient dislocations are produced to provide a significant level of dislocation 
hardening and most new dislocations created remain mobile. Other factors that can 
contribute to the characteristic tertiary creep, such as loss of internal load bearing area 
due to the development of grain boundary cavities or reduction in load bearing 
cross-section during deformation (Ashby & Dyson 1984), which are common to most 
other materials, are relatively unimportant in single crystal superalloys and will not be 
considered here. 

It has been possible to develop a relatively simple set of equations based on these 
concepts that adequately describe the creep behaviour of isotropic nickel-based alloys. 
More recently Ghosh et al (1990) and Pan et al (1993) have extended the approach to 
account for the material anisotropy in single crystal superalloys. This has taken the 
isotropic model as its starting point but recognises that even at high temperatures 
deformation results from dislocations moving on a few slip systems of well-defined 
crystallinity. Consequently, the model has been reframed in terms of shear stresses and
shear strains on these allowed slip systems and the total anisotropic deformation has 
been computed by summing the shear displacements from all active slip systems. This 
paper will briefly review the model and will compare its predictions with a range of 
validatory experiments. 

3. Isotropic model 

The model has been expressed in the formalism of continuum damage mechanics 
(Kachanov 1958) in which the creep rate $\dot{\varepsilon}$ is written as a function of state variables (or
damage parameters) that represent the current condition of the material, in particular
of the structural and microstructural features that control the strength of the material.
If it is possible to derive equations describing the evolution of these state variables
($S_1, S_2, \ldots$) then the creep behaviour can be determined by integration of a set of
coupled differential equations:

$$\dot{\varepsilon} = f(\sigma, T, S_1, S_2, \ldots),$$
$$\dot{S}_1 = g(\sigma, T, S_1, S_2, \ldots),$$
$$\dot{S}_2 = h(\sigma, T, S_1, S_2, \ldots). \qquad (1)$$

The problem is to identify the explicit forms of these equations that are compatible with 
the known micromechanisms of deformation so that these can be evaluated quantitatively
from standard creep data to allow extrapolation to complex loading conditions.
Figure 1. Examples of the fit of the isotropic model to creep curves for three different
nickel base superalloys.
Dyson & McLean (1990) have given a comprehensive review of the isotropic model, so
only the principal points will be given here. When cavitation and change of external material
cross section can be neglected, the isotropic creep behaviour can be represented by a set of
three equations involving two state variables $S$, $\omega$ and four constants $\dot{\varepsilon}_i$, $H$, $S_{ss}$, $C$:

$$\dot{\varepsilon} = \dot{\varepsilon}_i (1 - S)(1 + \omega),$$
$$\dot{S} = H \dot{\varepsilon}_i \left(1 - S/S_{ss}\right),$$
$$\dot{\omega} = C \dot{\varepsilon}. \qquad (2)$$

$S$ is a dimensionless representation of an internal stress that progressively increases to
a steady state value $S_{ss}$ leading to primary creep. The increasing density of mobile
dislocations $\rho$ is expressed through $\omega = (\rho - \rho_i)/\rho_i$ where the subscript $i$ indicates the
initial value.

Dyson & McLean (1990) describe a procedure for analysing a creep curve to evaluate 
the constants $\dot{\varepsilon}_i$, $H$, $S_{ss}$, $C$ and this has been successfully applied to several materials and
been shown to be particularly appropriate to nickel-base superalloys. Figure 1 compares
several creep curves for nickel-base superalloys with the curves generated using
equation set (2). This is not a particularly demanding test of the model since other more 
empirical approaches can provide equally accurate fitting of the shapes of individual 
creep curves. However, the differential form of equation set (2) allows the creep strains 
that develop during variations in stress and/or temperature to be evaluated quite easily. 
Figure 2 compares the measured creep strain during cyclic loading with a prediction 
using model constants evaluated from constant stress tests. 
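
Because equation set (2) is in differential form, a creep curve follows from simple time stepping. The sketch below is an illustration only: it assumes the form of equation set (2) with the creep rate multiplied by (1 − S)(1 + ω), uses forward Euler integration at constant stress and temperature, and the function name and any constants passed to it are hypothetical placeholders, not fitted values from the paper.

```python
def integrate_creep(eps_rate_i, H, S_ss, C, t_end, n_steps=10000):
    """Forward-Euler integration of the isotropic creep equations.

    eps_rate_i : initial creep rate (1/s), fixed for a given stress and temperature
    H, S_ss, C : remaining model constants of equation set (2)
    Returns a list of (time, strain, S, omega) tuples.
    """
    dt = t_end / n_steps
    strain = 0.0
    S = 0.0      # internal stress variable, rises towards S_ss (primary creep)
    omega = 0.0  # relative increase in mobile dislocation density (tertiary creep)
    history = [(0.0, strain, S, omega)]
    for step in range(1, n_steps + 1):
        strain_rate = eps_rate_i * (1.0 - S) * (1.0 + omega)
        S_rate = H * eps_rate_i * (1.0 - S / S_ss)
        omega_rate = C * strain_rate
        strain += strain_rate * dt
        S += S_rate * dt
        omega += omega_rate * dt
        history.append((step * dt, strain, S, omega))
    return history
```

To follow a change in stress or temperature, the same loop can be restarted from the current values of the strain, S and ω with the constants appropriate to the new condition; how the constants vary with stress and temperature is not specified here and would have to come from the creep database.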

Ghosh & McLean (1992) have extended this model to incorporate a wider range of 
loading conditions. In particular they show that strain-controlled deformation can be 
taken into account by the addition of one further equation into equation set (2). This 
represents the contribution of stress relaxation during variations in applied stress. If we 
represent the total and elastic contributions to strain by $\varepsilon_T$, $\varepsilon_E$ respectively, while the
creep component has no subscript, the additional equation is:

$$\sigma = E\varepsilon_E = E(\varepsilon_T - \varepsilon). \qquad (3)$$



Figure 2. Comparison of the measured creep curves obtained under conditions of
cyclic creep for the single crystal superalloy SRR99 axially stressed along the <001>
direction with the model prediction based on a database of constant stress creep tests.

Figure 3. Predictions of the low cycle behaviour of IN738LC during constant load
cycles based on model parameters established from a database of constant stress
creep tests. The two predictions are for simulations of (a) irreversible growth of
damage and (b) damage that grows in tensile stresses but heals in compressive stresses.


Integration of the combination of equation set (2) and (3) subject to appropriate 
boundary conditions allows the simulation of several types of axial deformation: 
constant strain rate, stress relaxation, strain or load controlled low cycle fatigue. Ghosh 
& McLean (1992) considered inter alia the consequences of reversible and irreversible 
damage; i.e. damage that grew under tension but healed under compressive loads, 
compared to damage that continued to grow irrespective of the direction of loading. 
Figure 3 illustrates the different predictions for these two types of damage for 
load-controlled LCF. For irreversible damage the plastic strain range progressively 
increases, while for reversible damage the strain range remains constant but the loops 
are progressively displaced giving a ratcheting of the material. 

4. Anisotropic model 

Extension of the model to account for anisotropic creep of single crystals by Ghosh et al 
(1990), and Pan et al (1993) considers deformation to be restricted to specific slip 
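systems and to occur at a rate $\dot{\gamma}$ that depends on the resolved shear stress $\tau$, temperature
$T$ and equivalent state variables to those introduced in the isotropic model. For the kth
slip system:

$$\dot{\gamma}^{k} = \dot{\gamma}_i^{k}(1 - S)(1 + \omega),$$
$$\dot{S} = H\dot{\gamma}_i^{k}\left(1 - S/S_{ss}\right). \qquad (4)$$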

The total displacement $\varepsilon_{ij}$ is obtained by summing all $N$ components of shear on the
allowed slip systems $\{n_1 n_2 n_3\}\langle b_1 b_2 b_3\rangle$:

$$\varepsilon_{ij} = \sum_{k=1}^{N} \gamma^{k} b_i^{k} n_j^{k}. \qquad (5)$$

The subscripts $i$, $j$ take values 1, 2 or 3 representing the three cube axes and $k$ identifies
one of the $N$ slip vectors being considered. Having constructed the strain matrix,
anisotropic deformation is completely defined. An arbitrary crystal direction $[x_1 x_2 x_3]$
will transform to a new orientation in the small strain approximation:


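$$\begin{bmatrix} X_1 \\ X_2 \\ X_3 \end{bmatrix} =
\begin{bmatrix}
1 + \varepsilon_{11} & \varepsilon_{12} & \varepsilon_{13} \\
\varepsilon_{21} & 1 + \varepsilon_{22} & \varepsilon_{23} \\
\varepsilon_{31} & \varepsilon_{32} & 1 + \varepsilon_{33}
\end{bmatrix}
\begin{bmatrix} x_1 \\ x_2 \\ x_3 \end{bmatrix}. \qquad (6)$$

Strain in that direction is given by the magnitudes of the initial and final vectors,
$\varepsilon = (X - x)/x$. If the constants of the creep equation are known, this allows the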
calculations of: 

• crystal rotations during creep; 

• strain in any direction; 

• changes in material shape during deformation. 
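
The three calculations listed above all follow from the strain matrix. The sketch below shows the bookkeeping under stated assumptions: each slip system contributes a simple shear of the form γ b_i n_j with unit vectors b (slip direction) and n (plane normal), the small strain transformation of equation (6) is applied directly, and all function names are hypothetical.

```python
from math import sqrt


def unit(v):
    """Return v scaled to unit length."""
    norm = sqrt(sum(c * c for c in v))
    return [c / norm for c in v]


def strain_matrix(shears):
    """Sum the shear contributions of all active slip systems.

    shears : list of (gamma, plane_normal, slip_direction) tuples
    Returns a 3x3 list with eps[i][j] = sum over systems of gamma * b_i * n_j.
    """
    eps = [[0.0] * 3 for _ in range(3)]
    for gamma, normal, direction in shears:
        n_hat, b_hat = unit(normal), unit(direction)
        for i in range(3):
            for j in range(3):
                eps[i][j] += gamma * b_hat[i] * n_hat[j]
    return eps


def transform(eps, x):
    """Apply X = (I + eps) x, the small strain form of equation (6)."""
    return [x[i] + sum(eps[i][j] * x[j] for j in range(3)) for i in range(3)]


def axial_strain(eps, x):
    """Strain along x from the lengths of the initial and final vectors."""
    length_initial = sqrt(sum(c * c for c in x))
    length_final = sqrt(sum(c * c for c in transform(eps, x)))
    return (length_final - length_initial) / length_initial
```

The rotation of the specimen axis is obtained by comparing the direction of `transform(eps, x)` with that of `x`; a vector lying in the slip plane is left unchanged by a single shear, which is a convenient check on the sign conventions.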

There is a considerable body of evidence in the literature that dislocation activity in
nickel-base superalloys is largely restricted to octahedral planes even at high temperature
(Caron et al 1988). There have also been reports of cube slip, in addition to
elements of octahedral to cube cross-slip. In our analysis we allow the possibility of
activity on {111}<110> and {100}<011>. We have discounted the possibility of
a {111}<211> system for the reasons detailed by Ghosh et al (1990).

For certain orientations which are symmetrically oriented with respect to the active 
slip vectors there is no change in crystal orientation or of specimen shape as a result of 
deformation. In these cases the tensile and shear formulations of the model are totally 
equivalent. Moreover, for axial loading along <001> there is no resolved shear stress on
the cube planes, so deformation only results from octahedral slip. This allows the
parameters of equation set (4) relating to {111}<1̄10> deformation to be evaluated from
a creep database of <001> specimens using simple geometrical transformations of the
parameters of the tensile formulation. Similarly, the parameters for cube slip can be
determined by analysis of <111> specimens.

Figure 4 shows the optimum fit of the model to a creep curve for a <001 > specimen tested 
in tension under constant stress conditions; the tensile and shear formulations give identical 
predictions. The creep curve calculated for <111 > assuming that only octahedral slip is 
operative is also shown; this predicts a significantly longer life than for < 001 ) whereas 
a shorter life is measured as shown. The model also provides a good representation of 
< 111 ) creep curves by establishing the parameters for cube shear creep. 

Having established the model parameters for both octahedral and cube shear creep 
as functions of stress and temperature by analysis of creep databases for <001 > and 
< 111 ) specimens respectively it is straightforward to calculate the deformation that 
results for arbitrary crystal orientations and applied stresses. 

M McLean, L M Pan and R N Ghosh

Figure 4. Comparison of the experimental and calculated creep curves for <001>
and <111> oriented specimens of SRR99 tested at 450 MPa/850°C assuming that
only {111}<110> glide operates.

The model can also be combined with a representation of the anisotropy of elastic
constants to include aspects of stress relaxation needed to account for arbitrary loading
conditions and arbitrary orientations (Pan et al 1993). For axial loading along the
<hkl> direction the Young’s modulus can be expressed in terms of the components of
the compliance tensor S.

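$$E_{\langle hkl\rangle} = \left[ S_{11} + \{ S_{44} - 2(S_{11} - S_{12}) \}\,(h^{2}k^{2} + k^{2}l^{2} + l^{2}h^{2}) \right]^{-1}. \qquad (7)$$

Using a time-stepping sequence, the creep strain in the <hkl> direction is computed and
combined with the following equation to determine the stress under strain-controlled conditions:

$$\sigma = E_{\langle hkl\rangle}\,(\varepsilon_t - \varepsilon). \qquad (8)$$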
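A short Python sketch may help to show how the orientation-dependent modulus and the time-stepping relation fit together under strain control. It rests on several assumptions that are not in the paper: h, k, l are treated as direction cosines, the compliance values are round placeholder numbers rather than SRR99 data, the total strain is held constant, and a simple power-law creep rate stands in for the slip-system model of equation set (4).

```python
import math


def youngs_modulus(s11, s12, s44, hkl):
    """Young's modulus along <hkl> for a cubic crystal (form of equation 7).

    Assumption: hkl is normalised here so that h, k, l are direction cosines.
    """
    h, k, l = hkl
    norm = math.sqrt(h * h + k * k + l * l)
    h, k, l = h / norm, k / norm, l / norm
    orientation = h * h * k * k + k * k * l * l + l * l * h * h
    return 1.0 / (s11 + (s44 - 2.0 * (s11 - s12)) * orientation)


def relax_stress(modulus, total_strain, creep_rate, dt, steps):
    """Step stress = E * (total strain - creep strain) forward in time."""
    creep_strain = 0.0
    stresses = []
    for _ in range(steps):
        stress = modulus * (total_strain - creep_strain)  # form of equation (8)
        stresses.append(stress)
        creep_strain += creep_rate(stress) * dt           # explicit Euler step
    return stresses


# Placeholder compliances in 1/MPa (illustrative only, not measured values).
S11, S12, S44 = 1.0e-5, -0.4e-5, 1.0e-5
E001 = youngs_modulus(S11, S12, S44, (0, 0, 1))
E111 = youngs_modulus(S11, S12, S44, (1, 1, 1))


def power_law(stress_mpa):
    """Hypothetical creep rate (per hour); a stand-in for the full model."""
    return 1.0e-18 * stress_mpa ** 5


stresses = relax_stress(E001, 0.005, power_law, dt=1.0, steps=100)
print(f"E<001> = {E001:.0f} MPa, E<111> = {E111:.0f} MPa")
print(f"stress relaxes from {stresses[0]:.1f} to {stresses[-1]:.1f} MPa")
```

The explicit Euler step is the simplest choice and needs a time step small enough that the stress changes only slightly per step; a stiffer creep law would call for a smaller step or an implicit scheme.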

4.1 Validation of anisotropic model 

The model can provide an acceptable representation of a database of uniaxial creep
results for the symmetrical <001> orientation, as indicated in figure 5, but this is not
a particularly demanding test since equally good fits can be obtained by empirical
models. The more demanding tests are to assess the ability of the model to extrapolate
successfully from creep data for simple orientations to predict:

• creep behaviour for arbitrary complex orientations, accounting for
variations in the form and magnitude of the anisotropy of creep as functions of stress
and temperature;

• changes in orientation with creep strain;

• shape distortions of creep test specimens during deformation;

• material response to a range of stress and strain controlled deformation for both
simple and complex orientations.

All these aspects have been evaluated experimentally.

Figure 5. Comparison of the measured and calculated times to (a) 0.5%, (b) 5%
and (c) 25% elongations for SRR99.

4.1a Anisotropy of creep: Figures 6a-c show the predicted and measured tensile
creep curves for <001> and <111> orientations of the alloy SRR99 for three different
stresses at 850°C in which the observed changes in relative creep strength are
successfully reproduced bearing in mind the intrinsic scatter in creep data (Ghosh et al
1990). It is also possible to calculate the creep curves for arbitrary orientations but this
is not a particularly demanding test.

4.1b Orientation changes: Uniaxially stressed specimens of simple symmetric orientations
are not predicted to exhibit any change in orientation during creep deformation,
and none are observed. However, in the case of complex orientations and/or
multiaxial stresses where some slip systems are disproportionately activated there can
be an expectation of quite large lattice rotations. Unfortunately, the increase in
dislocation density with accumulating creep strain makes the accurate determination
of crystal orientation by conventional X-ray techniques extremely difficult. Electron
back-scatter patterns produced in a scanning electron microscope give precise orientations
with high spatial resolution (Dingley 1984). A series of fractured creep specimens,
all of which exhibited a considerable degree of necking, have been examined
using this technique to relate the change in orientation along the specimen length to the
local strain as measured by the reduction in area. Figure 7 compares the measured
changes in the axial orientation with those calculated by the model using parameters
established from the creep database for <001> and <111> orientations (McLean et al 1992).
There is clearly a high level of agreement between prediction and measurement; the rotations
do not correlate with those predicted by the operation of a single slip system.



Figure 6. Measured and calculated creep curves for SRR99 with <001> and <111>
orientations tested in tension at various temperatures (850, 900 and 950°C) and a stress
of 450 MPa showing the variation in relative creep performance.

Figure 7. Crystal rotations during creep deformation of SRR99 of different
complex orientations tested at 850°C/450 MPa: (a) experimental measurements
using electron back-scatter patterns and (b) model predictions.


4.1c Shape changes: The diameters of a series of creep specimens were accurately
determined as functions of specimen rotation and reduction in area along the length
of the necked specimens, carefully correlating the measurements with the crystallographic
orientation of the specimens (Pan et al 1994, 1995). The measured and
calculated cross-sections are shown in figure 8 as opposite halves of the same figure
since planar symmetry is expected. For the example shown, there is close
agreement between measurements and model prediction. In
this case the model predicts a predominance of cube slip. Indeed these measurements
combined with those of the orientation changes provide incontrovertible evidence of
shear on cube planes on a macroscopic scale. This does not necessarily imply that there
is extensive dislocation activity on cube planes at a microscopic level.


Figure 8. Comparisons of measured and predicted cross-sections of creep specimens
of SRR99 at various reductions in area along the necked test-piece: (a)
composite polar plots showing measurements (right) and calculations (left) and (b)
ratio of maximum/minimum diameters as a function of reduction in area.

Figure 9. Simulated and measured strain controlled LCF behaviour for cycling between
0 and 0.75% tensile strain at 950°C for <001> specimens of SRR99: (a) simulated and (b)
measured stress/strain curves for selected cycles, and (c) comparison of predicted and
measured maximum and minimum stresses as a function of cycle number.


4.1d Low cycle fatigue: The results of a series of high strain, elevated temperature low
cycle fatigue tests for SRR99 have been compared with simulations using model
parameters derived from a creep database (Pan et al 1994). A typical example, for strain
controlled cycling between strain limits of 0 and 0.75% tensile strain at 950°C, is shown in
figure 9. Each cycle taken individually exhibits largely elastic deformation, but there is a
shakedown of stresses reducing the maximum tensile stress and placing the unloading cycle
into compression. This is consistent with there being some relaxation of the elastic tensile
stress due to creep. The model calculations reproduce the observed behaviour to an acceptable
degree of accuracy, particularly in view of the large difference in strain rates associated
with the low cycle fatigue tests and with the creep tests forming the database.


5. Conclusions 

A physics-based model of creep deformation in nickel-base superalloys has been
developed, first in an isotropic form and later to account for the full crystallographic
anisotropy of single crystal superalloys, that successfully represents the principal
characteristics of creep in these materials. The model has been validated against both
direct creep data and a number of more indirect measures of creep deformation.
Comparisons of measurements with model predictions of orientation and shape
changes and of the effects of cyclic loading provide particularly demanding tests, which
the model passes successfully.


Much of the work described in this review has been funded by the Defence Research
Agency and creep data and specimens for examination were supplied by Dr M R Winstone
of DRA.


Life-assessment technology for fossil power plants

R VISWANATHAN 

Electric Power Research Institute, 3412 Hillview Avenue, Palo Alto, California
94303, USA

Abstract. During the last decade, construction of new fossil power plants
has come to a virtual halt in the United States. The main factors responsible
for this situation are: (1) industry perception of the existence of sufficient
reserve margins, (2) uncertainty in forecasting load growth, (3) cost of borrowing
for new plant construction, (4) siting and licensing problems for new
plants, and (5) the applicability of more stringent environmental standards 
for new plants. Unfortunately, by the year 2000 it is estimated that nearly 
20% of all the fossil units will be more than 40 years old, and almost 44% 
will be more than 30 years old. Based on historical trends, nearly half of all 
fossil plants have been retired before their fortieth year and three quarters 
before serving 50 years. In the future this trend is unlikely because of the 
shrinking reserve margins and the long lead time for new capacity addition. 
If the life of these aging plants can be extended by 20-30 years, utilities can 
have a viable low-cost alternative to building new plants while at the same 
time circumventing some of the regulatory, siting and licensing constraints. 
In view of this, plant life extension has become an integral part of the 
planning strategy for many US utilities. A critical component of the Electric 
Power Research Institute’s (EPRI) efforts in formulating life extension 
strategies for fossil power plants is the development of techniques to assess 
the current condition of high-temperature components. These techniques 
are necessary not only for assessment of the remaining life, but also for 
avoiding catastrophic failures and forced outages, for setting up inspection 
intervals and for optimizing the operating procedures. The components 
that are being addressed at present are boiler pressure parts, steam pipes 
and steam turbine rotors. This paper will provide a brief overview of current 
concerns and areas of research where significant progress has been made to 
address these concerns from a phenomenological point of view. 

Keywords. Life-assessment technology; fossil power plants; life extension 
strategies. 


1. Introduction 

Components operating at elevated temperatures can fail due to excessive deformation,
bursting under pressure, or cracking under pure creep or creep-fatigue conditions.
Failure by cracking may further be viewed as consisting of three stages: crack initiation,
crack propagation, and final failure of the component once the crack reaches a critical
size. Life-assessment techniques are, therefore, aimed at quantifying uniform or localized
incipient damage prior to crack initiation, the rate at which cracks grow, and the
critical crack size that will lead to final failure. Eventual failure of the component can
occur either by leakage or rupture at the operating temperature or by rapid brittle
fracture at lower temperatures during start-up/shutdown transients. In the latter case,
the critical size of the crack is defined by the fracture toughness of the material. Since
embrittlement phenomena occurring in service can reduce the toughness of the
material, knowledge of fracture toughness in the service-exposed condition is
necessary.

Based on worldwide experience during the last five years, a three-level approach for 
life assessment has evolved. In this approach, simple calculational techniques are used 
in the first level, followed by nondestructive and destructive tests in the next two levels. 
The focus of research has been mainly to reduce the uncertainties and hence the 
conservatism in each of the levels of assessment. This article is an overview of advances 
during the last five years in refining the assessment procedures under each of these 
levels, with special emphasis on results of work sponsored by the Electric Power 
Research Institute (EPRI). A more complete review of the extensive body of relevant 
literature may be found in Viswanathan (1989). While this paper will review advancements
from a generic technology point of view, specific techniques applicable to
individual components such as turbine rotors, steam pipes, headers and tubes may be 
found in other key overview papers (Becker et al 1987; Harris et al 1993; Viswanathan 
& Wells 1993; Wells & Viswanathan 1993; Viswanathan et al 1994). 

2. Calculation techniques 

In the calculational procedures, plant records and the time-temperature history of the
component are reviewed. The creep or creep-fatigue life fraction consumed is calculated 
using assumed material properties and damage rules. This procedure is usually 
inaccurate due to errors in assumed history, in the material properties, and in the 
damage rules. The temperature-history information may be refined by supplemental 
nondestructive or destructive examinations such as microstructural studies, hardness 
measurements, and oxide-scale measurements. The uncertainties in material properties 
can be reduced by building a database, particularly with respect to service-retrieved 
components taking into account the environmental effects. The limitations of the 
damage rules have also been explored based on extensive laboratory studies. 

2.1 Life fraction rule for creep 

The Life Fraction Rule (LFR) states that at failure

$$\sum_i \frac{t_i}{t_{ri}} = 1, \qquad (1)$$

where $t_i$ is the time spent at a given stress and temperature, and $t_{ri}$ is the rupture life for
the same test conditions. When the damage fractions incurred under different sets of
stress-temperature conditions add up to unity, failure is presumed to occur. Hart
(1976), Woodford (1979), and Bolton et al (1980) have demonstrated that the LFR is
valid for temperature changes but not for stress changes. In a multiclient project
recently completed at ERA Technology, an extensive body of long-time data has been
generated, with the specific intent of verifying the LFR for varying temperature
conditions (Brear & McCarthy 1989) and the results have been analysed in detail by
Viswanathan (Viswanathan & Gehl 1992). Two heats of a 0.5Cr-0.5Mo-0.25V steel
were evaluated: a pipe-grade steel with a predominantly ferritic microstructure and
relatively high ductility (15% elongation) under long-term service conditions, designated
MG, and a cast turbine casing steel with a bainitic microstructure and low
ductility (3-5% elongation), designated FW. In addition, a ductile tubing steel of
2 1/4Cr-1Mo composition, designated heat RG, was also evaluated.

Specimens from heat MG were subjected to creep damage fractions of 0.25, 0.45, 0.63,
and 0.82 at 575°C at a stress of 69.5 MPa; specimens from heat FW were also subjected
to similar levels of initial creep damage by testing at 550°C and 84.9 MPa. The
predamaged samples were then subjected to accelerated rupture testing at higher
temperatures, holding the stress at the same level as in the initial creep damage tests.

Results of the accelerated stress rupture tests on heat MG at 69.5 MPa for the virgin
condition and for the predamaged conditions are shown in figure 1. The target values of
remaining life at 575°C based on the LFR are also indicated. If the LFR is valid, the
temperature vs. time-to-rupture lines should be parallel to the virgin material line and
should pass through the respective target values. This is clearly not the case; linear
extrapolation of the accelerated-test data predicted remaining lives at 575°C in excess
of the target value in every case. For the 0.82 predamaged samples, the accelerated-test
results grossly over-predicted the remaining life compared to the calculated value
based on the LFR.

Virgin material rupture life at 575°C = 62,210 h.
Estimated remaining life at 575°C for pre-damaged samples:

Damage fraction    Life fraction rule (h)    Linear extrapolation (h)
0.27               45,413                    49,900
0.45               34,215                    48,300
0.63               23,017                    33,000
0.82               11,197                    45,000

Figure 1. Stress rupture curves for heat MG in the virgin condition and after prior creep
damage to various life fractions.
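The two estimates compared for heat MG can be reproduced with a short Python sketch. The LFR target is simply the virgin rupture life multiplied by the unexpended life fraction, using the values quoted with figure 1. The isostress extrapolation is shown as a least-squares straight line of temperature against log10(time to rupture); the three accelerated-test points used to exercise it are hypothetical, since the individual test results are not tabulated here, and the function names are illustrative only.

```python
import math

VIRGIN_LIFE_H = 62_210  # heat MG at 575 C, as quoted with figure 1
# Damage fraction -> remaining life from linear extrapolation (h), figure 1
LINEAR_EXTRAPOLATION_H = {0.27: 49_900, 0.45: 48_300, 0.63: 33_000, 0.82: 45_000}


def lfr_remaining_life(virgin_life_h, damage_fraction):
    """Remaining life implied by the life fraction rule at the same condition."""
    return virgin_life_h * (1.0 - damage_fraction)


def isostress_extrapolate(points, service_temp_c):
    """Fit T = a + b*log10(t_r) to (temp_c, rupture_h) points by least squares
    and solve for the rupture time at the service temperature."""
    xs = [math.log10(hours) for _, hours in points]
    ys = [temp for temp, _ in points]
    n = len(points)
    x_mean, y_mean = sum(xs) / n, sum(ys) / n
    slope = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys)) / sum(
        (x - x_mean) ** 2 for x in xs
    )
    intercept = y_mean - slope * x_mean
    return 10.0 ** ((service_temp_c - intercept) / slope)


for fraction, extrapolated in LINEAR_EXTRAPOLATION_H.items():
    target = lfr_remaining_life(VIRGIN_LIFE_H, fraction)
    print(f"damage {fraction:.2f}: LFR target {target:,.0f} h, "
          f"extrapolated {extrapolated:,} h, ratio {extrapolated / target:.2f}")

# Hypothetical accelerated tests at constant stress: (temperature C, rupture h)
hypothetical_tests = [(675.0, 100.0), (650.0, 400.0), (625.0, 1600.0)]
estimate = isostress_extrapolate(hypothetical_tests, 575.0)
print(f"extrapolated rupture life at 575 C: {estimate:,.0f} h")
```

The ratio printed for each damage level makes the trend in the text explicit: every extrapolated value exceeds the LFR target, and the discrepancy is largest for the 0.82 predamaged samples.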

The trend for the brittle cast heat FW (see figure 2) is exactly opposite to that observed for
the ductile heat MG. With the exception of the 0.45 predamaged sample, the predictions
from the accelerated tests fell short of the target values
calculated from the LFR.

A third heat of 2 1/4Cr-1Mo steel (designated RG) in the normalized and tempered
condition was used for the program. Tube samples of the steel were pressurized to an
equivalent stress of 116 MPa (16.6 ksi) and tested to creep life fractions of 0.5 and 0.72 at
525°C (975°F), the mean rupture life being 45,475 h in the burst tests. From the
interrupted test tubes, chordal samples were machined and subjected to accelerated
tests, uniaxially at a constant stress of 16.6 ksi (116 MPa), at a number of temperatures
in the range of 550 to 650°C (1025°F to 1200°F). Some tests were also carried out at
525°C (975°F) to enable comparison of the actual remaining life at 525°C (975°F) with
those predicted by the LFR and from isostress extrapolation. Results of these rupture
tests are shown in figure 3. The uniaxial rupture life for the virgin material was shorter
than the burst test lives (30,094 h vs 45,475 h) indicating that the mean diameter hoop
stress formula resulted in an overestimation of the equivalent stress. For the predamaged
samples, the actual remaining test lives at 525°C agreed very closely with
those estimated from the remaining life fraction using the life fraction rule and from
linear extrapolation of the accelerated test results.

Virgin material rupture life at 550°C = 67,726 h.
Estimated remaining life at 550°C for pre-damaged samples:

Damage fraction    Life fraction rule (h)    Linear extrapolation (h)
0.26               50,132                    32,000
0.45               37,260                    62,000
0.63               25,066                    18,000
0.81               12,871                    10,000

Figure 2. Stress rupture curves for heat FW in the virgin condition and after
prior creep damage to various life fractions.

Virgin material rupture life at 525°C and 115.8 MPa is 30,094 h in uniaxial test and
45,475 h in biaxial tests.
Estimated remaining life at 525°C for predamaged samples:

Damage fraction     Life fraction rule (h)    Linear extrapolation (h)    Actual (h)
Virgin, biaxial     -                         -                           45,475
Virgin, uniaxial    -                         30,094                      33,100
0.50                16,350                    19,217                      20,920
0.72                9,268                     14,856                      14,473

Figure 3. Results of tube burst tests and uniaxial accelerated isostress rupture tests on
samples machined from tubes previously subjected to 0.5 and 0.72 life fraction creep damage,
2 1/4Cr-1Mo steel (heat RG).

A correlation between the expected life based on the LFR and the observed life for
the three heats (figure 4) shows that for the brittle material, the actual life is always
shorter than the expected life while the opposite is true for the ductile heat MG. For the
ductile heat RG of 2 1/4Cr-1Mo steel, the expected life agrees closely with the
observed life in the accelerated tests.

The difference in behaviour between the brittle and ductile cases is believed to result 
from differences in damage mechanisms. Prior creep damage in the brittle case 
consisted of “mechanical damage” with extensive formation of creep cavities. In the 
ductile case, damage was “metallurgical” damage consisting of microstructural 
changes. In the brittle steel, subsequent testing by temperature acceleration presumably
resulted in rapid linkup of cavities, leading to premature failure; in the ductile steel,
on the other hand, temperature acceleration presumably led to recovery of some of the 
metallurgical damage, resulting in increased life. 

The correlation between prior expended life fraction and remaining life fraction
measured in the accelerated tests shown in figure 5 may be used as a modified basis for
LFR. For a ductile material, the LFR may be used for a conservative prediction of
remaining life. For a brittle material, a value of $\sum t_i/t_{ri} < 1$ as defined by the lower-bound
envelope in figure 5 may be used as a conservative failure criterion.

For a ductile material, the implications of the above results to actual situations in
which the LFR is invoked are as follows: When LFR is used to calculate cumulative
damage under conditions where the average metal temperature increases with service
time (e.g., superheater tubes), failure will always occur at $\sum t_i/t_{ri} > 1$. Hence, $\sum t_i/t_{ri} = 1$
should be used as a conservative failure criterion. The same reasoning may be applied
to situations where periodic temperature excursions occur.

Figure 4. Correlation between expected life based on life fraction rule and actually
observed life in post exposure accelerated test.

In all life-fraction calculations, the virgin material behaviour (minimum or mean of
the International Organization for Standardization’s [ISO’s] Larson-Miller data) is
used as the reference condition for calculating $t_r$. In reality, as the component
softens, the applicable Larson-Miller curve is continuously changing in such a way
that the actual cumulative damage can be more than that calculated by using the virgin
material curve. Application of the LFR could be improved by taking the softening
reactions into account.

Figure 5. Expended life fraction under service conditions vs. the remaining life fraction
as determined from post exposure accelerated tests for 3 heats of steel.

Figure 6. Larson-Miller stress rupture curve for 2 1/4Cr-1Mo steel specimens of
different sizes, before (a) and after (b) correcting for oxidation effects associated with
specimen size.

2.2 Linear damage rule for creep fatigue 

A variety of rules have been enunciated for calculating total damage when both creep
and fatigue damage are present in a component. The most popular among these is the
linear damage rule, in which the life fractions consumed in creep ($t/t_r$) and fatigue
($N/N_f$) are added as follows:

$$\sum (N/N_f) + \sum (t/t_r) = D. \qquad (2)$$

Differing approaches to calculating $N/N_f$, $t/t_r$ and the value of D at failure have led to
alternate rules which have been extensively discussed earlier (Viswanathan 1989).
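As a worked illustration of the linear damage summation, the following Python sketch adds the fatigue and creep life fractions for a set of operating blocks. The block values and the critical value of D are hypothetical; as noted above, the value of D taken to indicate failure differs between the alternate rules, so it is passed in as a parameter rather than fixed.

```python
def linear_damage(fatigue_blocks, creep_blocks):
    """Total damage D = sum(N/N_f) + sum(t/t_r) (form of equation 2).

    fatigue_blocks: (cycles applied, cycles to failure) for each cycle type.
    creep_blocks: (hours spent, rupture life in hours) for each condition.
    """
    fatigue = sum(cycles / cycles_to_failure
                  for cycles, cycles_to_failure in fatigue_blocks)
    creep = sum(hours / rupture_hours
                for hours, rupture_hours in creep_blocks)
    return fatigue + creep


def is_exhausted(damage, critical_damage=1.0):
    """Compare the summed damage with an assumed critical value of D."""
    return damage >= critical_damage


# Hypothetical history: two transient types and one steady operating condition.
fatigue_blocks = [(500, 5_000), (50, 1_000)]
creep_blocks = [(20_000, 100_000)]

damage = linear_damage(fatigue_blocks, creep_blocks)
print(f"D = {damage:.2f}, exhausted: {is_exhausted(damage)}")
```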

An extensive critique of damage rules as they apply to fossil plant components 
has been published by Viswanathan (1992, 1993). Detailed review of literature 
shows that there are divergent opinions regarding which damage approach provides 
the best basis for life prediction. It is quite clear that a number of variables, such 
as test temperature, strain range, frequency, time and type of hold, waveform, 
ductility of the material, and damage characteristics, affect the fatigue life. The 
conclusions drawn in any investigation may therefore apply only to the envelope 
of material and test conditions used in that study. The validity of any damage approach 
has to be examined with reference to the material and service conditions relevant to 
a specific application. Broad generalizations based on laboratory tests, which often
may have no relevance to actual component conditions, do not appear to be productive.
Thus, one should use a tailored, case-specific approach for any given situation.

One of the major problems in evaluating the applicability of different life-prediction 
methods is that in many cases it is necessary to use all the available data in deriving the 
life-prediction method and thus it is possible to examine only the accuracy with which 
a given method describes the data. There also is a scarcity of instances in which service
experience has been compared with specific life-prediction methods. In general, the 
available methods are utilized only to predict the lives of samples tested under 
laboratory conditions. Validation against component test data in the laboratory and 
in-service monitoring of actual equipment would lead to more confidence in the use of 
the various rules. 

Results from most studies show that even the best of the available methods can 
predict life only to within a factor of 2 to 3. Some of the cited reasons for these 
inaccuracies have already been discussed. Some additional reasons are: failure of the 
methods to model changing stress-relaxation and creep characteristics caused by strain 
softening or hardening, use of monotonic creep data instead of cyclic creep data, and 
lack of sufficiently extended-duration test data. None of the damage rules available 
today is entirely based on sound mechanistic principles. They are all phenomenological 
in nature, involving empirical constants that are material-dependent and difficult to 
evaluate. Extrapolation of the rules to materials and conditions outside the envelope 
covered by the specific investigation often results in unsuccessful life predictions. 
Material behaviour under isothermal LCF conditions in the laboratory often turns out
to be totally irrelevant to material behaviour under thermomechanical fatigue cycles
involving in-phase or out-of-phase thermal cycles in the field. For application to service
components, the stress-strain variation for each type of transient and its time dependence
must be known with accuracy. Such calculations are difficult and expensive to
perform. Because of these limitations and the simplicity of the linear-damage summation
using the life-fraction rule, the latter approach continues to enjoy widespread
popularity in engineering applications.

2.3 Oxidation and specimen size correction 

The effect of oxidation on rupture life is relevant in several aspects of remaining-life 
assessment, particularly for heavy-section components with wall thickness values 
exceeding 25 mm. Rupture life databases used in original design, such as the ISO, 
ASTM, and National Research Institute for Metals’ (NRIM) databases, are based on 
test data generated on small specimens with diameters of 6-10 mm. When these data 
are utilized in remaining-life-fraction calculations, conservative results are obtained. In 
addition, when standard ASTM specimens of 10 mm or less diameter extracted from 
heavy-walled piping are tested in air for remaining-life assessment, the resulting values 
are likely to be conservative. Results of isostress rupture tests conducted at elevated 
temperatures may need to be corrected for oxidation effects. 

An extensive study of oxidation correction factors (OCFs) for 2.25Cr-1Mo steels has
been published by Nakashiro et al (1991). Oxidation correction factors were developed
based on oxidation kinetics derived from literature and based on loss of diameter
actually observed on samples. Using the correction factor, the original NRIM Larson-Miller
plot based on 6 mm diameter specimens was revised to reflect the behaviour of
standard 10 mm specimens. This study was, however, confined to tubing applications; 
specimen sizes beyond 10 mm were not addressed. 

An alternate approach for developing OCFs consisted of rupture testing specimens
of varying size (2.5-50 mm) (Brear 1994). Selective tests were also carried out on
9 mm-diameter specimens in argon. Preliminary results from one heat of 2.25Cr-1Mo
steel (heat BSH) have shown that the rupture life in air is increased by increasing the
specimen size from 2.5 to 9 mm with a further increase of life in argon as shown in figure
6a. Foulds & Viswanathan (1994) have analysed these data and developed a methodology
for rationalizing the specimen size effect. In the method, the following phenomenological
aspects of creep/rupture were utilized:

$$\dot{\varepsilon}\, t_r = C \quad \text{(Monkman-Grant law)}, \qquad (3)$$

where the constant C may vary from test to test and need not be of known value;

$$\dot{\varepsilon} = B\sigma^{n} \exp(A/T) \quad \text{(Norton law)}, \qquad (4)$$

where the values of the constants B and A need not be known and n may be measured
easily over a short test duration; and

$$\int_{0}^{t_r} \frac{\dot{\varepsilon}\,\mathrm{d}t}{C} = 1, \qquad (5)$$

where $\dot{\varepsilon}$ is time-dependent and specimen-size dependent as a function of oxide
formation. The equation is based on the assumption that the Robinson Life Fraction
Rule applies to the test specimen when one test specimen is represented by multiple,
individual test specimens with a reducing size in time.

The resulting closed form after integrating and solving for $t_R$ (in vacuum) is:

where k is the parabolic constant for oxide growth in air, $t_R$ is the predicted vacuum test
life, $t_r$ is the measured air test rupture time, and $r_0$ is the initial specimen radius. The
constants 1.6 and 3.2 arise from an assumed Pilling-Bedworth ratio of 2.0.

When (6) is applied to the data shown in figure 6a all the data collapse onto a single
line, regardless of specimen size as shown in figure 6b.
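The reasoning behind the specimen-size correction can be illustrated numerically. The Python sketch below does not reproduce the closed-form expression of Foulds & Viswanathan; it integrates the life-fraction relation (5) directly under simplifying assumptions that are ours: the load-bearing radius shrinks as r(t) = r0 - sqrt(k t), where k is treated as a parabolic metal-loss constant; the load is constant, so stress scales as (r0/r)^2; and the creep rate follows the Norton stress dependence, so it scales as (r0/r)^(2n). With the Monkman-Grant constant C equal to the initial creep rate times the vacuum life, relation (5) reduces to an integral of (r0/r)^(2n) over the air test duration.

```python
import math


def vacuum_life(t_air_h, r0_mm, k_mm2_per_h, n, steps=20_000):
    """Estimate oxidation-free rupture life from a measured air rupture time.

    Integrates (r0 / r(t))**(2n) from 0 to t_air_h by the midpoint rule.
    Assumption: r(t) = r0 - sqrt(k * t), with k a metal-loss constant.
    """
    dt = t_air_h / steps
    total = 0.0
    for i in range(steps):
        t_mid = (i + 0.5) * dt
        radius = r0_mm - math.sqrt(k_mm2_per_h * t_mid)
        if radius <= 0.0:
            raise ValueError("specimen fully consumed by oxidation")
        total += (r0_mm / radius) ** (2 * n) * dt
    return total


# Hypothetical inputs: same air rupture time measured on two specimen sizes.
T_AIR_H = 10_000.0
K = 1.0e-6          # mm^2/h, illustrative metal-loss constant
N_EXPONENT = 5      # illustrative Norton stress exponent

small = vacuum_life(T_AIR_H, r0_mm=1.25, k_mm2_per_h=K, n=N_EXPONENT)
large = vacuum_life(T_AIR_H, r0_mm=4.5, k_mm2_per_h=K, n=N_EXPONENT)
print(f"2.5 mm specimen: corrected life {small:,.0f} h")
print(f"9 mm specimen:   corrected life {large:,.0f} h")
```

The correction is larger for the smaller specimen because the same depth of metal loss removes a greater share of its cross-section, which is the size effect the method is intended to rationalize.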


3. Nondestructive techniques for characterizing damage 

Conventional nondestructive evaluation (NDE) methods fail to detect incipient damage,
which can be a precursor to crack initiation and subsequent rapid failure. Several
improved NDE techniques have recently been developed for estimating life consumption.
These include strain-monitoring techniques, microstructural techniques, hardness-based
techniques, and oxide-scale measurement techniques for superheater/reheater
tubes.

3.1 Strain monitoring 

Strain measurements are often employed to detect creep damage. Gross changes 
such as header swelling and rotor bore expansion have been monitored in the past. 
Frequently, due to unknown variations in the original dimensions, changes in 
dimensions cannot be determined with confidence. Dimensional measurements 
fail to provide indications of highly damaging and localized creep strains such as 
those in the heat-affected zones of welds and regions of stress concentrations 
in the base metal. Cracking can frequently occur without manifest overall strain. 
Furthermore, the critical strain accumulation preceding fracture can vary widely 
with a variety of operational material parameters, and with stress state. To enable 
measurement of localized strains, an “off-line” condition surveillance system has 
been developed. The system uses the replication principle to evaluate localized 
strains and life consumption (Cane & Bissall 1986). A surface grid is scribed 
at the region of interest and preserved by means of an oxidation-resistant coating. 
A hard replica of high stability is used to duplicate the grid. Biaxial strain assessment is 
then made by high-resolution measurement of the replicas taken at successive plant 
inspection shutdowns. A predictive strain-rate lifetime model approach is used to 
establish “fitness for service.” No field experience with this technique has been reported. 

3.2 Hardness-based techniques for creep damage assessment 

The changes of hardness in low-alloy steels as a function of time and temperature have 
been extensively quantified, so that hardness changes can be used to estimate the 
operating temperature (Askins 1989; Ellis et al 1989). Correlations have also been 
established between tensile strength (hence hardness) and the Larson-Miller rupture 
relationships for low-alloy steels. These correlations enable selection of the appropriate 
Larson-Miller plot corresponding to a given hardness level, which can be used to 
calculate the remaining life. 

Current procedures do not recognize that the Larson-Miller parameter for rupture 
decreases with in-service aging. A model that explicitly takes into account the 
decreased rupture life due to softening has recently been developed for 2.25 Cr-1 Mo 
steels by Grunloh & Ryder (1989). The effect of aging on creep rupture endurance is 
modelled by the incorporation of the ultimate tensile strength at room temperature 
(UTS) into the second-degree polynomial shown in (7), which was derived by analysis of 
a large body of data, shown in figure 7. 

$$\mathrm{LMP\ (rupture)} = 40{,}975 + 57\,(\mathrm{UTS}) - 5{,}225\,(\log \sigma) - 2{,}450\,(\log \sigma)^2 \qquad (7)$$

In this equation, time, temperature, and stress are in units of hours, R (deg. Rankine), 
and ksi (1 ksi = 6.89 MPa), respectively. The effect of aging is taken into account by 
applying an aging correction factor to the original UTS, based on known kinetics of 
aging for the steel. Using temperature, applied stress, and starting UTS as input, 
a computer code calculates the LMP for rupture in the aged condition (and hence $t_r$) at 
1,000 hour intervals. The incremental damage for each time step, and hence the 
cumulative damage $t/t_r$, can be calculated. Kimura and others developed a similar 
approach to calculate creep life consumption of CrMoV rotors (Kimura et al 1988) and 
casings (Fujiyama et al 1990). 
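
The stepwise damage calculation described above can be sketched in Python. This is only an illustration under stated assumptions, not the code of Grunloh & Ryder: the Larson-Miller constant of 20 is assumed from the LMP definition quoted with figure 12, UTS is assumed to be in ksi, and `aged_uts` is a hypothetical placeholder for the aging-kinetics correction, which is not specified in the text.

```python
import math


def lmp_rupture(uts_ksi, stress_ksi):
    """Larson-Miller parameter for rupture from equation (7)."""
    log_s = math.log10(stress_ksi)
    return 40975.0 + 57.0 * uts_ksi - 5225.0 * log_s - 2450.0 * log_s ** 2


def rupture_time_hours(lmp, temp_rankine, lm_constant=20.0):
    """Invert LMP = T (C + log10 t_r) for t_r; C = 20 is an assumption."""
    return 10.0 ** (lmp / temp_rankine - lm_constant)


def cumulative_damage(temp_rankine, stress_ksi, uts0_ksi, service_hours,
                      aged_uts, step_hours=1000.0):
    """Sum the incremental damage dt / t_r over fixed time steps."""
    damage = 0.0
    elapsed = 0.0
    while elapsed < service_hours:
        next_time = min(elapsed + step_hours, service_hours)
        # Hypothetical aging law: current UTS from starting UTS, time, temperature.
        uts = aged_uts(uts0_ksi, elapsed, temp_rankine)
        t_r = rupture_time_hours(lmp_rupture(uts, stress_ksi), temp_rankine)
        damage += (next_time - elapsed) / t_r
        elapsed = next_time
    return damage


if __name__ == "__main__":
    # Placeholder aging law with no softening at all (illustration only).
    no_aging = lambda uts0, hours, temp: uts0
    print(cumulative_damage(1460.0, 10.0, 80.0, 100000.0, no_aging))
```

With a real aging law the UTS falls with time, so each later step contributes more damage than the one before it; the no-aging placeholder reduces the sum to the simple ratio $t/t_r$.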

Goto and others have proposed using hardness as a stress indicator in the 
remaining-life assessment of CrMoV rotors (Goto 1985; Kadoya et al 1985). They have quantified 
the effect of stress on the aging process so that by comparing the kinetics of hardness 
change in the rotor with that of thermally aged samples, the local stress can be 
determined. This value of stress and the known value of service temperature are used in 
conjunction with the Larson-Miller rupture plot to estimate $t_r$ for the material in its 
current damage state. 

In contrast, McGuire & Gooch (1989) show that the magnitude of the stress is 
unimportant once a “threshold stress level” has been exceeded. For all stress levels in 
the range 70-240 MPa, the hardness change in the stressed condition was found to be 
21% higher than the change in the unstressed condition. 

A creep model incorporating structural degradation as monitored by hardness 
changes has been proposed by Cane & Bissall (1986). By equating the kinetics of 
hardness change to the kinetics of interparticle spacing changes, the decrease in the 
threshold back stress for creep is modelled. Substitution of the threshold back stress in 
the Norton creep law yields an expression for the secondary creep rate $\dot{\varepsilon}$ in terms of 
hardness changes. By integrating $\dot{\varepsilon}$ between $t = 0$ and $t = t_r$, where $t_r$ is the time to 
failure defined as the time to reach an arbitrarily chosen critical strain, the remaining 
rupture life is predicted. The model is currently based on limited data and involves 
numerous assumptions that can only be justified by further research. 

3.3 Hardness and low-cycle fatigue life 

Considerable work has been carried out (Goto 1985; Kadoya et al 1985; Kimura et al 
1988; Fujiyama et al 1990) in applying hardness for calculation of fatigue-life 
consumption in CrMoV rotor grooves. It has been observed that low-cycle fatigue damage 
results in strain softening and can be measured as a hardness decrease. The premise, 
therefore, is that if the fatigue curve corresponding to the current hardness (in service) 
could be defined, the fatigue-life fraction consumed could be calculated by entering the 
appropriate total strain range $\Delta\varepsilon_t$ versus number of cycles to crack initiation ($N_f$) 
curve, as shown in figure 8 (Kimura et al 1988). These relationships have been 
quantified (Kimura et al 1988): 

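$$\Delta\varepsilon_t = k_1 N_f^{\alpha_1} + k_2 N_f^{\alpha_2}, \qquad (8)$$

where 

$$\log k_1 = 2.59 \times 10^{-3}\, H_v - 0.9 \quad \text{and} \quad k_2 = 22.0, \qquad (9)$$

$$\alpha_1 = 1.64 \times 10^{-4}\, H_v - 0.09 \quad \text{and} \quad \alpha_2 = 0.59, \qquad (10)$$

$$\sigma_{yc} = 1.07\, H_v - 40.6, \qquad (11)$$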


where $k_1$ and $\alpha_1$ are hardness-dependent constants, and $\sigma_{yc}$ is the cyclic yield strength. 
For any given hardness of the steel, (8)-(11) can be used to calculate the fatigue-life 
fraction consumed under different transients, and these fractions can be summed 
to calculate the total fatigue-life expenditure using the procedures detailed by Kimura 
et al (1988). 

3.4 Creep cavitation model 

Metallographic methods have been developed that can correlate either cavitation 
evolution or changes in carbide spacings with creep-life expenditure. It has been 
observed that, in boiler piping, cavitation is the principal damage mechanism at brittle 
zones, weld heat-affected zones, and high-stress regions in the base metal (Ellis et al 
1988). In other cases, carbide coarsening was found to be a better indicator of life 
consumption. A preliminary model based on carbide coarsening has also been 
developed (Askins 1989). 

Figure 8. Estimation of low-cycle fatigue properties by hardness for a Cr-Mo-V 
rotor forging at 566°C. $H_v$ = Vickers hardness number. 

Neubauer & Wedel (1983) characterized cavity evolution in steels at four stages - 
isolated cavities, oriented cavities, linked cavities (microcracks), and macrocracks - 
and formulated recommendations corresponding to the four stages of cavitation. To 
provide a theoretical and quantitative basis for cavity evolution, Cane & Shammas 
(1984) used a constrained-cavity-growth model and proposed a relation between the 
number fraction of cavitated boundaries (parameter A) and the life fraction consumed 
using heat-specific constants. Values of these constants either had to be assumed 
or determined experimentally for each heat, thus restricting the usefulness of the model. 
Based on interrupted-creep tests on simulated heat-affected zone 1 Cr-0.5 Mo steels, 
Ellis and others concluded that the data had too much scatter to verify the 
life-prediction model of Cane. The data could nevertheless be used empirically, by plotting 
all the data together in the form of a scatterband whose lower limit is defined by (Ellis 
et al 1989) 

$$A = 0.517\,(t/t_r) - 0.186. \qquad (12)$$

In using the A parameter method, the specific procedure used to measure A is crucial. 
The A parameter is defined as the number fraction of cavitating grain boundaries 
encountered in a line parallel to the direction of maximum principal stress. To measure 
A reproducibly, the procedure needs to be standardized. 
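
As a rough illustration of how (12) might be used, the sketch below computes the A parameter from a boundary count and inverts (12) for the expended life fraction. It treats (12) as a single curve, which ignores the scatter discussed above; the function names and the capping of the result at 1 are assumptions made for this example.

```python
def a_parameter(cavitated_boundaries, boundaries_counted):
    """Number fraction of cavitating grain boundaries along the traverse."""
    if boundaries_counted <= 0:
        raise ValueError("at least one grain boundary must be counted")
    return cavitated_boundaries / boundaries_counted


def life_fraction_from_a(a_value):
    """Invert equation (12), A = 0.517 (t/t_r) - 0.186, for t/t_r."""
    if not 0.0 <= a_value <= 1.0:
        raise ValueError("A is a number fraction and must lie in [0, 1]")
    fraction = (a_value + 0.186) / 0.517
    # Cap at 1 (an assumption): a life fraction above unity has no meaning.
    return min(fraction, 1.0)


if __name__ == "__main__":
    a = a_parameter(cavitated_boundaries=10, boundaries_counted=100)
    print(a, life_fraction_from_a(a))
```

Note that (12) gives $A = 0$ at a life fraction of about 0.36, so on this curve the absence of observed cavities does not imply an unused component.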

Since (12) was developed using two steels of widely different impurity concentrations, 
it is anticipated that the scatterband encompasses most 1 Cr-0.5 Mo steels in service. 










Figure 9. Correlation between damage classification and expended creep-life 
fraction ($t/t_r$) for 1-1/4Cr-1/2Mo steels. 

Limited data on heat-affected zone samples from 2.25 Cr-1 Mo steels also have shown 
that the cavitation behaviour can be described by (12). Extension of this correlation to weld 
metal, to steels with fine grain size, and to other steels, however, would lead to errors. 

An alternate, easier-to-use method has also been proposed by Ellis et al (1989). The 
damage classifications have been correlated with life fractions, and thus a life-fraction 
range has been established for each class (see figure 9). The Wedel-Neubauer 
classifications of material condition (undamaged, class A, class B, class C, and class D) 
correspond roughly to expended-life-fraction ($t/t_r$) values of 0.27, 0.46, 0.65, 0.84, and 1, 
respectively, using the conservative lower-bound curve shown in figure 9. Since the 
expended life fraction corresponding to various damage classifications is known, the 
remaining life can be calculated using the relationship 

$$t_{\mathrm{rem}} = t\left(\frac{t_r}{t} - 1\right), \qquad (13)$$

where $t$ is the service life expended. The remaining life corresponding to various damage 
classifications can be calculated to be 2.7t, 1.17t, 0.54t, and 0.19t for undamaged material, 
isolated cavitation, oriented cavitation, and microcracking, respectively. By applying 
a safety factor of three to the above values, the safe reinspection intervals can be established 
to be 0.9t, 0.4t, and 0.18t, respectively, for the first three conditions. This approach has 
been developed and applied by Paterson as shown in figure 10 (Paterson 1989). The figure 
clearly shows that the reinspection interval is a function not only of the current damage 
level but also of the service life elapsed, unlike in the Wedel and Neubauer approach, where 
the reinspection interval prescribed is independent of elapsed service life. For all 
combinations of damage classification and service life in regime B in the figure, the new 
approach results in increased inspection intervals. Several utilities that have adopted the 
approach have realized significant savings in inspection costs (EPRI 1990). 
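
The arithmetic behind these figures can be checked with a short Python sketch. The life fractions are the lower-bound values quoted above for 1-1/4Cr-1/2Mo steels; the dictionary keys and function names are illustrative choices, not part of the published method.

```python
# Expended life fraction t/t_r for each damage classification (values from the text).
EXPENDED_LIFE_FRACTION = {
    "undamaged": 0.27,
    "A": 0.46,  # isolated cavities
    "B": 0.65,  # oriented cavities
    "C": 0.84,  # microcracks
    "D": 1.0,   # macrocracks
}


def remaining_life(service_hours, damage_class):
    """Equation (13): t_rem = t (t_r / t - 1), with t / t_r taken from the table."""
    fraction = EXPENDED_LIFE_FRACTION[damage_class]
    return service_hours * (1.0 / fraction - 1.0)


def reinspection_interval(service_hours, damage_class, safety_factor=3.0):
    """Remaining life divided by a safety factor (three in the text)."""
    return remaining_life(service_hours, damage_class) / safety_factor


if __name__ == "__main__":
    for cls in EXPENDED_LIFE_FRACTION:
        # Multiples of the elapsed service life t (service_hours = 1).
        print(cls, round(remaining_life(1.0, cls), 2),
              round(reinspection_interval(1.0, cls), 2))
```

Because both results scale with the elapsed service life, the same damage class found later in life yields a longer reinspection interval, which is the point made by figure 10.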

The field replication data forming the basis of the Wedel-Neubauer recommendations 
have recently been reviewed (EPRI 1991). When these limited data were analysed, 
no consistent trends of cavitation evolution with operating time or with calculated 
creep life fraction consumed could be discerned. The laboratory data generated and 
analysed by Ellis and others appear to be the only available quantitative basis for 
setting inspection intervals. 

Figure 10. Service life versus reinspection interval for Aptech/EPRI remaining useful 
life method and the Neubauer-Wedel method. 


The progress of creep cavitation as a function of creep life fraction expended in 
CrMoV rotor steels has been investigated by Goto. Additional data by Carlton et al 
(1987) show wide variations in behaviour due to differences in grain size, ductility, and 
impurity contents. The results of Goto agree reasonably well with those of McGuire 
and Gooch for expended life fractions up to about 0.75, but show larger values of A at 
higher life fractions. Comparison of the behaviour of CrMoV rotor steels with CrMo 
piping steels suggests that the CrMo steels cavitate more readily and reach much larger 
values of the A parameter at failure than the CrMoV steels. It has also been 
noted that in CrMoV steels, cavitation is observed only at the end of second-stage and 
beginning of third-stage creep, unlike in CrMo steels where cavitation occurs during 
the second-stage creep (Lempp et al 1986). The relative insensitivity of the A parameter 
to expended creep life for rotor steels makes accurate determination of the A parameter 
crucial. For this reason, the damage classification as a function of expended life is 
shown in figure 11 using the lower bound curve. The life fractions consumed are 
estimated to be 0.3, 0.6, 0.85, and 1, respectively, for undamaged and classes A, B, and 
C damage. These values are larger than those for CrMo steels, indicating that for a given 
damage classification, life consumption is higher in CrMoV steels than in CrMo 
steels. Blade root fixings after blade removal may be amenable to the use of replication, 
and cavitation has been observed in rotors at these locations. More complex equipment 
required for replication of rotor bores has also been developed (Goto 1985; Kadoya 
et al 1985). However, creep cavitation in rotor bores has not been reported to date. 

3.5 Analysis of carbides 

Askins (1989) has attempted to utilize parameters such as matrix-solute compositions, 
ratios of carbide phases, and matrix-lattice parameters as quantitative indices of 
temperature during aging of 1 Cr-0.5 Mo steels. The data indicate a rapid initial 
decline in matrix-solute content and an increase in the M23C6/M3C ratio with time 
and temperature of aging. Changes in the matrix-solute content were also found to be 
reflected in changes in lattice-parameter values. Although these general trends could be 
confirmed, excessive scatter in the results did not permit quantitative correlation of any 
of these parameters with aging history. 

Figure 11. The correlation between damage classification and creep-life fraction for 
CrMoV rotor steels. 

In 2.25 Cr-1 Mo steels, there have been several investigations showing that 
prolonged exposure at about 540°C causes an irreversible form of embrittlement 
known as “carbide-induced embrittlement” (Cheruvu 1981; Qu & Kuo 1981; 
Watanabe & Shoji 1994). The embrittlement is manifested as an increase in the 
fracture-appearance transition temperature (FATT) and is generally associated 
with the formation of M6C molybdenum-rich carbides at the grain boundaries. 
The carbides act as sites for crack nucleation, promoting transgranular cleavage 
cracks at low temperatures. Carbide formation also results in softening of the steel 
since it depletes the matrix of the strengthening precipitate Mo2C and the 
molybdenum in solid solution. The softening reaction is also manifested as decreased creep 
strength and rupture strength. 

Carbide evolution in 2.25 Cr-1 Mo steel has been investigated by Stevens & 
Lonsdale (1984), Stevens & Flewitt (1985) and Munson (1990). In both studies, the 
amount of M6C carbides as a percentage of the total weight percent of carbides 
increased with time and temperature. Plotted in terms of a Larson-Miller 
time-temperature parameter, the results (figure 12) show the family of curves from the 
Stevens and Flewitt study to be shifted laterally by a significant amount compared to 
that of Munson. 

These studies suggest that although the evolution of M6C may be used as a qualitative 
index of service temperature, wide variations may occur due to differences in initial 
microstructure and composition. Data in figure 12 also show that the kinetics of M6C 
formation are considerably accelerated by phosphorus. Improved procedures for 
carbide extraction as well as a larger database on samples with various initial 
compositions and heat treatments are needed before the microstructural techniques can be 
used for plant-life assessment. Stevens & Flewitt (1985) also found that the evolution 
of the M6C phase was unaffected by stress. Nakatani et al (1990) have developed 
correlations between the percent M6C and rupture-life reduction so that the percent 
M6C in a service-exposed component could be used to deduce the creep-life fraction 
consumed. 

Figure 12. Evolution of M6C in 2.25 Cr-1 Mo steel as a function of aging time and 
temperature. LMP = (T + 460) (20 + log t). (In the equation for LMP, temperature is 
in degrees Fahrenheit and time is in hours.) 
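
The Larson-Miller parameter defined with figure 12 trades exposure time against temperature. The following sketch shows that trade-off for a given aging condition; it assumes a base-10 logarithm and uses hypothetical function names, and the temperatures in the usage lines are arbitrary illustration values.

```python
import math


def lmp_fahrenheit(temp_f, hours, lm_constant=20.0):
    """LMP = (T + 460)(20 + log10 t), with T in degrees Fahrenheit and t in hours."""
    return (temp_f + 460.0) * (lm_constant + math.log10(hours))


def equivalent_hours(lmp, temp_f, lm_constant=20.0):
    """Exposure time at temp_f that gives the same LMP value."""
    return 10.0 ** (lmp / (temp_f + 460.0) - lm_constant)


if __name__ == "__main__":
    service = lmp_fahrenheit(1000.0, 1.0e5)
    # Hours at a higher laboratory aging temperature giving the same parameter.
    print(service, equivalent_hours(service, 1100.0))
```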

The evolution of carbide phases in CrMoV rotor steels has been investigated on 
a single heat aged at 650°C by McGuire & Gooch (1989). The percentages of M23C6, 
M7C3, and VC were found to increase appreciably with aging, with a decrease in M3C. 
Most significantly, the average Fe:Cr ratio in the carbides decreased systematically 
with aging. Expansion of this database to more heats and aging conditions could open 
the door to the widespread use of the Fe:Cr ratio in carbides as an index of degradation 
in these steels. 


4. Destructive tests 

Destructive tests provide a direct measure of the current damage state of the material. 
They are the last resort in the assessment route, since the tests require removal of 
samples from operating components. There are limitations on the number of available 
samples and locations from which they can be taken. Periodic assessment of the 
remaining life is not possible. The costs of cutting out material, machining samples, and 
conducting tests can add up to a significant expenditure. These costs are further 
compounded by the plant outage during the extended period of evaluation and 
decision making. Despite these limitations, removal of core plug samples from headers 
and ring samples from bore or periphery of rotors is becoming increasingly common. 
Destructive testing generally includes tensile, impact, and creep/stress-rupture tests. 
Since the purpose and limitations of the tensile and impact tests are obvious, only the 
creep life evaluations are described here. 

4.1 Creep tests 

A procedure for estimating the current damage state by conducting stepped creep tests 
using a single specimen has been described in detail by Voorhees (1985). The procedure 
defines which stage of creep the material is in under service conditions but does not 
provide estimates of remaining life. The Monkman-Grant relationship for steels has 
also been in wide use and enables estimation of remaining life by determining the 
second-stage creep rate from creep tests. Improvements to this procedure have also 
been suggested by Castillo & Khoul (1986). The most common procedure, however, 
continues to be the use of isostress rupture tests. 

The isostress rupture test procedure consists of conducting accelerated rupture tests 
at a temperature above the service temperature. The stress is kept as close as possible to 
the service stress value. The results are plotted as $\log t_r$ versus $T$ and extrapolated 
linearly to the service temperature to estimate the remaining life. Results from recent 
research have identified the limitations of and possible refinements to the technique. 
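
The extrapolation step of the isostress procedure can be sketched as an ordinary least-squares line through $\log t_r$ versus $T$. The test temperatures and rupture times in the usage lines are invented for illustration only, and no oxidation correction is applied.

```python
import math


def fit_line(xs, ys):
    """Ordinary least-squares slope and intercept for y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def extrapolate_rupture_time(test_temps, rupture_hours, service_temp):
    """Fit log10(t_r) against T at constant stress and evaluate at service_temp."""
    logs = [math.log10(t) for t in rupture_hours]
    slope, intercept = fit_line(test_temps, logs)
    return 10.0 ** (intercept + slope * service_temp)


if __name__ == "__main__":
    temps_c = [600.0, 620.0, 640.0]   # hypothetical accelerated test temperatures
    hours = [9800.0, 6100.0, 4100.0]  # hypothetical rupture times at service stress
    print(extrapolate_rupture_time(temps_c, hours, 540.0))
```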

Results described earlier suggest that linear extrapolation of results from 
temperature-accelerated tests may yield optimistic predictions of remaining life under service 
conditions in the case of ductile materials; the reverse would be true for brittle 
materials. In either case, the predicted values are expected to be in agreement with the 
actual value within a factor of ±2. 

This conclusion is in agreement with that of Melton (1983). Unfortunately, it is 
generally difficult to define a “brittle” and “ductile” class of materials since ductility can 
vary from heat to heat and with the test conditions. For our purposes, we can define 
a “brittle” steel as one prone to creep damage by cavitation, and a “ductile” material as 
one in which creep damage occurs primarily by softening. This is consistent with the 
observation by the ERA researchers that the ductile cast MG showed no evidence of 
cavitation even at a life-fraction consumption of 0.82, while the brittle cast FW 
cavitated profusely. Using this definition, castings, coarse-grained heat-affected zones 
in welds, and fusion lines in welds may fit under the “brittle” category, while the base 
metal in forgings, piping, and tubes may fit in the “ductile” category. These 
observations are consistent with the recent results of Masuyama et al (1990) indicating 
differences in behaviour between ductile base metal and brittle heat-affected zones in 
2.25 Cr-1 Mo steel. They further observed that in brittle heat-affected zones, both 
stress-accelerated and temperature-accelerated tests gave equivalent results that 
closely conformed to the LFR, while in the ductile base metal, only the 
temperature-accelerated isostress test results were in agreement with the LFR. 

Since isostress tests are normally conducted in air, oxidation correction factors are 
also important. The extrapolated remaining-life values obtained from the accelerated 
tests should be corrected for oxidation effects. If tests are conducted in inert 
environments, no correction factor would be needed. A further refinement to the technique 
consists of using miniature specimens to minimize sample requirements. It has been 
demonstrated that miniature specimens (10 mm long, 3 mm diameter) tested in argon 
yield results representative of large specimens tested in air (Askins & Marchant 1988; 
Kadoya et al 1990). 

4.2 Crack-growth studies 

All of the techniques described so far relate to life prediction from a crack-initiation 
point of view. For heavy-wall components, the initiation criteria have to be combined 
with crack-growth data to perform a fracture mechanics analysis of remaining life. 
Fatigue crack growth analysis procedures are well established. For creep-crack growth 
and creep-fatigue crack growth, however, methodologies and data needed for analysis 
have been gathered only during the last few years. 

4.2a Creep crack growth: Extensive creep crack growth data pertaining to CrMo 
piping steels and CrMoV rotor steels have been collected, analysed, and consolidated 
(Saxena 1988; Saxena et al 1994). It has been observed that a crack-tip driving force 
parameter termed $C_t$, which takes time-dependent creep deformation into account, 
correlates much better with crack growth rates ($da/dt$) than the traditionally used 
elastic stress intensity factor $K$. The relation between $da/dt$ and $C_t$ can be expressed as 

$$da/dt = b\,C_t^{m}. \qquad (14)$$

To assess the remaining life of a component under creep-crack-growth conditions, 
two principal ingredients are needed: an appropriate expression for relating the driving 
force $C_t$ to the nominal stress, crack size, material constants, and geometry of the 
component being analysed; and a correlation between this driving force and the 
crack-growth rate in the material, which has been established on the basis of prior data 
or by laboratory testing of samples from the component. Once these two ingredients 
are available, they can be combined to derive the crack size as a function of time. The 
general methodology for doing this is illustrated below, assuming $C_t$ to be the driving 
crack-tip parameter. 

The general expression for $C_t$ used in (14) essentially reduces to the form 

$$C_t = \sigma\,\dot{\varepsilon}(A, n)\,a\,H(\text{geometry}, n), \qquad (15)$$

where $\sigma$ is the stress far from the crack tip, obtained by stress analysis; $\dot{\varepsilon}$ is the strain rate 
far from the crack tip, which is a function of the constants $A$ and $n$ in the Norton 
relation; $a$ is the crack depth obtained from NDE measurements; and $H$ is a tabulated 
function of geometry and the creep exponent $n$. The values of $A$ and $n$ are either 
assumed from prior data or generated by creep testing of samples. By assembling all the 
constants needed, the value of $C_t$ can be calculated. 

Once $C_t$ is known, it can be correlated to the crack-growth rate through the 
constants $b$ and $m$ in (14). Combining (14) and (15) provides a first-order differential 
equation for crack depth ($a$) as a function of time ($t$). Theoretically, this equation can be 
solved by separating variables and integrating. However, the procedure is complicated 
by the time-dependency of $C_t$ and the crack size dependence of the term $H$ in (15). To 
circumvent this, crack-growth calculations are performed with the current values of 
$da/dt$ (or $\dot{a}$), to determine the time increment required for incrementing the crack size by 
a small amount $\Delta a$ (i.e., $\Delta t = \Delta a/\dot{a}$). This provides new values of $a$, $t$, and $C_t$, and the 
process is then repeated. When the value of $a$ reaches the critical size $a_c$ as defined by 
$K_{IC}$, $J_{IC}$, wall thickness, remaining ligament thickness, or any other appropriate failure 
parameter, failure is deemed to have occurred. 

Although this procedure appears complex at first sight, the calculations are relatively 
easy, once the principles are understood. Computer programs have been developed to 
perform the entire analysis on personal computers. Among these, the Steamline 
Inspection Code and PCPIPE are in wide use in the United States. The only judgment 
involved is in selecting proper values for the constants $A$, $n$, $b$, and $m$, because large 
scatter in creep and crack-growth data necessitates subjective choices. If actual creep 
and/or crack-growth tests could be performed, more accurate results could be 
obtained. Even if only the values of $A$ and $n$ in the Norton law could be pinned down by 
running creep tests in argon using the miniature samples described earlier, 
uncertainties could be reduced. A sample output may be in the form of crack depth versus time or 
a plot of crack size vs. remaining life, as illustrated in figure 13. This plot was generated 
for a thick-wall cylinder under internal pressure containing a longitudinal crack. The 
outside radius and wall thickness of the cylinder were assumed to be 45.7 cm and 
7.62 cm, respectively, and the hoop stresses were calculated for internal pressures 
of 8.96 and 13.79 MPa. Materials properties in the degraded condition (hot region) as 
well as in the undegraded condition were considered. The results show that the 
remaining life is a function of the stresses as well as of prior degradation. Plots of this 
type could be used to determine remaining life or to set inspection criteria and 
inspection intervals. 

Ainsworth et al (1987) have recently described a unified approach for structures 
containing defects. This approach incorporates structural failure by rupture, 
incubation behaviour preceding crack growth, and creep crack growth in a single framework. 
Service life is governed by a combination of time to rupture, time of incubation, and 
time of crack growth. All of these quantities are calculated using a reference stress that 
is specifically applicable to the geometry of the component and is derived analytically 
or based on scale-model tests. If the desired service life exceeds the calculated rupture 
time, retirement may be necessary. 

In the opposite situation, further analysis is carried out to calculate the incubation 
time $t_i$ during which no crack growth is expected to occur. If the calculation indicates that 
the incubation time is less than the desired service life, then a crack-growth analysis is 
performed to calculate the crack-growth life $t_g$. If the total life, $t_i + t_g$, is less than the 
desired service life, safe operation beyond that point would be considered undesirable. 
This approach seems promising and deserves further exploration. 
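
The sequence of checks in this unified approach can be written as a small decision function. The sketch covers only the ordering of the comparisons as described above; the function name, return strings, and argument names are invented for illustration, and the reference-stress calculations that supply the three times are not shown.

```python
def assess_service_life(desired_life, rupture_time, incubation_time,
                        crack_growth_life=None):
    """Apply the rupture, incubation, and crack-growth checks in order."""
    if desired_life > rupture_time:
        return "retirement may be necessary"
    if incubation_time >= desired_life:
        return "acceptable: no crack growth expected within desired life"
    if crack_growth_life is None:
        raise ValueError("crack-growth analysis required: incubation time "
                         "is shorter than the desired service life")
    if incubation_time + crack_growth_life < desired_life:
        return "operation beyond incubation plus growth life undesirable"
    return "acceptable: incubation plus crack-growth life covers desired life"


if __name__ == "__main__":
    # Hypothetical times in hours.
    print(assess_service_life(50000.0, 120000.0, 20000.0, 40000.0))
```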



Figure 13. Remaining life as a function of initial crack size for an internally 
pressurized cylinder, illustrating a typical output from crack-growth analysis. 

A number of variables affect the crack growth rate by modifying $b$, $C_t$, or $m$. The effect 
of these variables is summarized below (Saxena 1988; Saxena et al 1994). 

• In-service degradation increases crack growth rate. In ductile CrMo steel base metal, 
the effect was attributable to increased $C_t$, while in brittle CrMoV rotor steels, the 
effect was attributable to an increased value of $m$. Widely divergent estimates of 
remaining life arise due to uncertainties in the crack growth coefficients $b$ and $m$ and 
in the Norton law coefficients $A$ and $n$. Determination of component-specific $A$ and 
$n$ by miniature specimen creep tests on samples, and determination of $b$ and $m$ by 
creep crack growth tests, can greatly reduce the uncertainty in the remaining-life analysis. 

• Crack growth rates at welds, fusion lines, and heat-affected zone materials are at least 
a factor of five higher compared to base metal. 

• The presence of localized chains of inclusions, further assisted by segregation of 
impurities to interfaces such as grain boundaries and fusion lines, causes significant 
increases in creep crack growth rates. 

• The presence of large amounts of impurities in the steel accelerates crack growth by 
increasing $m$. 

• All material and experimental variables that reduce creep ductility result in higher 
crack growth rates. 

• Temperature can have mixed effects on crack growth. In cases where the effect of 
temperature is merely to increase creep rate, $da/dt$ increases with increase in 
temperature due to the increase in $C_t$. On the other hand, if a transition from a brittle to 
ductile condition is involved, an increase in temperature may actually decrease the 
crack growth rates. 

• Crack-tip constraint has a pronounced effect on crack growth. Assumptions 
regarding plane-stress or plane-strain conditions can have a pronounced effect on $da/dt$. 

• Inclusion of primary creep, in addition to secondary creep, in calculating $C_t$ 
results in larger values of $da/dt$ and reduced remaining life. 

4.2b Creep-fatigue crack growth: Major advances have been made in developing 
methodologies and data needed to treat crack growth under the combined effects of 
creep and fatigue at elevated temperatures. The loading conditions in 
elevated-temperature power generation components can often be simply represented by a 
trapezoidal wave shape consisting of a loading period, a hold time and an unloading 
period. 

For these conditions, Saxena and coworkers (Saxena & Gieseke 1987; Gieseke 
& Saxena 1989; Yoon et al 1994) have suggested the following formulations for treating 
creep-fatigue crack growth data. According to them, the total crack growth rate during 
creep-fatigue loading can be partitioned into cycle-dependent and time-dependent 
components as follows: 

$$(da/dN)_{\mathrm{total}} = (da/dN)_{\mathrm{fatigue}} + (da/dN)_{\mathrm{hold}}, \qquad (16)$$

where for trapezoidal loading $(da/dN)_{\mathrm{fatigue}}$ corresponds to the crack growth rate 
associated with the loading/unloading portion of the cycle and $(da/dN)_{\mathrm{hold}}$ corresponds 
to the crack growth rate associated with the hold period. The time-dependent crack 
growth occurs only under constant-amplitude loading, during which $C^*$ and $C_t$ can be used in 
characterizing the crack growth rate during the hold period of the cycle, $t_h$. Due to 
experimental limitations, it is difficult to obtain instantaneous values of $da/dt$ and $C_t$ 
during the hold period. However, average values of the crack growth rate and the $C_t$ 
parameter can be accurately measured. The average $da/dt$ and $C_t$ are obtained as 
follows: 

$$(da/dt)_{\mathrm{avg}} = (1/t_h)\,(da/dN)_{\mathrm{hold}} \qquad (17)$$

and 

$$(C_t)_{\mathrm{avg}} = (1/t_h) \int_0^{t_h} C_t\,dt. \qquad (18)$$

The $(da/dN)_{\mathrm{hold}}$ is the crack growth during the hold period and is obtained by 
subtracting the cycle-dependent crack growth rate from the total crack growth rate. 

The total crack growth rate is then given by 

$$da/dN = C_{\mathrm{fatigue}}\,(\Delta K)^{n_{\mathrm{fatigue}}} + b\,t_h\,[(C_t)_{\mathrm{avg}}]^{m}. \qquad (19)$$

By comparing (19) with (16), it is readily seen that the first term in (19), given by the 
Paris law, represents the pure fatigue component and the second term represents the 
creep/creep-fatigue component. 

In test specimens, (C_t)_avg can be determined from the measurements of load-line
deflection as a function of time during the hold period. The value of (C_t)_avg can be
estimated by the following equation:

$$(C_t)_{\mathrm{avg}} = \left[\frac{\Delta P\,\Delta V_c}{B\,W\,t_h}\right]\frac{F'}{F}, \qquad (20)$$

where ΔV_c is the change in load-line deflection during the hold period due to creep
deformation, ΔP is the applied cyclic load, B and W are the thickness and width of the
specimen, respectively, F is the K-calibration factor, F = (K/P)BW^{1/2}, and F' = dF/
d(a/W). Similarly, the (da/dt)_avg can be determined by subtracting the pure fatigue
crack growth component from the total da/dN per cycle as per (16) and dividing the
resulting (da/dN)_hold by t_h as per (17). The average C_t can be correlated with the
average da/dt.
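
The data reduction described by (16), (17) and (20) can be sketched in a few lines of Python. This is only an illustrative sketch: the function and argument names are hypothetical, equation (20) is taken in the form (C_t)_avg = [ΔP ΔV_c/(B W t_h)](F'/F), and the caller is assumed to supply consistent units.

```python
import math


def hold_time_averages(dadn_total, dadn_fatigue, t_hold):
    """Split the total growth per cycle as in (16) and average it as in (17).

    Returns ((da/dN)_hold, (da/dt)_avg).
    """
    if t_hold <= 0:
        raise ValueError("hold time must be positive")
    dadn_hold = dadn_total - dadn_fatigue  # time-dependent part, eq. (16)
    if dadn_hold < 0:
        raise ValueError("fatigue component exceeds total growth rate")
    return dadn_hold, dadn_hold / t_hold  # eq. (17)


def ct_average(delta_p, delta_vc, b, w, t_hold, f, f_prime):
    """Estimate (C_t)_avg from load-line deflection data, eq. (20)."""
    return delta_p * delta_vc / (b * w * t_hold) * (f_prime / f)


if __name__ == "__main__":
    # Placeholder numbers chosen only to exercise the functions.
    hold, rate = hold_time_averages(3.0e-3, 1.0e-3, 100.0)
    print(hold, rate)
    print(ct_average(1000.0, 1.0e-5, 0.01, 0.05, 100.0, 10.0, 20.0))
```

Pairs of (C_t)_avg and (da/dt)_avg computed this way for each test are what would be plotted against each other in a figure such as 14b.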

Figure 14b shows a plot of (da/dt)_avg with (C_t)_avg for 1.25Cr-0.5Mo steels at 538°C.
The data include test results for a 98-second hold time, a ten-minute hold time, and also
the creep crack growth rate data (Yoon et al 1994). When plotted as a function of (C_t)_avg,
the time rates of crack growth for these very different conditions fall on the same trend
curve. In contrast, when the crack growth per cycle is plotted as a function of ΔK (see
figure 14a), different curves are obtained. The significance of the above trend with
regard to predicting the hold-time effect in engineering components is obvious, since
creep crack growth data can be used to estimate creep-fatigue crack growth data and
vice versa. It must be mentioned that if da/dN were correlated with ΔK, as has been
done in the past (Saxena & Bassani 1984), the data for each hold time would follow
a different trend, as shown in figure 14a. In order to use this approach for predicting
crack growth during hold time it is necessary to estimate the magnitude of (C_t)_avg for
components. An equation has been proposed to estimate the (C_t)_avg for any geometry
for a material deforming by elastic-cyclic plasticity and power-law creep (Yoon et al
1994). The details of this equation are given in the above reference and are outside the
scope of this overview. The applicability of this approach to steels other than Cr-Mo
steels is yet to be explored.

There are additional concerns that must be addressed in order to develop total
confidence in this approach. Currently, there is no provision to consider the interaction
between loading/unloading rates and the crack growth rate during hold time. During
start-up, large transient thermal stresses often develop. Therefore, the hold period may
be preceded by a fatigue cycle with a maximum stress considerably higher than the
stress during hold. As yet, no methods are available for considering such loading.
Analytical expressions for estimating (C_t)_avg need further numerical verification. Also,
new expressions that consider primary creep deformation are needed. The crack
growth data for verification of these concepts are currently limited to a single specimen
geometry. Therefore, data on a different specimen geometry must be obtained.

Figure 14. Comparison of creep-fatigue crack growth rates with (a) fatigue crack
growth rate plotted using ΔK, and (b) creep crack growth data using (C_t)_avg.


5. Nondestructive evaluation of toughness

In heavy-section components, severe thermal stresses occurring during startup/shutdown
transients can occasionally cause failures by rapid brittle fracture at relatively
low temperatures. The crucial last step in remaining-life analysis of cracked components
is, therefore, the determination of the critical crack size, based on a knowledge of
the current toughness of the materials at the critical location. This toughness, generally
expressed in terms of the plane-strain stress intensity for fracture (K_IC), determines the
critical crack size for failure. The usual procedure involves estimation of the K_IC at low
temperatures based on well-established correlations between the 50% ductile-to-brittle
Fracture Appearance Transition Temperature (FATT) and K_IC. Considerable progress
has been made in estimating the FATT of steels using relatively nondestructive
techniques.
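
The dependence of critical crack size on toughness can be illustrated with a short sketch. It assumes the standard linear-elastic relation K = Yσ√(πa), which is not written out in the text; the geometry factor Y, the function name and the numbers are illustrative assumptions only.

```python
import math


def critical_crack_size(k_ic, stress, geometry_factor=1.0):
    """Crack size a_c at which K = Y * stress * sqrt(pi * a) reaches K_IC.

    With K_IC in MPa*sqrt(m) and stress in MPa, the result is in metres.
    """
    if k_ic <= 0 or stress <= 0 or geometry_factor <= 0:
        raise ValueError("inputs must be positive")
    return (k_ic / (geometry_factor * stress)) ** 2 / math.pi


if __name__ == "__main__":
    # Placeholder values, not data from the paper.
    base = critical_crack_size(50.0, 200.0)
    doubled = critical_crack_size(100.0, 200.0)
    print(base, doubled, doubled / base)
```

Because a_c varies with the square of K_IC under this relation, any error or conservatism in the toughness estimate is amplified in the tolerable crack size.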

In Cr-Mo steels operating at or above 538°C, the embrittlement mechanism of
concern has been the carbide-induced embrittlement described earlier. An electrochemical
polarization test has been developed by Watanabe & Shoji (1994) for
measuring the increase in FATT due to embrittlement in such cases. In CrMoV rotor
steels temper embrittlement is of greater concern. The most promising techniques in
this case involve the use of phosphorus versus FATT correlation, picric acid etch (PAE)
tests, small punch tests, and electrochemical tests.

5.1 Phosphorus-FATT correlations 

Viswanathan & Gehl (1991) have performed an in-depth review of FATT data 
available on several CrMoV rotors that had been retired after 20 to 40 years of service 
as well as a few laboratory aged samples. The data population includes those from their 
work as well as additional data points from a recent paper by Seeley et al (1993). These 
samples were temper embrittled to varying degrees, the worst embrittlement occurring 
in the exposure temperature range of 370-425°C. The FATT in the embrittled 
condition could be correlated with the phosphorus content of the steel, as shown in 
figure 15. The D-grade rotor samples, which had been austenitized at 1,010°C and 
therefore had a coarser grain size compared to the C-grade rotor samples, exhibited
a slightly higher tendency to embrittle. Within limits of scatter, the post-service FATT
could be estimated based on a knowledge of the phosphorus content alone. 

5.2 Small punch testing 

In the small punch test, a finished small punch (SP) specimen is placed between
a 2.54 mm diameter hemispherical punch and a die. The punch displacement rate is
controlled during the test, and a load versus deflection curve is thus developed where
the area under the curve denotes the energy absorbed during the test. By conducting
tests at different temperatures, a curve of absorbed energy versus temperature can be
developed that is very similar in shape to that of a Charpy energy versus temperature
curve, with the difference that the SP test curve is shifted laterally to lower temperatures
along the temperature axis compared to the Charpy curve. The midpoint of the
energy curve can be used to define a ductile-to-brittle transition temperature (T_sp).
Foulds and others (Foulds et al 1991, 1993, 1994; Foulds & Viswanathan 1994)
have successfully applied the test procedure to determine the T_sp for several retired
rotor and disk samples. The T_sp values obtained correlated well with the Charpy FATT
values as shown in figure 16.

Figure 15. Variation of post exposure FATT (corrected for exposure temperature
and location variations) with the phosphorus content of CrMoV rotor steel.

The specimen needed for an SP test is only 0.508 mm thick with a diameter of 6.35 mm.
Specimens of this size can be easily removed from critical locations in the rotor bore
without affecting the rotor integrity by using a recently developed “scoop” device. If
bore samples from the critical locations were unavailable, samples from other locations
could be tested and the FATT values obtained could be corrected for location
dependence using the approximations described by Viswanathan & Gehl (1991).

As stated earlier, the ultimate value of FATT in making run/repair/replace decisions
is in the estimation of fracture toughness, K_IC, and the K_IC-dependent tolerable crack
or defect size. A direct determination of K_IC can reduce the potential conservatism
associated with the indirect FATT-to-K_IC estimation process. Recently, the Electric
Power Research Institute (EPRI) has completed preliminary development of a
non-empirical, purely analytical approach to using a single small punch test to directly
determine K_IC (Foulds et al 1993, 1994). The procedure is material-independent, and
requires no prior knowledge of material mechanical properties. The approach is based
on the continuum material toughness concept wherein the criterion for fracture is
defined and measured via the continuum stress-strain deformation properties of the
material. The procedure specifically involves computing the “local” strain energy
density accumulated at the location and time of crack initiation in the small punch test
specimen using large-strain finite element analysis. Since the procedure also includes
estimation of the material uniaxial tensile stress-strain behaviour from the small punch
load-displacement curve, both the fracture toughness and the uniaxial tensile behaviour
are determined from a single test. Details of the test and analysis procedures, and
results are contained in Foulds et al (1993). Uniaxial tensile stress-strain curves are
derived from the small punch test to an accuracy of typically < 5% of the stress at any
strain. With regard to fracture toughness, the K_IC prediction accuracy of the technique is
about ±25%. This level of accuracy can result in more than a factor-of-two increase
over the FATT-estimated lower bound K_IC. This translates to roughly more than
a factor-of-four increase in the tolerable crack/defect size in the rotor, and a significantly
increased safe-life estimate.

Figure 16. Correlation between Charpy transition temperature (FATT) and transition
temperature measured from small punch tests (T_sp) for 1Cr1MoV and 2.5 to
3.5 NiCrMoV rotor and disk steels.


6. Life-assessment for superheater tubes

The EPRI has developed a methodology that uses measurements of steamside oxide-scale
thickness to predict the remaining creep life of high-temperature superheater/
reheater tubing (Paterson & Rettig 1988; Viswanathan et al 1994). In addition, the
validity of measuring the oxide-scale thickness ultrasonically has also been demonstrated
by establishing a good correlation between ultrasonic measurements and direct
measurements on tube samples removed from superheater assemblies from several
fossil boilers. Comparison of uniaxial test results with tube burst test results has also
helped identify the correct formulas for computing the reference stress in tubing,
replacing a more conservative formula used in the past. The kinetics of steam-side
oxide-scale growth in heat-flux and nonheat-flux areas are also being measured in
boilers. Cumulatively, these results have provided industry with the needed confidence
in its application of a more cost-effective and nondestructive method for assessing the
condition of tubing.


7. Summary

During the last decade major advances have been made in developing an overall
strategy for assessing the remaining life of critical components operating at elevated
temperatures. The strategy is based on a phased approach in which calculational,
nondestructive and destructive evaluations are judiciously and sequentially linked to
be most cost-effective. Research has been aimed at improving the accuracy of life
prediction using all of these techniques in order to avoid undue conservatism and
premature replacement of components. Key results of the research addressing both
damage and crack growth phenomena under creep and creep-fatigue conditions have
been reviewed in this paper.


Creep life extension of high temperature components
under wall thinning conditions 

R N GHOSH, S CHAUDHURI, N ROY and L CHATTOPADHYAY 

National Metallurgical Laboratory, Jamshedpur 831007, India 

Abstract. Current methods of life extension of high temperature components
have been examined. Importance of changing section size as a result
of corrosion and erosion has been emphasized. A simple method based on
the influence of service exposure on rupture strength of the material and the
data collected on loss of section size during routine shut-down of plants has
been suggested. Highlights of the computer software developed to assist
such an exercise have been discussed. The outputs have been shown to
compare fairly well with the results reported in literature.

Keywords. Stress rupture tests; service exposure; Cr-Mo/Cr-Mo-V 
steels; time-temperature parameter; rupture strength; corrosion/erosion 
rates; reference stress; rupture life. 


1. Introduction 

The high temperature components of power plants and petroleum refineries are usually
designed on the basis that failure must not occur within a stipulated design life under
the expected operating conditions during service. In the case of components like heater
tubes or steam pipes, creep is the major life-limiting factor. At the design stage it is
therefore necessary to know the stresses which the engineering material can sustain at
the service temperature without creep failure within the planned operational life. Since
most designs are based on an assumed life of 100,000 hours or even 250,000 hours
(approximately 30 years), knowledge of long-term creep rupture properties for relevant
steels is essential. Therefore expensive multi-laboratory long-term stress rupture test
programmes have been conducted on a range of steels. At present we do have
a database which could give a fairly good estimate of up to 250,000 h rupture strength
for a variety of steels recommended for elevated temperature applications. However,
the main problem with such data is the unusually large scatter of around ±20% of the
stress level (Evans & Wilshire 1985). This is equivalent to an order of magnitude of
scatter in the expected rupture life at any specified stress level. Since failure of
components can lead to serious financial and safety problems, plant operation can be
allowed only with a reasonable assurance that catastrophic failure will not occur.
Therefore designs of high temperature components are based on the minimum assured
level of rupture strength, i.e. the lower limit of the scatter band. Yet most of the materials
used for construction will have stress rupture properties better than the lower limit of
the scatter band. In addition a safety factor of 1.3 to 1.6 in stress is commonly used at the
design stage. Therefore even at the end of the specified design life most large-scale
plants have a significant built-in additional life. This is supported by the fact that
a survey indicates that more than 93,000 MW of power is being generated by plants
which are more than 50 years old (Paterson et al 1987).
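
The link between a ±20% scatter in stress and an order-of-magnitude scatter in life can be checked with a rough calculation. The sketch below assumes a simple power-law dependence of rupture life on stress, t_r ∝ σ^(-n), which is an illustrative assumption and not a relation given in the text; the function names are hypothetical.

```python
import math


def life_scatter_factor(stress_scatter, n):
    """Ratio of longest to shortest rupture life across a +/- stress band,
    assuming t_r is proportional to stress**(-n)."""
    return ((1.0 + stress_scatter) / (1.0 - stress_scatter)) ** n


def exponent_for_decade(stress_scatter):
    """Stress exponent n at which the band corresponds to a factor of 10 in life."""
    return math.log(10.0) / math.log((1.0 + stress_scatter) / (1.0 - stress_scatter))


if __name__ == "__main__":
    n = exponent_for_decade(0.20)
    print(n, life_scatter_factor(0.20, n))
```

Under this assumption a stress exponent between 5 and 6 is enough for a ±20% stress band to span a decade in rupture life.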

Many of the plants installed in our country during the late 1950s and early 1960s are
now approaching the end of their design life. With increasing capital costs for
construction of new plants (currently around Rs. 4 crores/MW), considerable
world-wide attention is being focussed on refurbishment and life extension of existing
units. A three-stage procedure is commonly followed (McNaughton et al 1986;
Townsend 1987). Stage 1 looks at the design method in the light of the new long-term
material data that may have become available since the plant was commissioned,
whereas stage 2 involves in situ inspection of components using a variety of approaches
such as measurement of dimensions, nondestructive testing, surface metallography and
chemical analysis, primarily to identify locations where the stage 3 procedure
should be undertaken. Stage 3 involves stress rupture testing and microstructural
examination of service-exposed material.

Microstructural studies offer only additional qualitative information on the
condition of components. It is the accelerated stress rupture tests on service-exposed
material that provide a quantitative estimate of the remanent life of a component.
Acceleration is achieved by performing tests at higher than normal operating stress and
temperature. Some of the experimental studies have shown that whilst temperature
acceleration provides a reasonable estimate, stress acceleration may give a rather
optimistic value (Wiegand et al 1975; Hart 1976). This is primarily because high
temperature tests do not significantly modify the microstructures developed in steels
during long-term plant exposures. However, in order to allow tests to be performed at
higher than normal temperatures without significant oxidation, it is necessary to
provide an inert atmosphere or vacuum. If such tests are performed in air, they are likely to
provide an optimistic estimate of life.

In addition, section sizes of components change as a result of corrosion and erosion. 
Consequently, the operating stress that the component is subjected to increases with 
aging. Therefore data on loss of section size should also form an important input in the 
life assessment of aging high temperature components. The paper presents some of our 
experience on such an exercise. 


2. Effect of service exposure on rupture strength 

Accelerated stress rupture test data on a number of service-exposed steam pipes were
analysed using a menu-driven computer software developed at our laboratory
(Chattopadhyay et al 1990). This provides a framework for analysing short-term test data
using various parametric techniques to predict either long-term rupture strength or
rupture lives under any user-specified operating conditions. Figure 1 shows a typical
stress vs Larson-Miller parameter plot where test data on CrMoV steel having
different extents of service exposure have been superimposed with data on virgin steel.
Twenty has been used as the parametric constant for all sets of data so that a visual
comparison can be made. The trend reveals that if any attempt is made to extrapolate
beyond the domain of available test data it is likely to provide an unrealistic estimate of
remaining life having no definite correlation with the extent of past service exposure.
This aspect has been critically examined by Ghosh et al (1986). Table 1 presents
a comparison of the life estimates arrived at by using three different time-temperature
parameters for samples having different extents of service exposure. The inconsistency in
life estimates is clearly a result of extrapolation beyond the domain of available data.

Figure 1. Stress rupture plots of virgin and service-exposed Cr-Mo-V steel having
different extents of service exposure (stress axis label: ×10 MPa).

However, if no attempt is made to extrapolate, and only an estimate of up to
100,000 h rupture strength is attempted, the results exhibit a definite correlation with the
length of service exposure. Table 2 gives an estimate of 100,000 h rupture strength for
four different sets of samples obtained by using three different time-temperature
parameters. Irrespective of the method used, the result demonstrates that the virgin
sample has the highest rupture strength whereas the 100,000 h service-exposed material
has the lowest rupture strength. Table 2 includes two sets of samples having the
same length of service exposure. The minor variation in their rupture strength could be
a result of different extents of damage accumulation, possibly due to different average
metal wall temperatures.


Table 1. Remaining lives in hours of steam pipes at 550°C and 64 MPa obtained from
an analysis of stress rupture data.

Service           Remaining lives (h) obtained by
exposure (h)      LMP         SDP         MHP

100,000           91,000      97,000      89,000
100,000           46,000      56,000      50,000
56,000            184,000     199,000     105,400
Virgin            161,000     164,000     98,000

Table 2. 100,000 h rupture strength in MPa at 550°C obtained by three different
parametric methods.

Service           Rupture strength (MPa)
exposure (h)      LMP       SDP       MHP

100,000           67.7      66.7      55.9
100,000           68.7      68.7      60.8
56,000            72.6      72.6      66.5
Virgin            97.1      97.1      102.0
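
The loss of rupture strength implied by table 2 can be expressed as a fraction of the virgin value. The following sketch simply re-tabulates the table 2 entries; the "(a)" and "(b)" suffixes are labels introduced here to distinguish the two 100,000 h sample sets and do not appear in the source.

```python
# 100,000 h rupture strength (MPa) at 550 deg C, transcribed from table 2.
# "(a)"/"(b)" are hypothetical labels for the two 100,000 h sample sets.
RUPTURE_STRENGTH_MPA = {
    "100,000 h (a)": {"LMP": 67.7, "SDP": 66.7, "MHP": 55.9},
    "100,000 h (b)": {"LMP": 68.7, "SDP": 68.7, "MHP": 60.8},
    "56,000 h": {"LMP": 72.6, "SDP": 72.6, "MHP": 66.5},
    "Virgin": {"LMP": 97.1, "SDP": 97.1, "MHP": 102.0},
}


def retained_fraction(table, reference="Virgin"):
    """Rupture strength of each exposed sample as a fraction of the reference."""
    ref = table[reference]
    return {
        sample: {method: value / ref[method] for method, value in row.items()}
        for sample, row in table.items()
        if sample != reference
    }


if __name__ == "__main__":
    for sample, row in retained_fraction(RUPTURE_STRENGTH_MPA).items():
        print(sample, {m: round(f, 3) for m, f in row.items()})
```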


Figure 2 represents the stress rupture test data on service-exposed 5Cr-0.5Mo steel
heater tubes of petroleum refineries (Singh et al 1994). Irrespective of the service
exposure, all data lie above the minimum specified rupture strength plot for the same
grade of steel. Rupture tests were carried out on virgin tube material as well. These also
have been superimposed on figure 2. If one were to rely on short-term test data, one might
conclude from figure 2 that with service exposure the material becomes stronger,
whereas in reality it may not be so. This is primarily because stress rupture test data,
even on the same grade of material, collected from various sources can exhibit large
scatter. The material tested may also belong to different lots of products. Therefore in
such cases it is more appropriate to conclude that both service-exposed and virgin
material data lie within the same scatter band. This would imply that life assessment of
components can still be made on the basis of the minimum specified rupture strength of
this steel.


Figure 2. Stress rupture data of virgin and service-exposed 5Cr-0.5Mo steel
mostly lying above the minimum specified ASTM line.


3. Life estimation method

Even though in many instances the material properties of service-exposed components
are found to be better than the minimum specified levels, component dimensions do
change as a result of prolonged service. Most prominent amongst these changes is the loss of
thickness due to corrosion, erosion or oxidation. Pertinent standard codes do
not consider these for life estimation. Only an additional thickness allowance is kept,
based on past experience, to take care of the likely loss of section size. Zarrabi (1993) has
recently proposed a simple method of estimating boiler tube life in the presence of
corrosion and erosion processes. This is based on the calculation of reference stress for
the tube as a function of time assuming a constant thinning rate on either side of the
tube wall. The time it takes to exceed the flow stress or the rupture strength gives an
estimate of the life. Two types of failure mechanisms have been considered by Zarrabi
(1993), viz. plastic collapse determined by flow stress and creep determined by rupture
strength, the shorter of the two being the dominant mode. Use of reference stress,
a concept popularized by Sim (1968), rather than equivalent stress is recommended as it
has been found to provide accurate prediction of deformation behaviour. Recent
experiments on 0.5Cr-0.5Mo-0.25V steel tubes have shown that rupture data also can be
correlated with the reference stress. Expressions for reference stress (S) are now
available for a host of component geometries (Boyle & Spence 1983). In the case of
tubes it is given by:

$$S = \frac{3^{1/2}}{2}\,K\,\frac{P}{\ln\{(d+H)/(d-H)\}}, \qquad (1)$$

where P is the average operating pressure, d is the mean tube diameter, H is the
wall thickness at any instant t and K is a constant describing the nature of wall thinning,
e.g. uniform or localized. Kitching & Zarrabi (1981, 1982) have reported the values of
K for various kinds of wall thinning. If it is uniform, K = 1.
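
A minimal sketch of the reference stress calculation follows, assuming equation (1) has the form S = (√3/2) K P / ln{(d+H)/(d−H)}. The function name is hypothetical, and the example values are the tube dimensions and pressure listed in table 3 (outer diameter 50 mm and thickness 9 mm, giving a mean diameter of 41 mm).

```python
import math


def reference_stress(pressure, mean_diameter, thickness, k=1.0):
    """Reference stress S of a tube, in the same units as pressure.

    mean_diameter (d) and thickness (H) must share the same length unit;
    k = 1 corresponds to uniform wall thinning.
    """
    if not 0.0 < thickness < mean_diameter:
        raise ValueError("thickness must lie between 0 and the mean diameter")
    ratio = (mean_diameter + thickness) / (mean_diameter - thickness)
    return (math.sqrt(3.0) / 2.0) * k * pressure / math.log(ratio)


if __name__ == "__main__":
    # Table 3 tube: D_0 = 50 mm, H_0 = 9 mm, P = 16 MPa.
    print(reference_stress(16.0, 50.0 - 9.0, 9.0))
    # The same tube after 1 mm of wall loss shared equally by both surfaces.
    print(reference_stress(16.0, 50.0 - 9.0, 8.0))
```

When both surfaces thin at the same rate the mean diameter stays fixed and only H decreases, so S rises steadily with time.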

Assuming that the tube thins down uniformly both from the inner as well as the
outer surface at a constant rate, Zarrabi (1993) has developed a computer code
which could predict the life of boiler tubes under wall thinning conditions. This
requires a systematic representation of the stress rupture data in the form of specific
equations rather than the tabular representation used in ISO/TR 7468 (1981).
His program uses a mathematical form recommended by the Central Electricity
Generating Board (CEGB), UK, for representing the master plot (Fielding 1979).
The analysis revealed that, in comparison to the θ projection method popularised
by Evans et al (1985, 1992), this provides a conservative estimate of life for any given
thinning rate.

Use of a complex time-temperature parameter having several nonlinear terms to
reduce the scatter in the master rupture plot does not necessarily provide realistic
life estimates from short-term test data. Very often such an analysis will require a
large volume of test data. Data collected on the same material from different sources
would add to the extent of scatter. In this context therefore it is worthwhile to examine
how the predictions based on simpler forms of rupture parameter compare with those
arrived at by Zarrabi (1993). CLIP (Chattopadhyay et al 1990), which is a user-friendly
computer software developed at the National Metallurgical Laboratory (NML), provides
a simple framework for creep life prediction of engineering materials with the help of
three commonly used time-temperature parameters, viz.

$$\text{Larson-Miller Parameter (LMP)} = T\,(C + \log t_r)$$

$$\text{Sherby-Dorn Parameter (SDP)} = \log t_r - b/T$$

$$\text{Manson-Haferd Parameter (MHP)} = (\log t_r - \log t_0)/(T - T_0)$$

where T is the temperature, t_r is the time to rupture, and C, b, log t_0 and T_0 are
material constants. These can be estimated from an
analysis of the master rupture plot using the method of least squares, assuming that the
master plot can be expressed as a polynomial function of log (stress). The software
provides the user an option to select its order. Usually an equation of order 3 or 5 is
selected as it is sure to provide at least one real root. Use of equations of higher order is
unnecessary as this does not improve the predictive power of such representations
significantly.

Figure 3. Rupture strength of 0.5Cr-0.5Mo-0.25V at 575°C has been plotted as
a function of time to rupture. Reference stresses as estimated from (1) have been
superimposed for various assumed rates of wall thinning. The point of intersection
with the rupture strength plot gives the estimate of life.
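
The three time-temperature parameters can be written as short functions. This sketch assumes base-10 logarithms, absolute temperature in kelvin and rupture time in hours, none of which is stated explicitly in the text; the default C = 20 follows the value used for figure 1, and the function names are hypothetical and unrelated to the CLIP software.

```python
import math


def larson_miller(temp_k, t_rupture_h, c=20.0):
    """LMP = T (C + log t_r)."""
    return temp_k * (c + math.log10(t_rupture_h))


def sherby_dorn(temp_k, t_rupture_h, b):
    """SDP = log t_r - b / T."""
    return math.log10(t_rupture_h) - b / temp_k


def manson_haferd(temp_k, t_rupture_h, log_t0, temp0_k):
    """MHP = (log t_r - log t_0) / (T - T_0)."""
    return (math.log10(t_rupture_h) - log_t0) / (temp_k - temp0_k)


def rupture_time_from_lmp(lmp, temp_k, c=20.0):
    """Invert the Larson-Miller parameter for rupture time in hours."""
    return 10.0 ** (lmp / temp_k - c)


if __name__ == "__main__":
    temp_k = 575.0 + 273.15
    lmp = larson_miller(temp_k, 1.0e5)
    print(lmp, rupture_time_from_lmp(lmp, temp_k))
```

Once a master curve relating stress to one of these parameters has been fitted, the inverse function gives the rupture life at any stress and temperature within the range of the data.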

Using the software, rupture lives at different stresses at a given temperature can be
computed. Figure 3 gives a typical plot of rupture stress vs life using the Larson-Miller
parametric representation of the master rupture curve for Cr-Mo-V steel. In order to
estimate the life of simple components such as heater tubes, the operating stress as calculated
by (1) can be superimposed on this figure. Assuming various rates of wall thinning, this
has been computed for a tube having the initial dimensions and average operating
pressure given in table 3. Subsequently this has been superimposed on figure 3. The
point of intersection provides an estimate of the life.
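
The intersection procedure can be sketched numerically as follows. Everything specific in this sketch is an assumption made for illustration: the master curve is a hypothetical straight line log10(σ) = a0 + a1·LMP with made-up coefficients rather than fitted Cr-Mo-V data, the thinning rate is taken as the total loss of wall thickness per year shared equally by both surfaces (so the mean diameter stays constant), and a year is taken as 8760 h.

```python
import math

HOURS_PER_YEAR = 8760.0


def reference_stress(pressure, mean_diameter, thickness, k=1.0):
    """Tube reference stress, assuming S = (sqrt(3)/2) K P / ln((d+H)/(d-H))."""
    ratio = (mean_diameter + thickness) / (mean_diameter - thickness)
    return (math.sqrt(3.0) / 2.0) * k * pressure / math.log(ratio)


def rupture_strength(hours, temp_k, a0=5.94, a1=-2.0e-4, c=20.0):
    """Hypothetical master curve: log10(stress in MPa) = a0 + a1 * LMP."""
    lmp = temp_k * (c + math.log10(hours))
    return 10.0 ** (a0 + a1 * lmp)


def tube_life_years(pressure, outer_dia0, thickness0, temp_k, thinning_rate, k=1.0):
    """Time (years) at which the rising reference stress meets the falling
    rupture strength curve, located by bisection."""
    if thinning_rate <= 0:
        raise ValueError("thinning_rate must be positive")
    mean_dia = outer_dia0 - thickness0  # unchanged by equal two-sided thinning

    def margin(years):
        thickness = thickness0 - thinning_rate * years
        strength = rupture_strength(years * HOURS_PER_YEAR, temp_k)
        return strength - reference_stress(pressure, mean_dia, thickness, k)

    lo = 1.0 / HOURS_PER_YEAR                          # one hour of service
    hi = (thickness0 / thinning_rate) * (1.0 - 1e-9)   # wall almost consumed
    if margin(lo) <= 0:
        return 0.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if margin(mid) > 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    # Table 3 tube at 575 deg C with two assumed thinning rates (mm/year).
    for rate in (0.25, 0.5):
        print(rate, tube_life_years(16.0, 50.0, 9.0, 848.15, rate))
```

Bisection is adequate here because the margin between strength and stress falls monotonically with time; with real data the placeholder master curve would be replaced by the fitted polynomial in log (stress).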

4. Results and discussion 

The method described above has been used to estimate the life of typical boiler tubes
given in table 3. Figure 4 shows the dependence of tube life on thickness for various
grades of Cr-Mo steels using standard rupture data. It is clear that steels having higher
amounts of Cr and Mo do not necessarily have longer rupture lives. 2.25Cr-1Mo steel
has better creep resistance than 9Cr-1Mo steel. It is only with Nb additions and
reduced carbon content that the creep resistance of 9Cr-1Mo steel can be improved
beyond that of 2.25Cr-1Mo steel.

Table 3. A typical boiler tube specification.

Original outer dia             D_0     50.00 mm
Original thickness             H_0     9.00 mm
Internal effective pressure    P       16.0 MPa
Effective temperature          T       575°C

Figure 5 compares the life estimates of tubes of different thicknesses made of
Cr-Mo-V steel using the three different time-temperature parameters described
above. The results reported by Zarrabi (1993) are also superimposed on the same plot.
The life estimate plots in figures 4 and 5 are almost linear because the thinning rate has
been assumed to be constant.

In order to show the influence of thinning rate on life estimates arrived at by various
methods of representing the master rupture curve, calculations have been made for
Cr-Mo-V steel tubes. Figure 6 depicts the effect of thinning rate on tube life. Results
arrived at by using different parametric representations are indeed expected to differ.
The difference is however more prominent at lower thinning rates. In our country,
where generally coals with high ash content are in use, design should be based on
a relatively higher rate of wall thinning. In such cases, however, the difference in life
estimates using any of the parameters described above is likely to be marginal.


Figure 4. Creep life vs initial tube thickness plots for various grades of Cr-Mo/
Cr-Mo-V steels have been compared. The thinning rates in all cases have been
assumed to be constant. (Axes: life in years vs thickness in mm. Conditions: internal
effective pressure 16 MPa; effective temperature 575°C; original outside diameter
50 mm; thinning rate 0.25 mm/y. Curves: 0.5Cr-0.5Mo, 1.0Cr-0.5Mo, 2.25Cr-1.0Mo,
5.0Cr-1.0Mo, 9.0Cr-1.0Mo and modified 9Cr-Mo steels.)

Figure 5. Creep life vs initial tube thickness plots obtained using different parametric
functions have been compared. NL represents a functional plot using the
natural logarithm of stress as suggested by Zarrabi (1993). See text for details about
the rest of the representations. The material corresponds to 0.5Cr-0.5Mo-0.25V.


Thinning of heater tubes can take place by any one or a combination of the following
three mechanisms, viz. (i) creep strain accumulation, (ii) oxide scale growth at the inner
wall in the presence of steam/water at high pressure and temperature, (iii) fire-side
corrosion. Since the magnitude of creep strain that can be accommodated because of
design constraint is very limited, its contribution to wall thinning may be considered
marginal. Excellent predictive models are now available for estimating loss of thickness
due to oxide scale growth (Paterson & Rettig 1987). However, major problems would
arise in estimating loss of wall thickness due to fire-side corrosion. Besides, the actual
extent of wall thinning is critically dependent on the actual operating conditions and
environment. Since it is mandatory to shut down boilers for routine inspection every six
months, wall thickness measurement could be carried out at specified locations to get
a fairly accurate estimate of the kinetics of loss of the tube wall thickness. Such
measurements would provide more reliable estimates of life as they correspond to the
actual operating conditions.

Figure 6. Creep life vs thinning rate plots for 0.5Cr-0.5Mo-0.25V steel obtained
using different parametric functions have been compared. NL represents a functional
plot using the natural logarithm of stress as suggested by Zarrabi (1993). See
text for details about the rest of the representations.

Although the method described assumes that the tube thins down at identical rates 
both from the inner and the outer walls it can easily be extended to cases where these are 
different. Work in this direction is in progress. However, a rough analysis indicates that 
for heater tubes having a low thickness to diameter ratio, as is commonly observed, life 
estimates will not be very much different. Therefore the procedure described above can 
safely be used for the evaluation of the life of tubular components of a boiler for any life 
extension exercise. 


Advances in nonlinear structural dynamics


Foreword 


Structural mechanics deals with how engineering structures such as buildings, airplanes,
vehicles, rotating machines etc. behave under external service loads and natural
forces. Traditionally the subject has been divided into statics and dynamics. Thus 
structural dynamics is the study of civil, mechanical and aerospace engineering 
structures under dynamic forces due to earthquakes, wind turbulence, waves, jet noise, 
blade rotation, moving loads, mass imbalance and other causes. Many of these forces 
exhibit randomness and hence the need for stochastic structural dynamics. Over the 
years, the theory of linear systems under deterministic and stochastic inputs has been 
well developed. However owing to large displacements or inelastic material properties, 
structural systems exhibit nonlinearity. Nonlinearity in engineering dynamical systems 
is quite complicated to handle under either deterministic or stochastic inputs. Exact 
solutions are not common. Even when such solutions are available they refer to overly 
simplified systems under excitations which are often unrealistic. Thus a frontier area of 
research in structural dynamics has been the investigation of complex nonlinear 
systems under deterministic and stochastic excitations. The present issue of Sadhana 
brings out some of the recent advances in this field. In all, in this issue there are twenty 
papers contributed by research workers from eight countries. 

The first paper is a review on nonlinear random vibration. Manohar presents a detailed 
state-of-the-art review on exact and approximate approaches to nonlinear systems under 
random excitation. The response of nonlinear systems under the influence of white noise 
can be shown to be a Markov vector process. Hence the probability density of the response 
vector satisfies the Fokker-Planck-Kolmogorov (FPK) equation. However, exact 
solutions for the FPK equation are difficult to obtain. The paper by Anh discusses how 
the averaging method which is popular in deterministic problems can be used to 
integrate the FPK equation. The van der Pol oscillator has been the classical example 
of a nonlinear system exhibiting a basic limit cycle. Naess and Hegstad study this 
oscillator under a white noise excitation. They solve the associated FPK equation 
numerically by the path integral method. Experimental work in nonlinear random 
vibration is difficult and hence uncommon. Ibrahim and Lee have experimentally 
investigated the random response of a nonlinear beam under white noise excitation to 
verify the results of closure methods and numerical simulation. Heuer et al study the 
nonlinear vibration of shallow shells made up of composite materials under white noise 
excitations. The response is a diffusion process and hence using the FPK equation 
approach it is possible to arrive at some interesting results on the probability of snap 
buckling of the structure. Homology design attempts to realize the desired modes of 
vibration of structures with specified constraints. Nakagiri and Yoshikawa investigate 
such a problem for the out-of-plane vibration of a planar frame under uncertain boundary 
conditions, which introduce nonlinearity into the problem. 

The next three papers investigate active control of structures under random inputs. 
Karmeshu and Schurz study the stability of simple linear and nonlinear elastic 
oscillators with distributed delays under stochastic seismic inputs. In practice, civil 
engineering structures such as tall buildings are hysteretic. Suzuki presents a method 
for optimal control of hysteretic structural systems under earthquake excitations, 
modelled in terms of white noise processes. The system and observation equations are 
handled in terms of Ito stochastic calculus. Approximate solution procedures based on 
linearization and closure techniques have been presented. Raju and Narayanan 
investigate the active control of a moving vehicle modelled as a two degree-of-freedom 
system with nonlinear suspensions. Equivalent linearization approximation has been 
used to arrive at a linear model of the problem for further analysis. Dynamics of 
offshore structures is a typical practical example wherein nonlinearity and stochasticity 
are present. Venkataramana and Kawano study the response of two models of offshore 
structures in random seas under seismic excitation. Nonlinearity enters through the 
hydrodynamic damping term. Datta et al study the nonlinear response of guyed towers, 
under random wave forces. These structures are extremely flexible with very low 
natural frequencies. This necessitates inclusion of second-order wave forces in the 
dynamic analysis. 

In recent years the phenomenon of chaos in nonlinear oscillators has been attracting 
considerable attention. There are six papers devoted to various aspects of chaos. Sekar 
and Narayanan present a state-of-the-art review. Namachchivaya et al study the 
bifurcation behaviour in two-degree-of-freedom shallow arches under periodic and 
stochastic excitations. Through the Melnikov method it is shown that chaos is possible. 
Schiehlen and Hu introduce a stability measure to study system sensitivity to initial 
conditions and other parameters. The concept is illustrated with the help of the Duffing 
oscillator. Popp et al study in some detail friction induced stick-slip oscillators. They 
investigate for a friction oscillator the bifurcation behaviour and routes to chaos. 
Bontempi and Casciati consider the nonlinear dynamics of a truss with several bars and 
nodes. It is demonstrated that the Kolmogorov entropy can be used as a tool to detect 
chaotic response. Chandiramani et al investigate the nonlinear flutter of a buckled 
composite panel. Their theory includes effects of in-plane edge restraints, initial 
imperfections, transverse shear and direct stresses in the model. Periodic solutions and 
bifurcations are found under a supersonic flow. They demonstrate that chaos is 
possible under certain conditions. 

The last three papers address three different problems in nonlinear structural 
dynamics. Ghanem et al study the nonlinear oscillations of a two degree-of-freedom 
structural cylinder model in oscillatory flow. Hydrodynamic forces are imposed using 
available results from theoretical fluid mechanics. Higher order harmonic solutions are 
obtained using classical methods of nonlinear oscillations. Nonlinear mechanical 
systems are popularly analysed through equivalent linearization approximation. Fai 
Ma proposes a method for handling the equations of motion of linearized controlled 
structures. Such systems possess stiffness and damping matrices which are not necessarily 
symmetric. The method is an extension of classical modal analysis and hence 
appeals to structural engineers. Special approaches have been developed in the active 
control of space structures. One such method is by embedding piezoelectric materials in 
the panels so that sensing and actuating mechanisms become part of the structure. 
Reddy and Mitchell develop theories for laminated composite plates with piezoelectric 
laminae. These theories include coupling between the elastic, thermal and electrical 
fields. The nonlinear equations of motion for in-plane and bending vibrations of such 
plates have been derived. 

It has given me immense satisfaction to have worked on this special issue. I am 
grateful to the authors and to the reviewers for their cooperation. I thank the editorial 
board of Sadhana for giving me this opportunity to bring out this special issue. As guest 
editor I enjoyed my interactions with the editorial office of the Indian Academy of 
Sciences. 


June 1995 
Roorkee 


R N IYENGAR 
Guest Editor 
Methods of nonlinear random vibration analysis 

C S MANOHAR 

Department of Civil Engineering, Indian Institute of Science, Bangalore 
560012, India 

Abstract. The various techniques available for the analysis of nonlinear 
systems subjected to random excitations are briefly introduced and an 
overview of the progress which has been made in this area of research is 
presented. The discussion is mainly focused on the basis, scope and 
limitations of the solution techniques and not on specific applications. 

Keywords. Nonlinear systems; random excitations; nonlinear vibration; 
vibration analysis. 


1. Introduction 

Random vibration methods are extensively used in earthquake, wind, transportation 
and offshore structural engineering applications. Here, the uncertainties in specifying 
the forces acting on the structure are quantified using sophisticated load models based 
on the theory of probability and stochastic processes. Consequently, the response 
analysis of structures is also carried out in a probabilistic framework which eventually 
leads to the assessment of the safety of the structure. In order to maintain a consistent 
level of sophistication in modelling, the vibrating structure also needs to be modelled 
with care. This concern leads to questions on modelling nonlinear behaviour of the 
structural system and also on modelling uncertainties in specifying the structural 
parameters themselves. The questions of structural nonlinearity are particularly 
important while addressing the problem of failures and safety assessments, especially, 
since the nonlinear response is, at times, radically different from the one obtained using 
a simplified linear model. These questions offer considerable challenge to the analyst 
and are currently being actively pursued in vibration engineering research as evidenced 
by a continuous stream of publications in leading international journals. The present 
paper aims at presenting an overview of the research work in this field highlighting the 
developments which have taken place over the last decade. The emphasis is, therefore, 
to focus on the various techniques and methodologies, which have admittedly come to 
stay as powerful tools in dealing with problems encountered frequently in the area of 
nonlinear random vibrations. 

The sources of nonlinearities in vibration problems can be categorized into four groups: 

• Geometric nonlinearities arising out of large deformations; 

• nonlinear elastic and dissipation properties of the structural material; 
• topological causes as in the case of vibroimpact systems such as rocking blocks and 
systems with stoppers; 

• fluid-structure interactions leading to nonlinear couplings. 

Table 1 lists a few examples which have been studied in the literature in the context of 
nonlinear random vibration of engineering structures. In these problems closed form 
solutions are rarely possible. Moreover, there exists no single general analytical 
procedure which leads to acceptable solution under all circumstances. The influential 
factors in formulating the solution procedures are 

• System degrees of freedom, nature of nonlinearity (that is, nonlinearity in mass, 
stiffness or damping and symmetry/asymmetry of nonlinearity), predominance of the 
nonlinearity in affecting the system behaviour, including quasiperiodicity and 
bifurcations, in the absence of random excitations, 

• Stationarity/nonstationarity of excitation, probability distribution and strength of 
excitation, 

• Excitation bandwidth in relation to system bandwidth, 

• Mechanism of excitation, that is, external or parametric, 

• Response variables of interest. 

Accordingly, several approximate solution procedures have been developed which lead 
to acceptable solutions in specific problem domains. Mostly, the approximations are 
based on the Markovian nature of the response or on the proximity of the response 
probability density function (pdf) to Gaussian distributions. Many of the methods are 
ingenious extensions of deterministic nonlinear analysis procedures to stochastic 
problems. A discussion on the following methods is presented in the sequel: (i) Markov 
vector approach, (ii) Perturbation, (iii) Equivalent linearization, (iv) Equivalent 
nonlinearization, (v) Closure, (vi) Stochastic averaging, (vii) Stochastic series solution and 
(viii) Digital simulations. 


2. Markov vector approach 

When the inputs arise from Gaussian white noise processes the response will be 
a diffusion process and the associated transitional pdf (tpdf) will satisfy the well-known 
Kolmogorov equations. The governing equations of motion in these situations can be 
cast in the form of equations of the Ito type as follows: 

$$ dX(t) = f(X(t), t)\,dt + G(X(t), t)\,dB(t), \tag{1} $$

under the initial conditions 

$$ X(t_0) = Y, \tag{2} $$

where X(t) = n × 1 response vector, f(X(t), t) = n × 1 vector, G(X(t), t) = n × m matrix, 
B(t) = m × 1 vector of the Brownian motion processes having the properties 

$$ E[\Delta_j(t)] = E[B_j(t + \Delta t) - B_j(t)] = 0, \tag{3} $$

$$ E[\Delta_i(t)\,\Delta_j(t)] = 2 D_{ij}\,\Delta t, \tag{4} $$

and Y = n × 1 vector of initial conditions independent of B(t). Here E[·] represents the 
mathematical expectation operator. The above representation of equations of motion 
is fairly general in the sense that it allows for: (1) Multi-degree linear/nonlinear discrete 
systems, (2) external and parametric excitations, (3) nonstationary excitations, (4) 
nonwhite excitations, in which case, additional filters to model inputs as filtered white 
noise processes are to be appended to the system equations with a consequent increase 
in the size of the problem and (5) random initial conditions. The Kolmogorov 
equations satisfied by the response tpdf, $p(x,t|y,t_0)$, are 


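• the Chapman-Kolmogorov-Smoluchowski (CKS) integral equation 

$$ p(x,t|y,t_0) = \int_{-\infty}^{\infty} p(x,t|z,\tau)\, p(z,\tau|y,t_0)\, dz, \tag{5} $$

• the forward equation or the Fokker-Planck-Kolmogorov (FPK) equation 

$$ \frac{\partial p(x,t|y,t_0)}{\partial t} = -\sum_{j=1}^{n} \frac{\partial}{\partial x_j}\left[ f_j(x,t)\, p(x,t|y,t_0) \right] + \sum_{i,j=1}^{n} \frac{\partial^2}{\partial x_i \partial x_j}\left[ (GDG^T)_{ij}\, p(x,t|y,t_0) \right], \tag{6} $$

• the backward equation 

$$ \frac{\partial p(x,t|y,t_0)}{\partial t_0} = -\sum_{j=1}^{n} f_j(y,t_0) \frac{\partial p(x,t|y,t_0)}{\partial y_j} - \sum_{i,j=1}^{n} (GDG^T)_{ij} \frac{\partial^2 p(x,t|y,t_0)}{\partial y_i \partial y_j}. \tag{7} $$
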

In these equations, the superscript T denotes the matrix transpose operation. The first 
of these equations represents the consistency condition for the response process to be 
Markov. Equation (7) is the adjoint of (6) and these two equations can be derived using 
(5) together with the equation of motion given by (1). It is of interest to note that the 
forward and backward equations are also satisfied by several other response 
probability functions of interest. Thus, for instance, the probability, $Q(t|y,t_0)$, that first 
passage across a specified safe domain will not occur in the time interval $t_0$ to $t$ for 
trajectories in the phase plane starting at y at $t = t_0$, can be shown to satisfy the 
backward Kolmogorov equation. The formulation of these equations leads to the exact 
response characterization of a limited class of problems and helps in formulating 
strategies for approximate analysis for a wider class of problems. The details of the 
derivation of these equations along with a discussion on the initial conditions, 
boundary conditions, well posedness, eigenvalues and eigenfunctions and the existence, 
uniqueness and stability of stationary solutions can be found in the works of 
Bharucha-Reid (1960), Stratonovich (1963), Caughey (1963a, 1971), Feller (1966), Fuller (1969) 
and Roberts (1986a). A comprehensive treatment of the FPK equation and its 
application in physical sciences is available in the monographs by Risken (1989) and 
Horsthemke & Lefever (1984). 
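
Sample paths of an Ito equation of the form (1) can be generated numerically, which is useful for checking any solution of the Kolmogorov equations. The sketch below uses the Euler-Maruyama scheme, a standard discretization that is not described in the text above. It follows the convention of (4), so the Brownian increments have covariance 2DΔt. The function name `euler_maruyama` and the Duffing parameters are illustrative assumptions.

```python
import numpy as np

def euler_maruyama(f, G, D, y0, t0, t1, n_steps, rng):
    """Euler-Maruyama march of dX = f(X,t) dt + G(X,t) dB with E[dB dB^T] = 2 D dt.

    f returns an n-vector, G an n x m matrix, D is the m x m (positive definite)
    matrix appearing in (4). Returns an (n_steps + 1) x n array of states.
    """
    D = np.atleast_2d(np.asarray(D, dtype=float))
    chol = np.linalg.cholesky(2.0 * D)        # factor of the increment covariance per unit time
    dt = (t1 - t0) / n_steps
    x = np.array(y0, dtype=float)
    path = np.empty((n_steps + 1, x.size))
    path[0] = x
    t = t0
    for k in range(n_steps):
        dB = np.sqrt(dt) * (chol @ rng.standard_normal(D.shape[0]))
        x = x + f(x, t) * dt + G(x, t) @ dB
        t += dt
        path[k + 1] = x
    return path

# Illustrative (assumed) Duffing oscillator: x'' + c x' + k x + alpha x^3 = white noise.
c, k, alpha = 0.5, 1.0, 0.3
f = lambda x, t: np.array([x[1], -c * x[1] - k * x[0] - alpha * x[0] ** 3])
G = lambda x, t: np.array([[0.0], [1.0]])
path = euler_maruyama(f, G, D=[[0.5]], y0=[0.0, 0.0], t0=0.0, t1=50.0,
                      n_steps=5000, rng=np.random.default_rng(0))
```

Nonwhite inputs would be handled, as noted above, by appending filter states to `f` and `G`, at the cost of a larger state vector.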


2.1 Exact solutions 

The complete solution of the FPK equation is obtainable for all externally forced linear 
oscillators (Lin 1967) and for a class of first order nonlinear systems (Caughey & Dienes 
1961; Stratonovich 1963; Atkinson & Caughey 1968; Atkinson 1973). In these solutions 
either the Fourier and Laplace transform techniques or the method of eigenfunction 
expansion is used. Methods based on group theory have also been developed (Bluman 
1971). The stationary solution, when it exists, can be found for all first order systems and for 
a limited set of higher order systems. A general class of systems for which exact stationary 
solution under external white noise excitation is possible is discussed by Caughey & Ma 
(1982a, 1982b). This set includes single degree of freedom (sdof) systems with nonlinear 
stiffness and a class of sdof and multi-degree of freedom (mdof) systems with nonlinear 
damping and stiffness. Dimentberg (1982) has obtained stationary pdf for a specific sdof 
system in which both parametric and external white noise excitations are present. This 
solution has been obtained through an inverse procedure (Dimentberg 1988a). Here an 
approximate solution is first obtained based on the method of stochastic averaging. 
This solution is substituted into the governing FPK equation for the original system. 
This equation would be exactly satisfied provided the system’s parameters are related 
in a certain special way. Thus, this subset of parameters defines a class of systems for 
which the governing reduced FPK equation is solvable. The concept of detailed 
balance developed earlier by physicists (Gardiner 1983; Risken 1989), has been used by 
Yong & Lin (1987) and Langley (1988a) to obtain exact stationary solution for a class of 
sdof and mdof nonlinear systems under white noise excitations. In this method, the 
components of response vector Y are classified as either even or odd depending upon 
their behaviour under a time reversal of t to — t. The even variables do not change their 
sign whereas the odd variables undergo a change of sign. These are denoted as 

$$ \tilde{x}_i = \varepsilon_i x_i, \quad \text{no summation on } i, \tag{8} $$

where $\varepsilon_i = 1$ for even variables and $\varepsilon_i = -1$ for odd variables. The state of detailed 
balance is defined as 

$$ p(x,t|y,t_0) = p(\tilde{y},t|\tilde{x},t_0), \quad t > t_0. \tag{9} $$

In the steady state, in terms of the drift coefficients $A_i$ and diffusion coefficients $B_{ij}$, this 
condition is given by 

$$ A_i(x)\,p(x) + \varepsilon_i A_i(\tilde{x})\,p(x) - \frac{\partial}{\partial x_j}\left[ B_{ij}(x)\,p(x) \right] = 0, \tag{10} $$

$$ B_{ij}(x) - \varepsilon_i \varepsilon_j B_{ij}(\tilde{x}) = 0. \tag{11} $$

Here summation on repeated index is implied. When these conditions are satisfied, the 
stationary solution expressed as 

$$ p(x) = C \exp[-U(x)], \tag{12} $$

can be obtained by solving the equation for U(x), the generalized potential. The class of 
problems that can be solved using this method is shown to include the problems 
considered by Caughey & Ma (1982a, b) and Dimentberg (1982). Furthermore, Lin & 
Cai (1988) and Cai & Lin (1988a) have shown that the exact stationary solution as in 
(12) can still be obtained even when one of the conditions for detailed balance, namely 
(11), is not satisfied. This class of systems has been termed as belonging to the class of 
generalized potential. More general classes of exactly solvable FPK equations have been 
discussed by Zhu et al (1990) and To & Li (1991). 
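
As a worked illustration of an exponential-of-potential stationary solution, consider an sdof oscillator with linear damping c and a nonlinear restoring force g(x) under external white noise. The example is an assumption chosen for clarity, not one taken from the papers cited above; the noise intensity follows the convention E[(ΔB)^2] = 2DΔt. Substituting the last line into the second confirms that the reduced FPK equation is satisfied identically.

```latex
% Illustrative check: oscillator with nonlinear stiffness under external white noise.
\begin{align*}
dx_1 &= x_2\,dt, \qquad dx_2 = -\left[c\,x_2 + g(x_1)\right]dt + dB(t),\\
0 &= -x_2\frac{\partial p}{\partial x_1}
     + \frac{\partial}{\partial x_2}\left\{\left[c\,x_2 + g(x_1)\right]p\right\}
     + D\frac{\partial^2 p}{\partial x_2^2},\\
p(x_1,x_2) &= C\exp\left[-U(x_1,x_2)\right], \qquad
U = \frac{c}{D}\left[\frac{x_2^2}{2} + \int_0^{x_1} g(\xi)\,d\xi\right].
\end{align*}
% Duffing case, g(x) = \omega^2 (x + \varepsilon x^3):
%   U = (c/D) [ x_2^2/2 + \omega^2 ( x_1^2/2 + \varepsilon x_1^4/4 ) ].
```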

2.2 Approximate methods 

For analysing problems possessing no exact solutions one has to take recourse to 
approximate methods. An iterative procedure based on the parametrix method for 
studying existence and uniqueness of solutions of partial differential equations (Friedman 
1964) has been used by a few authors (Caughey 1971; Mayfield 1973). It has been 
shown that both transient and steady state solutions can be obtained using this 
method. Also, the method is useful in improving approximate results obtained by other 
techniques. However, the method has not been used widely in random vibration studies 
(Roberts 1981). Payne (1968) used a combination of perturbation and eigenfunction 
expansion techniques to analyse nearly linear first order systems under white noise 
inputs. He has derived perturbation expansion for eigenvalues and eigenfunctions of 
the FPK equation up to $O(\varepsilon^2)$ and obtained the corresponding expression for the response 
autocorrelation function. Here $\varepsilon$ denotes a smallness parameter associated 
with the nonlinearity. Iwan & Spanos (1978) have employed similar techniques to 
obtain the nonstationary response envelope distribution of a linear sdof system. Johnson & 
Scott (1979, 1980) considered the first order system studied by Payne (1968) and 
extended his analysis to compute expansion terms up to $O(\varepsilon^7)$. Furthermore, they have 
also applied this method to second-order systems. 

Atkinson (1973) has used an adjoint variational method to find the eigenvalues of the 
FPK operator. He has generated trial functions that are orthogonal to known 
stationary solution and has determined response power spectral density (psd) for the 
case of a Duffing oscillator, a bang-bang system and a system with nonlinear damping. 
Toland et al (1972) proposed a random walk analogy based on a discrete approximation 
to continuous Markov process, and obtained a recurrence relation for response 
probabilities. This technique is equivalent to using finite difference approximation on 
the FPK equation and is time consuming especially when the domain of integration is 
large. The eigenfunction expansion method is applicable when the time and space 
variables in the FPK equation can be separated which is generally possible when the 
drift and diffusion terms are time invariant. In a more general context, the methods of 
weighted residual have been employed by several authors. Thus, Bhandari & Sherrer 
(1968) have used the Galerkin technique to find the stationary response of sdof and 
a two degree of freedom system with polynomial nonlinearities. Wen (1975, 1976) has 
extended this analysis to nonstationary response analysis and studied the response of 
the Duffing oscillator and a hysteretic system. The use of the method in determining psd 
response and the first excursion failure probabilities of response has also been 
indicated. A similar technique has been used by Solomos & Spanos (1984) for obtaining 
the amplitude response of a linear sdof system under evolutionary random excitation. 
Another variant of the weighted residual technique, namely, the method of moments, 
has been used by Fujita & Hattori (1980) for the analysis of sdof systems with collisions 
under modulated white noise input. Langley (1985) has applied the finite element 
method to solve the two dimensional FPK equation associated with the stationary 
response of a Duffing oscillator and a ship rolling problem. The domain in the phase 
space to be covered by finite elements was estimated using an equivalent linearization 
solution. An improper selection of the extent of the domain is shown to result in 
negative values for response pdf. Bergman & Spencer (1992) have studied the transient 
solutions of the FPK equations of several second order nonlinear systems using finite 
element method. 
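
To make the grid-based approach concrete, the sketch below marches a one-dimensional FPK equation, p_t = -(f p)_x + D p_xx, with an explicit finite-difference (finite-volume) scheme and compares the long-time result with the known stationary density. It is a minimal illustration under stated assumptions, not a reproduction of any cited scheme: the first order system dX = -(X + εX^3) dt + dB with E[(ΔB)^2] = 2DΔt, the grid, the time step and the function name are all chosen here for illustration.

```python
import numpy as np

def fpk_finite_difference(drift, D, x, p0, t_end, dt):
    """March p_t = -(drift * p)_x + D * p_xx on a uniform grid with zero-flux ends."""
    dx = x[1] - x[0]
    f_half = drift(0.5 * (x[:-1] + x[1:]))    # drift at the cell interfaces
    p = p0.copy()
    for _ in range(int(round(t_end / dt))):
        # probability flux J = drift * p - D * p_x at the interior interfaces
        flux = f_half * 0.5 * (p[:-1] + p[1:]) - D * (p[1:] - p[:-1]) / dx
        flux = np.concatenate(([0.0], flux, [0.0]))   # no probability leaves the domain
        p = p - dt * (flux[1:] - flux[:-1]) / dx
    return p

eps, D = 0.5, 0.5                             # assumed nonlinearity and noise level
x = np.linspace(-3.0, 3.0, 121)
dx = x[1] - x[0]
drift = lambda z: -(z + eps * z ** 3)

p0 = np.exp(-x ** 2 / 0.1)                    # narrow initial density centred at the origin
p0 /= p0.sum() * dx
p_num = fpk_finite_difference(drift, D, x, p0, t_end=5.0, dt=0.001)

# Exact stationary density of this first order system, for comparison.
p_exact = np.exp(-(x ** 2 / 2.0 + eps * x ** 4 / 4.0) / D)
p_exact /= p_exact.sum() * dx
print("max abs error:", np.max(np.abs(p_num - p_exact)))
```

The explicit time step must satisfy the usual diffusion stability limit, which is one reason such schemes become expensive when the domain of integration is large.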

Other discretization procedures based on path integral formalisms (Wehner & Wolfer 
1983; Kapitaniak 1985, 1986; Naess & Johnsen 1993) and cell mapping techniques 
(Sun & Hsu 1990) have also been developed. These methods are related to the iterative 
technique discussed by Crandall et al (1966) and are based on an assumption that 
the tpdf over short time steps is Gaussian. Sun & Hsu have used a Gaussian closure 
approximation to evaluate the transitional probabilities, while Naess and Johnsen 
have employed cubic B-splines to represent the tpdf. It is to be noted that although the 
analysis makes the short-time Gaussian approximation for the tpdf, the global 
non-Gaussian nature of the unconditional pdf is, nevertheless, captured by the analysis. 
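
The idea of repeatedly applying a short-time Gaussian tpdf can be sketched for a scalar system as follows. This is a simplified illustration only: the integral in the CKS equation is evaluated with a plain rectangle rule on a fixed grid rather than with the B-spline or cell-mapping machinery of the cited works, and the system dX = -(X + εX^3) dt + dB with E[(ΔB)^2] = 2DΔt, the grid and the step size are assumptions.

```python
import numpy as np

def short_time_kernel(drift, D, x, dt):
    """Gaussian short-time tpdf K[i, j] ~ p(x_i, t + dt | x_j, t) on a uniform grid."""
    mean = x + drift(x) * dt                  # Euler estimate of the conditional mean
    var = 2.0 * D * dt                        # conditional variance, from E[dB^2] = 2 D dt
    diff = x[:, None] - mean[None, :]
    return np.exp(-diff ** 2 / (2.0 * var)) / np.sqrt(2.0 * np.pi * var)

eps, D, dt = 0.5, 0.5, 0.02
x = np.linspace(-4.0, 4.0, 161)
dx = x[1] - x[0]
drift = lambda z: -(z + eps * z ** 3)
K = short_time_kernel(drift, D, x, dt)

p = np.exp(-x ** 2 / 0.1)                     # narrow initial density
p /= p.sum() * dx
for _ in range(500):                          # march to t = 10
    p = (K @ p) * dx                          # one application of the CKS integral
    p /= p.sum() * dx                         # renormalise against quadrature error

p_exact = np.exp(-(x ** 2 / 2.0 + eps * x ** 4 / 4.0) / D)
p_exact /= p_exact.sum() * dx
print("max abs error:", np.max(np.abs(p - p_exact)))
```

Each step uses a Gaussian kernel, yet the converged density is clearly non-Gaussian, which is the point made in the paragraph above.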

2.3 Generalizations and moment equations 

Generalization of the FPK equation to non-diffusive Markov processes has been 
discussed by Pawula (1967). Here the inputs are modelled as white noise arising out of 
non-Gaussian processes. The response in such cases will still have the Markovian 
property but the equation of motion of the transitional pdf will have infinite number 
of terms. Tylikowski & Marowski (1986) have considered the response of a Duffing 
oscillator to Poissonian impulse excitation and have shown that the transitional 
pdf satisfies an integro-partial differential equation. The use of truncated generalized 
FPK equations in computing the lower order moments has been demonstrated by 
Risken (1989). 

When response moments are of interest, the governing equations for the moments 
can be derived based on the FPK equation (Soong 1973). Thus, the moments of the 
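function h[X(t), t] of the solution X(t) of (1) can be shown to be governed by the 
equation 

$$ \frac{d}{dt} E[h(X,t)] = \sum_{j=1}^{n} E\left[ f_j \frac{\partial h}{\partial x_j} \right] + \sum_{i,j=1}^{n} E\left[ (GDG^T)_{ij} \frac{\partial^2 h}{\partial x_i \partial x_j} \right] + E\left[ \frac{\partial h}{\partial t} \right]. \tag{13} $$

By setting $h[X(t),t] = X_1^{k_1} X_2^{k_2} X_3^{k_3} \cdots X_n^{k_n}$ and choosing different values for $k_i$, one can 
derive equations for the most commonly used moments. These equations can be readily 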
solved for the case of deterministic nonautonomous linear systems under white noise 
inputs. This forms the basis for obtaining approximate transient solutions using 
linearization procedures. In the case of linear oscillators with parametric white noise 
excitations, the response is non-Gaussian and the associated FPK equation is not 
solvable. However, the exact response moments can be obtained by solving the 
associated moment equations. In nonlinear problems these equations form an infinite 
hierarchy and an exact solution of moment equations also is not possible. Based on the 
principle of maximum entropy, Sobczyk & Trebicki (1990, 1992) and Chang (1991) 
have developed approximate stationary solutions of nonlinear systems under parametric 
and external noise excitations. This consists of employing pdf with undetermined 
coefficients which, in turn, are found by maximizing the entropy subject to the 
constraints of the pdf normalization and the moment equations which are obtained 
through the governing FPK equations. Roy & Spanos (1991) have utilized a perturbation 
solution scheme for the moment equations and have shown that the scheme 
overcomes the problem of infinite hierarchy. They have proposed Pade type transformations 
of the series which enable the analysis to be applicable even for strongly 
nonlinear systems. Furthermore, the same authors (Roy & Spanos 1993) have studied 
the response power spectral density of nonlinear systems by utilizing formal solutions 
of the FPK equation as discussed by Risken (1989) in conjunction with a power series 
expansion in terms of Pade approximants for the response spectra. The method 
requires the knowledge of the stationary response of the FPK equation. 
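
For a linear system the moment equations close, so they can be integrated directly. The sketch below does this for the second moments of a linear sdof oscillator, dx = v dt, dv = -(2ζωv + ω²x) dt + dB with E[(ΔB)^2] = 2DΔt, using h = x², xv and v² in turn. The oscillator, its parameter values and the hand-written Runge-Kutta integrator are illustrative assumptions.

```python
import numpy as np

def moment_rhs(m, zeta, omega, D):
    """Time derivatives of (E[x^2], E[xv], E[v^2]) for the linear oscillator."""
    m_xx, m_xv, m_vv = m
    return np.array([
        2.0 * m_xv,
        m_vv - 2.0 * zeta * omega * m_xv - omega ** 2 * m_xx,
        -4.0 * zeta * omega * m_vv - 2.0 * omega ** 2 * m_xv + 2.0 * D,
    ])

def integrate_moments(zeta, omega, D, t_end, dt=0.01):
    """Classical fourth-order Runge-Kutta march from rest (all moments zero)."""
    m = np.zeros(3)
    for _ in range(int(round(t_end / dt))):
        k1 = moment_rhs(m, zeta, omega, D)
        k2 = moment_rhs(m + 0.5 * dt * k1, zeta, omega, D)
        k3 = moment_rhs(m + 0.5 * dt * k2, zeta, omega, D)
        k4 = moment_rhs(m + dt * k3, zeta, omega, D)
        m = m + dt * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return m

zeta, omega, D = 0.1, 2.0, 1.0
m_inf = integrate_moments(zeta, omega, D, t_end=60.0)
# Stationary values obtained by setting the right-hand sides to zero.
m_xx_exact = D / (2.0 * zeta * omega ** 3)
m_vv_exact = D / (2.0 * zeta * omega)
print(m_inf, (m_xx_exact, 0.0, m_vv_exact))
```

With a nonlinear restoring force the same derivation would bring in higher moments on the right-hand side, which is the infinite hierarchy referred to above.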

As has been briefly indicated earlier, the Markov property of response has also been 
used in the study of first passage probabilities. Here, either the forward or the backward 
Kolmogorov equation is solved in conjunction with appropriate boundary conditions 
imposed along the critical barriers. Alternatively, starting from the backward Kolmogorov 
equation, one can also derive equations for moments of the first passage time, 
which, in principle, can be solved recursively. Thus, denoting by T(y), the time required 
by the response trajectory of (1) initiated at the point x = y in the phase space at time 
$t = t_0$ to cross a specified safe domain for the first time, the moments $M_k = E[T^k]$, 
k = 1, 2, ..., N, can be shown to be governed by the equation 

$$ \sum_{j=1}^{n} f_j(y,t) \frac{\partial M_k}{\partial y_j} + \sum_{i,j=1}^{n} (GDG^T)_{ij} \frac{\partial^2 M_k}{\partial y_i \partial y_j} + k M_{k-1} = 0 \quad (k = 1, 2, \ldots), \tag{14} $$

with the condition $M_0 = 1$. These equations are referred to as the generalized Pontriagin-Vitt 
(GPV) equations in the literature. Although no exact analytical solution 
exists for finding M k , several approximations are available and they have been 
reviewed by Roberts (1986a). These methods include method of weighted residuals 
(Spanos 1983), random walk models (Toland & Yang 1971; Roberts 1978), finite 
difference method (Roberts 1986b), finite element method (Spencer & Bergman 1985) 
and cell mapping techniques (Sun & Hsu 1988). 
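
A finite-difference treatment of the first passage problem can be sketched for a scalar system. For dX = f(X) dt + dB with E[(ΔB)^2] = 2DΔt and absorbing barriers at y = ±a, the mean first passage time M_1(y) satisfies the two-point boundary value problem D M_1'' + f(y) M_1' = -1 with M_1(±a) = 0. The code below is an illustrative assumption of how this might be discretized with central differences; it is not taken from the cited works.

```python
import numpy as np

def mean_first_passage_time(drift, D, a, n=201):
    """Solve D*M'' + drift(y)*M' = -1 on (-a, a) with M(-a) = M(a) = 0."""
    y = np.linspace(-a, a, n)
    h = y[1] - y[0]
    A = np.zeros((n, n))
    b = -np.ones(n)
    for i in range(1, n - 1):
        f = drift(y[i])
        A[i, i - 1] = D / h ** 2 - f / (2.0 * h)
        A[i, i] = -2.0 * D / h ** 2
        A[i, i + 1] = D / h ** 2 + f / (2.0 * h)
    A[0, 0] = A[-1, -1] = 1.0                 # absorbing barriers: M = 0 at both ends
    b[0] = b[-1] = 0.0
    return y, np.linalg.solve(A, b)

D, a = 0.5, 1.0
y, M_free = mean_first_passage_time(lambda z: 0.0, D, a)   # no drift
_, M_ou = mean_first_passage_time(lambda z: -z, D, a)      # linear restoring drift
M_free_exact = (a ** 2 - y ** 2) / (2.0 * D)                # closed form without drift
print(M_free[len(y) // 2], M_ou[len(y) // 2])
```

Higher moments follow recursively by replacing the right-hand side -1 with -k M_{k-1}, and the restoring drift lengthens the mean time to reach the barriers relative to free diffusion.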


2.4 Summary 


The FPK equation approach is the only source of exact solutions in nonlinear random 
vibration problems. It also forms a powerful tool for approximate analysis. The 
application of the method is, however, limited to Markovian responses. The solution 
procedures are not tractable when dealing with large number of variables or with 
nonstationary inputs. Although the method is applicable when the input is a filtered 
white noise, this class of problems has not received much attention in the literature. 


3. Perturbation method 

This is a straightforward extension of the technique used in deterministic problems. 
The method is applicable when the equations of motion contain a small parameter 
characterizing the nonlinearity in the system. The solution is expanded in a power 
series in the small parameter, which leads to a set of linear differential equations which 
can further be solved sequentially. The method is applicable to both sdof and mdof 
systems under additive or multiplicative, stationary or nonstationary stochastic 
inputs. It was first used by Crandall (1963) to evaluate response moments of sdof 
and mdof systems with nonlinear stiffness under stationary Gaussian excitations. 
Shimogo (1963a, b) considered symmetric and asymmetric nonlinear systems under 
stationary random inputs and computed the response psd using an iterative technique, 
which, in essence, is identical to the perturbation method. Crandall et al (1964) 
and Khabbaz (1965) applied the method to systems with nonlinear damping and 
evaluated the response psd. They also indicated the possibility of the even ordered 
response moments and the psd function becoming negative for large values of the 
nonlinearity parameter. To a first order of approximation, the psd function obtained 
using this method and the equivalent linearization technique have been shown to be 
identical (Crandall 1964). Manning (1975) has estimated the response of the Duffing 
oscillator to stationary excitation and has shown that by evaluating the response to 
second order of the nonlinearity parameter, it is possible to display the effects of 
nonlinear resonances in the response psd. The response of the Duffing oscillator to 
nonstationary random inputs has been studied by Soni & Surendran (1975). The 
application of the method to systems with random parametric excitation has been 
discussed by Soong (1973). 

The perturbation method is well suited for polynomial nonlinearities and is useful in 
computing response moments and sometimes the psd function. Determination of the 
response pdf using this method is, however, not possible because of the non-Gaussian 
nature of the higher order corrections. Furthermore, obtaining second or higher order 
corrections involves cumbersome calculations and is not practicable. It can be noted 
that the method is asymptotic in nature and the accuracy markedly worsens with the 
increase in the value of the nonlinearity parameter. 
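
The loss of accuracy with increasing nonlinearity can be seen in a small numerical experiment. The sketch assumes a Duffing oscillator with restoring force ω²(x + εx³) under white noise, for which the exact stationary displacement density is proportional to exp[-(x²/2 + εx⁴/4)/σ0²], where σ0² is the mean square response of the linear system (ε = 0). Expanding this density to first order in ε gives E[x²] ≈ σ0²(1 - 3εσ0²). The function names and parameter values are illustrative.

```python
import numpy as np

def exact_mean_square(eps, sigma0_sq, n=20001):
    """E[x^2] from the exact stationary pdf, by quadrature on a uniform grid."""
    half_width = 10.0 * np.sqrt(sigma0_sq)
    x = np.linspace(-half_width, half_width, n)
    w = np.exp(-(x ** 2 / 2.0 + eps * x ** 4 / 4.0) / sigma0_sq)
    return np.sum(x ** 2 * w) / np.sum(w)     # grid spacing cancels in the ratio

def perturbation_mean_square(eps, sigma0_sq):
    """First-order perturbation estimate of E[x^2]."""
    return sigma0_sq * (1.0 - 3.0 * eps * sigma0_sq)

for eps in (0.01, 0.1, 1.0):
    print(eps, exact_mean_square(eps, 1.0), perturbation_mean_square(eps, 1.0))
```

For ε = 1 the first-order estimate of a mean square value is negative, which is the kind of breakdown mentioned above for large values of the nonlinearity parameter.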


4. Equivalent linearization 

This method is by and large the most popular approach in nonlinear random vibration 
problems. It is an extension of the well known harmonic linearization technique to 
stochastic problems and is applicable to both sdof and mdof systems under stationary 
or nonstationary inputs. The method consists of optimally approximating the 
nonlinearities in the given system by linear models so that the resulting equivalent system 
is amenable for solution. For evaluating the parameters in the equivalent system, 
an additional assumption that the response is Gaussian is generally made. This method 
was developed in the 1950s in the context of random vibration problems (see Caughey 
1963b for earlier references) and nonlinear stochastic controls (Booton 1954). The 
subsequent developments of the method in vibration problems may be found in the 
works of Spanos (1981a), Roberts & Spanos (1990) and Socha & Soong (1991) and 
those in the field of control in the work of Sinitsyn (1974). 
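
The procedure can be sketched for a Duffing oscillator with restoring force ω²(x + εx³) under white noise. Minimizing the mean square difference between ω²εx³ and a linear term, with the Gaussian assumption E[x⁴] = 3(E[x²])², gives the equivalent stiffness ω²(1 + 3εE[x²]); since the mean square response of the linear system is inversely proportional to stiffness, E[x²] = σ0²/(1 + 3εE[x²]), where σ0² is the response for ε = 0. The code solves this by fixed-point iteration and compares it with the exact stationary result. The choice of system, the iteration and all names are illustrative assumptions.

```python
import numpy as np

def exact_mean_square(eps, sigma0_sq, n=20001):
    """E[x^2] from the exact stationary pdf exp[-(x^2/2 + eps*x^4/4)/sigma0_sq]."""
    half_width = 10.0 * np.sqrt(sigma0_sq)
    x = np.linspace(-half_width, half_width, n)
    w = np.exp(-(x ** 2 / 2.0 + eps * x ** 4 / 4.0) / sigma0_sq)
    return np.sum(x ** 2 * w) / np.sum(w)

def equivalent_linearization(eps, sigma0_sq, tol=1e-12, max_iter=10000):
    """Fixed-point iteration for the mean square response of the equivalent linear system."""
    s = sigma0_sq                             # start from the linear response
    for _ in range(max_iter):
        s_new = sigma0_sq / (1.0 + 3.0 * eps * s)   # stiffness factor 1 + 3*eps*E[x^2]
        if abs(s_new - s) < tol:
            return s_new
        s = s_new
    raise RuntimeError("equivalent linearization iteration did not converge")

eps, sigma0_sq = 0.1, 1.0
s_el = equivalent_linearization(eps, sigma0_sq)
s_closed = (-1.0 + np.sqrt(1.0 + 12.0 * eps * sigma0_sq)) / (6.0 * eps)  # root of the quadratic
s_exact = exact_mean_square(eps, sigma0_sq)
print(s_el, s_closed, s_exact)
```

The iteration is used here only to mirror how the equivalent parameters are found in larger problems; for this scalar case the quadratic can be solved directly.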

Caughey (1963b) applied this technique to find stationary response of nonlinear 
sdof systems and a class of mdof systems under stationary inputs. The method was 
generalized to a wider class of mdof systems by Foster (1968), Iwan & Yang (1972) and 
Atalik (1974). When nonstationary response is of interest the equivalent parameters 
will be functions of time and accordingly the equivalent linear system will be time 
varying. For the case of Markovian responses Iwan & Mason (1980) and Wen (1980) 
have obtained the nonstationary response of mdof systems by solving numerically the 
moment equations derived from the governing FPK equation. Spanos (1980b) modified 
the method to deal with mdof systems having asymmetric nonlinearities in which 
case the response has a nonzero mean. When the inputs are nonwhite the solution of 
time varying equivalent linear system is generally difficult. Ahmadi (1980b) and Sakata 
& Kimura (1980) have suggested different schemes to deal with such problems. These 
schemes have also been extended to analyze the nonstationary response of asymmetric 
sdof and mdof systems to nonwhite inputs (Kimura & Sakata 1981, 1987). In the study 
of continuous nonlinear systems, linearization can be done after discretizing the 
equation of motion or at the level of the partial differential equation itself. The latter class 
of problems has been studied by Iwan & Krousgrill (1983) and Iwan & Whirley (1993). 






4.1 Extensions and improvements 

The method is versatile, easy to implement and computationally efficient. It is 
applicable when the nonlinearities and the excitations are such that the response is 
unimodal and nearly Gaussian. This does not necessarily imply that the nonlinearity 
should be small. Crandall (1973) has demonstrated that for a first order system with 
cubic nonlinearities, the mean square response using this technique is fairly accurate 
even when the nonlinearity parameter is of the order of 100. The method invariably leads to 
Gaussian response pdf and fails to display nonlinear resonances in the response psd. 
Furthermore, the method is not applicable to systems with parametric excitations. 
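
The claim about strong nonlinearity can be checked numerically on a simple case. The sketch below is not a reproduction of Crandall's calculation; it assumes the first order system x' + a (x + eps x^3) = w(t) with E[w(t) w(t+tau)] = 2 D delta(tau), for which the exact stationary pdf is proportional to exp(-(x^2/2 + eps x^4/4) / var0) with var0 = D / a, and for which Gaussian linearization gives 3 eps var^2 + var - var0 = 0. All names are illustrative.

```python
import math

def exact_variance(eps, var0=1.0, n=20001):
    """E[x^2] for the pdf proportional to exp(-(x^2/2 + eps x^4/4) / var0)."""
    # The integrand is negligible beyond a few linear standard deviations.
    x_max = 8.0 * math.sqrt(var0)
    h = 2.0 * x_max / (n - 1)
    num = den = 0.0
    for i in range(n):
        x = -x_max + i * h
        weight = math.exp(-(0.5 * x * x + 0.25 * eps * x ** 4) / var0)
        num += x * x * weight
        den += weight
    return num / den          # the grid spacing cancels in the ratio

def linearized_variance(eps, var0=1.0):
    """Positive root of 3 eps var^2 + var - var0 = 0 (Gaussian linearization)."""
    if eps == 0.0:
        return var0
    return (-1.0 + math.sqrt(1.0 + 12.0 * eps * var0)) / (6.0 * eps)

for eps in (0.1, 1.0, 10.0, 100.0):
    exact, approx = exact_variance(eps), linearized_variance(eps)
    print(f"eps={eps:6.1f}  exact={exact:.4f}  linearized={approx:.4f}  "
          f"ratio={approx / exact:.3f}")
```

In this assumed setting the linearized mean square underestimates the exact value, and the ratio of the two remains of order one as eps grows, which is consistent with the observation that the method does not require the nonlinearity to be small.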

Several modifications to the method have been proposed to overcome some of the 
above limitations. Crandall (1973) has obtained a nonlinear integral equation for the 
improved estimate for the stationary response autocorrelations. Generalization of the 
method for the case of nonlinear systems with both parametric and external white noise 
excitations has been suggested by Bruckner & Lin (1987a). Here the original Ito 
equation for the nonlinear system is optimally replaced by a linearized Ito equation so 
that the first and the second moments computed from the equivalent system have 
minimum mean square error. For evaluating equivalent parameters use is made of the 
higher order moment equations, and neither an appeal to the Gaussianness of the response 
nor any other closure approximation is necessary. In the methods discussed 
above the nonlinear system of a given order is invariably replaced by an equivalent 
system of the same order. Iyengar (1988a) has explored the possibility of replacing 
a given nonlinear system by an equivalent linear system of a higher order. In his study 
the nonlinear terms of the given equation are substituted by new dependent variables. 
Additional equations which govern these new variables and which are nonlinear in 
nature are obtained by suitably differentiating the given nonlinear equation. The 
resulting higher order system of nonlinear equations is further analyzed using the usual 
linearization scheme. For the case of the Duffing oscillator under white noise excitation, 
the results obtained using this method are shown to be better than the usual 
linearization solution and in particular, the response psd obtained using this method is 
shown to display the effects of secondary resonance in the form of an additional peak at 
about three times the primary resonance frequency. Extensions of equivalent linearization 
procedures to allow for non-Gaussian nature of the response have also been 
proposed. Thus, Manohar & Iyengar (1990) considered the broad band excitation of 
the Van der Pol oscillator and replaced the nonlinear oscillator by a linear system 
excited by a non-Gaussian input. The non-Gaussian excitation allowed for the limit 
cycle oscillations of the system in the absence of the external noise. This enabled the 
correct prediction of bimodal pdf of the response displacement and velocity which the 
traditional linearization fails to predict (Zhu & Yu 1987). Pradlwarter et al (1988), 
Pradlwarter (1989) and Schueller et al (1991) have considered stochastic response of 
inelastic systems and have proposed a non-Gaussian linearization scheme which is 
based on the theoretical results shown by Kozin (1987) that linear systems exist which 
lead, at least for white noise excitations, exactly to the first statistical moments of the 
response of the respective true nonlinear systems. The method involves nonlinear 
transformation between nonlinear response and the linear response which has to be 
chosen based on physical considerations. Iyengar (1992) considered Duffing’s oscillator 
under narrow band excitation and developed an equivalent linear system whose stiffness 
parameter is random in nature. This parameter is shown to be a function of response 
envelope which is approximated as a random variable. While the response conditioned on 
the stiffness is Gaussian, the unconditioned response becomes non-Gaussian in nature. 







4.2 Nonuniqueness of solutions and stochastic stability 

The solution obtained using equivalent linearization is not necessarily unique in more 
than one sense. Firstly, the answers depend on the criterion of equivalence adopted. While 
the most commonly used criterion requires that the mean square equation difference be 
minimized, other criteria involving alternative norms of differences or other averaging 
operators are also admissible (Spanos 1981a; Bolotin 1984). Recently Elishakoff 
& Zhang (1993) have applied several optimization schemes on a randomly driven first 
order nonlinear system and have shown that, when the differences are averaged with 
respect to a weight function, which, in turn, is taken to be a nonlinear function of the 
system potential energy, the estimate of the response variance coincides with the known 
exact solution. Casciati et al (1993) have considered second order nonlinear systems 
under broad band excitation and established the equivalence by requiring that the 
upcrossing rate of a specified critical level for the nonlinear and for the equivalent linear 
oscillator be equal. Obviously the success of this approximation depends upon the 
accuracy with which the upcrossing statistics are known. 

Another source of nonuniqueness, which is perhaps more subtle, arises within the 
framework of a specified equivalence criterion. Thus, the linearization technique when 
applied to nonlinear systems under combined harmonic and noise excitations (Iyengar 
1986; Manohar & Iyengar 1991a), narrow band excitations (Richard & Anand 1983; 
Davies & Nandlall 1986; Davies & Rajan 1986; Jia & Fang 1987; Iyengar 1988b, 1989; 
Manohar & Iyengar 1991b; Roberts 1991; Koliopulos & Langley 1993) or for systems 
with multiple stable equilibrium states under broad band excitations (Langley 1988b; 
Fan & Ahmadi 1990) leads to multivalued response statistics. It may be noted in this 
context that Spanos & Iwan (1978) have earlier demonstrated the uniqueness of 
equivalent linear systems under certain conditions but not of the solutions generated by 
the equivalent systems. 

The occurrence of multivalued response statistics apparently resembles the coexistence 
of multiple steady states encountered in the deterministic nonlinear oscillation 
problems. In random vibration context, however, it is important to note that stationary 
response statistics, when they exist, are necessarily unique. This follows from the fact 
that the steady solution of the governing FPK equation is always unique (Fuller 1969). 
The scope of this result includes all the nonlinear dynamical systems which are 
governed by equations of the form as given in (1) and, as has been noted in § 2, this class 
is fairly extensive. This fact was not recognized in some of the earlier studies on narrow 
band excitation of the Duffing oscillators (Davies & Nandlall 1986; Davies & Rajan 
1986; Jia & Fang 1987). These authors used a stability analysis of moment equations 
and concluded that response statistics are multivalued and display the jump behaviour. 
Iyengar (1986, 1988b, 1989) suggested that the realisability of the multiple solutions 
must be decided based on the almost sure stochastic stability of the multiple solutions 
and not on the stability analysis of moment equations. His study showed that in regions 
of multiple solutions, the linearization solution based on the assumption of Gaussianness 
of the response is stochastically unstable and, therefore, not valid. Furthermore, 
simulation studies on response amplitude showed that the probability density function 
is bimodal in regions where linearization predicts multiple solutions. The studies by 
Langley (1988b) and Fan & Ahmadi (1990) on broad band excitation of nonlinear 
systems show that mean square response predicted by the linearization technique may 
not be unique while the corresponding exact solutions are unique. The system 
considered in these studies had multiple stable equilibrium states and the random 
response had multimodal probability density functions. The question of possible 
relationship between the multiple solutions predicted by linearization and local 
behaviour of the sample functions has been considered by a few authors (Dimentberg 
1988b; Roberts 1991; Koliopulos & Langley 1993). It is suggested that the multiple 
solutions correspond to local behaviours near the modes of the response pdf. The 
relationship between such local behaviour and global behaviour in an ensemble sense is 
not obvious and further research is clearly required to resolve this issue. 


5. Equivalent nonlinearization 

This method is conceptually similar to the method of equivalent linearization and can 
be viewed as a generalization leading to non-Gaussian estimates for the response. The 
method has been introduced by Caughey (1986). It consists of replacing the given 
nonlinear system by an equivalent nonlinear system which belongs to the class of 
problems which can be solved exactly. This method is related to the class of exactly 
solvable FPK equations and thus is applicable only to systems under white noise 
inputs. The criterion of replacement is again the minimization of the mean square error. 
The method leads to non-Gaussian stationary response pdf and estimates correctly the 
random response of limit cycle systems, for which equivalent linearization fails. 
Cai & Lin (1988b) have developed a similar technique and have applied it to systems in 
which parametric excitations are also present. Here, the replacement oscillator belongs 
to the class of systems possessing generalized stationary potential and is selected on the 
basis that the average energy of dissipation remains unchanged. For a specific system, 
the solution obtained using this method is shown to be superior to that obtained by 
stochastic averaging. The application of the method to randomly excited hysteretic 
structures has also been developed (Cai & Lin 1990). The method has also been studied 
by Zhu & Yu (1989) who have chosen equivalent nonlinear systems which are energy 
dependent. They have indicated that the method is asymptotically exact and is 
equivalent to the method of stochastic averaging of the energy envelope. In a study on 
random response of limit cycle systems, Manohar (1989) has developed equivalent 
nonlinearization solutions for the randomly driven Van der Pol oscillator. In one 
scheme, the Van der Pol oscillator under white noise excitation is replaced by a Van der 
Pol-Rayleigh oscillator which can be solved exactly. This involves linearization of only 
a part of the original equation and the equivalent parameters are found based on 
minimization of mean square error. In an attempt to model the multimodal pdf of the 
response phase process, Manohar has applied a similar partial linearization procedure 
to the simplified equations for the response amplitude and phase processes which, in 
turn, were obtained using a second order stochastic averaging procedure. Furthermore, 
the same procedure was also used to investigate the effect of noise on frequency 
entrainment of harmonically driven Van der Pol oscillator (Manohar & Iyengar 1991a) 
and in the study of rocking of rigid blocks under random base motions (Iyengar & 
Manohar 1991). The scheme of partial linearization has also been studied recently by 
Elishakoff & Cai (1993). To & Li (1991) have presented a systematic equivalent 
nonlinearization procedure which is again based on the broad class of exactly solvable 
FPK equations and utilizes calculus of variations to derive the optimal replacement 
system. In a study on systems with asymmetric nonlinearities, Spanos & Donley (1991) 
and Li & Kareem (1993) have developed equivalent systems with quadratic nonlinearities. 
The evaluation of the equivalent parameters is based on an approximate analysis 
of the equivalent systems using Volterra series representations (see §8). 
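
To illustrate what is meant by a class of problems which can be solved exactly, one well-known family is the oscillator with energy-dependent damping under white noise. The notation below ($f$, $g$, $h$, $D$, $H$, $C$) is assumed for this sketch and is not taken from the works cited.

```latex
\begin{align}
  &\ddot{x} + f(H)\,\dot{x} + g(x) = w(t), \qquad
   E[w(t)\,w(t+\tau)] = 2D\,\delta(\tau), \qquad
   H = \tfrac{1}{2}\dot{x}^{2} + \int_{0}^{x} g(\xi)\,d\xi , \\
  &p(x,\dot{x}) = C \exp\!\left[-\frac{1}{D}\int_{0}^{H} f(\eta)\,d\eta\right], \\
  &\text{given } \ddot{x} + h(x,\dot{x}) + g(x) = w(t): \qquad
   \min_{f}\; E\!\left[\bigl\{h(x,\dot{x}) - f(H)\,\dot{x}\bigr\}^{2}\right].
\end{align}
```

The second line satisfies the stationary FPK equation of the first, as can be verified by direct substitution. In a scheme of this kind $f$ is taken from a parametric family, and the expectation in the third line is evaluated with respect to the non-Gaussian pdf of the second line, so that the equations for the equivalent parameters are themselves nonlinear.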


6. Closure approximations 

In nonlinear random vibration problems the equations for response moments and 
correlations form an infinite hierarchy and exact solutions are not possible. This is true 
even for the class of systems for which exact response pdf is obtainable using the FPK 
equation. The closure problem consists of approximately replacing the infinite hierarchy 
of equations with a finite set so that estimates for the important lower order 
moments can be obtained. 

6.1 Closure using assumed probability density function 

The closure approximation can be made either in conjunction with an assumed 
response pdf or directly on the moment equations. Dashevskii (1967), Assaf & Zirkle 
(1976) and Crandall (1980, 1985) have employed a series representation for response pdf 
in terms of the Hermite polynomials. The series is truncated after a finite number of 
terms. The first term in the series has the form of a Gaussian pdf. The unknown 
coefficients of the series are related to the response cumulants, central moments or 
expectations of the Hermite polynomials in response variables. The equations needed 
to determine these coefficients are generated from the governing equation of motion. 
This procedure can be viewed as the generalization of equivalent linearization technique 
wherein the response was assumed to be Gaussian and the parameters in the 
distribution were determined using moment identities derived from equations of 
motion. Liu & Davies (1988) have applied Hermite polynomial approximation for the 
pdf and studied the nonstationary response of nonlinear second-order systems. Furthermore, 
the same authors (Davies & Liu 1992) have also studied the power spectrum 
of Duffing’s oscillator using a similar procedure. Iyengar (1975) and Iyengar & Dash 
(1976, 1978) have developed a closure technique in which the response variables and 
input variables, either as they appear or after a transformation, are assumed to be 
jointly Gaussian. It is possible in this formulation to take into account non-Gaussian 
excitations and amplitude limited responses. The method handles nonlinear and 
stochastically time varying systems in a unified manner (Dash & Iyengar 1982). 

6.2 Closure in terms of moments or cumulants 

In the second approach, one directly deals with moment equations. Here the unknown 
higher order moments are approximated as functions of lower order moments, thereby 
truncating the hierarchy of these equations. Thus, Ibrahim (1978), Ibrahim et al (1985), 
Bolotin (1984) and Wu & Lin (1984) have considered different schemes for closing the 
hierarchy of moment equations. Appeal is generally made to the quasinormal approximation 
which connects the higher order moments to the lower order moments through 
relations that are strictly valid only for Gaussian variables. This can be expressed in 
terms of either the direct moments, central moments or cumulants. The quasinormal 
approximation using cumulants amounts to setting cumulants beyond a given order to 
zero. For a Gaussian random variable, it may be recalled, all the cumulants beyond 
order two vanish. Other closure schemes such as discarding the direct or central 
moments beyond an order or ignoring the correlations among the response variables 
have also been proposed (Soong 1973). Bellman & Richardson (1968), Wilcox & 
Bellman (1970) and Sancho (1970) have used a mean square closure technique in which 
the unknown higher order moment is expressed as an optimal linear combination of 
lower order moments. 
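
The cumulant form of the closure can be sketched on an assumed test problem, the first order system x' + a (x + eps x^3) = w(t) with E[w(t) w(t+tau)] = 2 D delta(tau) and s = D / a. Its stationary moment equations are m2 + eps m4 = s and m4 + eps m6 = 3 s m2, where m_n = E[x^n]. Neglecting cumulants beyond order two gives m4 = 3 m2^2; neglecting those beyond order four gives m6 = 15 m2 m4 - 30 m2^3 for a zero-mean response. The problem, symbols and function names are illustrative assumptions.

```python
import math

def gaussian_closure(eps, s=1.0):
    """Second-order cumulant neglect: m4 = 3 m2^2 closes the m2 equation (eps > 0)."""
    # Stationary m2 equation: m2 + eps * m4 = s, with s = D / a.
    return (-1.0 + math.sqrt(1.0 + 12.0 * eps * s)) / (6.0 * eps)

def fourth_order_closure(eps, s=1.0):
    """Fourth-order cumulant neglect: m6 = 15 m2 m4 - 30 m2^3 (eps > 0).

    Returns (m2, m4).
    """
    # Eliminating m4 = (s - m2) / eps from m4 + eps * m6 = 3 s m2 leaves a
    # cubic in m2. Its coefficients change sign once, so it has exactly one
    # positive root, and that root lies in (0, s).
    def cubic(m2):
        return (30.0 * eps ** 2 * m2 ** 3 + 15.0 * eps * m2 ** 2
                + (1.0 - 12.0 * s * eps) * m2 - s)

    lo, hi = 0.0, s
    for _ in range(200):                 # plain bisection
        mid = 0.5 * (lo + hi)
        if cubic(mid) > 0.0:
            hi = mid
        else:
            lo = mid
    m2 = 0.5 * (lo + hi)
    return m2, (s - m2) / eps

print(gaussian_closure(1.0), fourth_order_closure(1.0))
```

For this particular problem the exact stationary pdf is known, so the two estimates of m2 can be compared with the exact mean square obtained by quadrature; no such check is available in general, which is the difficulty taken up in the next subsection.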

6.3 Limitations and improvements 

The closure methods are applicable to a wide class of systems and excitations. They 
have been extensively used in the response and stability analysis of sdof and mdof 
systems with parametric and nonparametric excitations. However, it has not been 
possible to justify closure approximations through analytical approaches. Most of the 
closure schemes are theoretically inconsistent at some level as they violate well-known 
identities and inequalities of probability theory. Thus, for example, the moment closure 
scheme of setting direct moments beyond order n to zero violates the inequality 
$E[x^{2n}] \geq (E[x^{n}])^{2}$. The cumulant closure violates the theorem due to Marcinkiewicz 
(Gardiner 1983) which states that the cumulant generating function cannot be a polynomial 
of order higher than two, that is, either all but the first two cumulants vanish or 
there are an infinite number of nonvanishing cumulants. 
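
The moment inequality referred to above follows in one line from the non-negativity of a variance; the notation is the usual one and is assumed here for illustration.

```latex
E[x^{2n}] - \left(E[x^{n}]\right)^{2}
  = E\!\left[\left(x^{n} - E[x^{n}]\right)^{2}\right]
  = \operatorname{Var}\!\left(x^{n}\right) \;\geq\; 0 .
```

A scheme that retains a nonzero $E[x^{n}]$ while setting $E[x^{2n}] = 0$ therefore assigns a negative variance to $x^{n}$, which no probability distribution can produce.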

Bellman & Richardson (1968) have derived a condition under which the truncated 
equations obtained using the mean square closure technique preserve moment 
properties. No similar results are available for other closure schemes. Although it has 
been demonstrated with a specific example that the accuracy of a closure scheme 
systematically improves as the order of approximation is increased, examples to 
counter this are also readily available (Crandall 1985). Instances of the estimated pdf 
becoming negative have also been encountered (Crandall 1985). Sun & Hsu (1987) have 
applied second, fourth, and sixth-order cumulant neglect schemes to a specific problem 
for which the exact stationary solution is available and they have delineated regions where 
the schemes yield acceptable results. In certain parameter regions the fourth- and 
sixth-order schemes are shown to predict erroneous behaviour including a faulty jump 
in the response. Fan & Ahmadi (1990) have considered a system with multiple stable 
equilibria driven by white noise excitation. They have shown that the stationary 
response statistics generated by Gaussian closure and non-Gaussian cumulant neglect 
closure techniques are not unique and are dependent on initial conditions. This 
contradicts the uniqueness property of the stationary solutions of the FPK equation 
(see §4.2). Pawleta & Socha (1992) have compared approximate nonstationary solutions 
obtained using closure approximations with the corresponding exact solutions 
for the case of parametrically excited linear systems and have shown that near stability 
boundaries the approximations are not acceptable. 

The second-order cumulant neglect and the Gaussian closure techniques are similar 
to equivalent linearization method and are consistent closure schemes. However, they 
yield acceptable results only when the response has features of Gaussian variables. For 
example, when applied to random vibration of self-excited systems (Bolotin 1984) they lead 
to drastically wrong results (Zhu & Yu 1987; Manohar & Iyengar 1990) (see §4.1). Recently 
Grigoriu (1991) has developed a consistent closure procedure which is based on an 
estimator of the response pdf that consists of superposition of specified kernels weighted by 
undetermined parameters. These unknown parameters are determined based on the 
criterion that the moment equations are optimally satisfied up to a specified closure level. 

7. Stochastic averaging methods 

In these methods the response of lightly damped systems to broad band excitation is 
approximated by a diffusion process. The coefficients of the associated FPK equation are 
derived based on an appropriate averaging of the equations of motion. The appeal of these 
methods lies in the fact that they often reduce the dimensionality of the problem and 
significantly simplify the solution procedures. On account of this advantage they are also 
applied to systems wherein the response is already Markov. Different versions of the 
method are available and are widely used in the problems of response prediction, stability 
analysis and the first passage and fatigue failure analyses. Extensive surveys of related 
literature have been published (Ibrahim 1985; Roberts & Spanos 1986; Roberts 1986a; 
Zhu 1988). 

7.1 Averaging of amplitude and phase 

The method was originally proposed by Stratonovich (1963, 1967) as a generalization 
of the deterministic averaging method developed earlier by Bogoliubov and Mitropolsky 
(1961). He considered sdof nonlinear systems under random excitation and showed that 
when the relaxation time of the oscillator is large compared to the correlation time of the 
excitation, the response can be approximated by a diffusion Markov process. Subsequently, 
Khasminskii (1966) provided a rigorous mathematical proof for Stratonovich’s 
arguments. The necessary requirements for applying the method are satisfied if the system 
is lightly damped and the excitation power spectrum is slowly varying in the neighbourhood 
of the system’s natural frequency. The response in such a case will be a narrow band 
process with slowly varying amplitude and phase. The averaging procedure is a combination 
of temporal and ensemble averaging and it aims at eliminating rapid oscillations from 
the dominant slowly varying components and also at replacing randomly fluctuating 
components by equivalent delta correlated processes. This results in a pair of Ito differential 
equations for amplitude and phase which will have to be analysed using the FPK equation. 
In many cases the equation for amplitude gets uncoupled from that of the phase thus 
enabling the determination of the stationary distribution of the amplitude process. In fact 
this is the main advantage of this method. In order to determine the stationary pdf of 
displacement and velocity variables, the knowledge of joint pdf of amplitude and phase is 
essential. But it is in general difficult to obtain this pdf. However, under the assumption that 
amplitude and phase are independent it is still possible to obtain an approximation to the pdf 
of displacement and velocity. 
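
The uncoupled amplitude equation and its stationary distribution can be sketched as follows. The sketch assumes an oscillator of unit natural frequency under white noise with E[w(t) w(t+tau)] = 2 D delta(tau), for which the averaged amplitude equation takes the form dA = [drift(A) + D / (2 A)] dt + sqrt(D) dB, where drift(A) is the averaged damping contribution. Linear damping 2 zeta x' gives drift(A) = -zeta A, and the Van der Pol damping -eps (1 - x^2) x' gives drift(A) = (eps / 2) A (1 - A^2 / 4). The function and variable names are illustrative.

```python
import math

def stationary_amplitude_pdf(drift, D, a_max, n=4001):
    """Stationary pdf of dA = [drift(A) + D / (2 A)] dt + sqrt(D) dB on [0, a_max].

    The stationary FPK solution is p(A) = C * A * exp((2 / D) * int_0^A drift).
    Returns (grid, pdf) as lists, normalized with the trapezoidal rule.
    """
    h = a_max / (n - 1)
    grid = [i * h for i in range(n)]
    # Cumulative trapezoidal integral of the averaged damping drift.
    potential = [0.0]
    for i in range(1, n):
        potential.append(potential[-1]
                         + 0.5 * h * (drift(grid[i - 1]) + drift(grid[i])))
    peak = max(potential)                    # shift so that exp never overflows
    pdf = [a * math.exp(2.0 * (u - peak) / D) for a, u in zip(grid, potential)]
    norm = h * (sum(pdf) - 0.5 * (pdf[0] + pdf[-1]))
    return grid, [p / norm for p in pdf]

# Van der Pol oscillator x'' - eps (1 - x^2) x' + x = w(t) with weak noise.
eps, D = 0.5, 0.005
vdp_grid, vdp_pdf = stationary_amplitude_pdf(
    lambda a: 0.5 * eps * a * (1.0 - 0.25 * a * a), D, a_max=4.0)
vdp_mode = vdp_grid[vdp_pdf.index(max(vdp_pdf))]
print("most probable amplitude:", vdp_mode)
```

With linear damping the same routine returns a Rayleigh density, while for the Van der Pol case with weak noise the density concentrates near the deterministic limit cycle amplitude of 2, a feature that a Gaussian approximation for the displacement cannot represent.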

Stratonovich used this method to examine the response of sdof self-excited systems 
to parametric and nonparametric excitations. Subsequently, the method has been 
generalized to include mdof systems and nonstationary inputs and widely used in 
random vibration studies (Roberts & Spanos 1986). The method has also formed the 
basis for the study of first passage failures (Roberts 1986a) and stability analysis 
(Ibrahim 1985). For problems wherein the time varying nature of the system such as 
deterministic excitations or nonstationary inputs needs to be preserved, Lin (1986) has 
proposed that the temporal averaging in Stratonovich’s procedure may be dispensed 
with. Heuristic arguments for relaxing restrictions on time constants of input and 
response for problems of stability analysis have also been given. 

7.2 Quasistatic averaging 

A variation of the standard stochastic averaging method, known as the method of 
quasistatic averaging, has also been developed by Stratonovich (1967). The method is 
applicable to problems in which the correlation time of the excitation greatly exceeds 
the relaxation time of the system. This requirement is contrary to the one stipulated for 
the applicability of the standard stochastic averaging method. The method consists of 
only temporal averaging. The ensemble averaging with its attendant Markovian 
approximation is dispensed with. In applying this method, envelope representation is 
used for both the input and the response processes. During temporal averaging the 
amplitude and phase angle are approximated as random variables and hence as 
constants. This finally leads to a nonlinear memoryless transformation relating the 
input and the output amplitudes and phase angles. Thus the solution of the given 
random differential equation is converted to a problem in nonlinear transformation of 
random variables. The method has been used in the study of nonlinear systems under 
narrow band excitations by several authors (Lennox & Kuak 1976; Sato et al 1985; 
Richard & Anand 1983; Iyengar 1986). 

7.3 Averaging of energy envelope 

The stochastic averaging method is found to give acceptable results for systems with 
nonlinear damping. In fact when damping is amplitude dependent and the excitation is 
white noise, the method leads to the known exact solutions (Roberts 1976). However, for 
a system with nonlinear stiffness, such as Duffing’s oscillator, the solution does not display 
the effects of nonlinearity. In such cases a higher order averaging procedure needs to be 
used (Stratonovich 1967; Ibrahim 1985). This, however, involves cumbersome calculations. 
A simpler alternative is to examine whether a one dimensional Markovian approximation 
can be obtained for the energy envelope of the response. This was originally proposed by 
Stratonovich (1963) who considered systems under white noise inputs and reduced the two 
dimensional Markovian vector consisting of a slowly varying energy envelope and 
a rapidly varying displacement component to a one dimensional Markovian approximation 
for the energy envelope. This method has further been developed by Roberts 
(1976, 1978) and generalized to incorporate nonwhite inputs (Roberts 1982), parametric 
excitations (Zhu 1983) and nonstationary inputs (Red-Horse & Spanos 1992). Here, the 
averaging is carried out over a period equal to the undamped natural period of the system, 
which now depends on the energy of the response. The results obtained using this method 
also agree with the available exact solutions. Zhu & Lin (1991) and Zhu et al (1994) have 
considered systems with correlated Gaussian excitations and have included the 
additional contributions to damping and stiffness made by the Wong-Zakai correction 
terms. These additions are incorporated into the definition of the energy envelope and 
the consequent new results are shown to be improvements over earlier averaging 
results. Another version of stochastic averaging is also available (Sunahara et al 1977). 
In this a deterministic averaging is carried out directly on the coefficients of the 
governing FPK equation. This method has been shown to be equivalent to the 
averaging of the amplitude or the energy envelope of the response (Zhu 1988). 
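
The energy dependence of the averaging period can be made concrete for an assumed example, the oscillator x'' + c x' + x + eps x^3 = w(t) with E[w(t) w(t+tau)] = 2 D delta(tau). For this case the exact joint pdf is proportional to exp(-c H / D), with H the total energy, and the corresponding energy density is proportional to T(E) exp(-c E / D), where T(E) is the undamped period at energy E. The sketch below evaluates T through the displacement amplitude b of the undamped orbit; all names are illustrative.

```python
import math

def duffing_period(b, eps, n=2000):
    """Undamped period of x'' + x + eps x^3 = 0 at displacement amplitude b.

    With x = b sin(theta) the endpoint singularity of the period integral
    cancels, leaving
        T = 4 * int_0^{pi/2} dtheta / sqrt(1 + (eps b^2 / 2) (1 + sin^2 theta)).
    """
    h = 0.5 * math.pi / n
    total = 0.0
    for i in range(n):                       # midpoint rule
        s = math.sin((i + 0.5) * h)
        total += 1.0 / math.sqrt(1.0 + 0.5 * eps * b * b * (1.0 + s * s))
    return 4.0 * h * total

def energy(b, eps):
    """Total energy on the undamped orbit of amplitude b."""
    return 0.5 * b * b + 0.25 * eps * b ** 4

def energy_density(b, eps, c, D):
    """Unnormalized stationary pdf of the energy envelope, T(E) exp(-c E / D)."""
    return duffing_period(b, eps) * math.exp(-c * energy(b, eps) / D)

for b in (0.5, 1.0, 2.0):
    print(b, energy(b, 1.0), duffing_period(b, 1.0), energy_density(b, 1.0, 0.1, 0.1))
```

For eps = 0 the period reduces to 2 pi at every energy, and the density becomes the exponential distribution of the linear case; for eps > 0 the period shortens as the energy grows, which is how the stiffness nonlinearity enters the averaged equation.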

7.4 Combination of averaging with other methods 

The method of stochastic averaging has also been used in combination with other 
methods of random vibration analysis. Thus, Iwan & Spanos (1978) proposed a combination 
of equivalent linearization and stochastic averaging to analyze systems with 
nonlinear stiffness. For the case of the Duffing oscillator under white noise input, the 
method improves the results obtained using averaging of response amplitude but does 
not lead to the exact solutions. Furthermore, Ariaratnam (1978) has questioned the 
consistency of approximations made in this analysis. Stratonovich (1967) has used 
equivalent linearization technique to solve simplified equations obtained using 
stochastic averaging. Bruckner & Lin (1987b) have adopted a complex form of 
stochastic averaging which eases the application of non-Gaussian closure technique to 
the simplified equations and is particularly useful in analyzing nonlinear mdof systems. 
In the study of nonlinear systems under combined harmonic and random excitations or 
when a higher order averaging in Cartesian co-ordinates is done, the resulting 
simplified equations do not get uncoupled, and, in general, are unsolvable within the 
framework of the Markov process theory. Under such situations Manohar & Iyengar 
(1990, 1991a) have proposed combining averaging with equivalent nonlinearization 
technique. This procedure is shown to give satisfactory results for the case of Van der 
Pol’s oscillator under broad band and combined harmonic and white noise excitations. 

7.5 Method of stochastic normal forms 

An alternative way of reducing the dimensionality of the problem using modern 
bifurcation theories, viz, center manifold theory (Guckenheimer & Holmes 1983), has 
been developed by Sri Namachchivaya & Lin (1991). The method consists of eliminating 
certain response variables which are asymptotically stable as being unimportant 
with the essential behaviour of the system restricted to the dynamics of the remaining 
critical variables. The differences between this method and the traditional averaging 
arise in carrying out the ‘temporal’ part of the averaging, while the ensemble averaging, 
with the consequent Markovian approximation, remains essentially the same. In fact, 
the equivalence of this method with a higher order stochastic averaging has been 
demonstrated (Sri Namachchivaya & Leng 1990). The approach has been employed in 
the study of the effects of noise on bifurcations in nonlinear systems and for specific 
cases, the method is shown to be more generally applicable than the stochastic 
averaging (Sri Namachchivaya 1991; Leng et al 1992). 

7.6 System stochasticity problems 

Although the averaging methods are widely used in vibration problems, the idea of 
applying them to problems of spatial variability is novel. Thus, the usefulness of the 
averaging method in the study of stochastic boundary value problems has been 
investigated by Manohar & Iyengar (1993, 1994) in the context of the determination of 
the eigensolutions of stochastic wave equations. Here, the given boundary value 
problem is converted into a set of initial value problems and, these are, in turn, 
simplified by averaging over spatial domain. The results obtained on the pdf of the 
eigensolutions using this approximation are found to compare very well with digital 
simulation results. 

7.7 Summary 

The methods of stochastic averaging enhance the scope of the FPK equation approach 
in random vibrations. The different versions of this method are mathematically well 
founded (Zhu 1988). This is in contrast to other approximate techniques discussed 
earlier. The other merit of these methods is that they lead to non-Gaussian estimates for 
the response. 


8. Stochastic series solutions 

A widely used method in deterministic problems is the one based on the representation 
of the solution in an infinite series. Here an unknown function is expanded in a set of 
known functions. A few studies based on the extension of this concept to stochastic 
problems are available in random vibration literature. Thus, Iyengar & Dash (1976) 
have considered a linear sdof system with both parametric and external random 
excitations. The parametric excitation is taken to be a nonwhite process. The response 
is expanded in a power series in the random coefficient process. The unknown 
coefficients in this series are taken to be deterministic and are determined based on the 
minimization of the mean square error in an interval of time. Ahmadi and his co-workers
(Ahmadi 1980a; Jahedi & Ahmadi 1983; Orabi & Ahmadi 1987a, 1987b, 1988)
have used the Wiener-Hermite functions in the study of the Duffing oscillator
under stochastic excitation. These functions are a set of statistically orthogonal 
functions and form a complete random basis for expanding a given random process. 
See the book by Schetzen (1980) for a systematic account of the Volterra and Wiener 
theories of nonlinear systems. In the solution both the input and the response processes 
are expanded in a set of these functions. The orthogonality property further leads 
to a set of nonlinear coupled integro-differential equations for the unknown kernels in 
the expansion. These equations have been further solved using an iterative technique. 
For a Gaussian random process the series consists of only one term and thus the higher 
order terms in the series are non-Gaussian corrections. Thus the method systematically 
leads to non-Gaussian estimates for response statistics. However, it is not possible to 
obtain the expressions for the non-Gaussian pdf with this method. Using a single term 
in the expansion has been shown to be equivalent to the technique of equivalent 
linearization (Ahmadi & Orabi 1987). Recently, Ghanem & Spanos (1993) have studied 
random response of second order nonlinear systems using series expansions for both 
the excitation and response processes which consist of unknown deterministic 
functions of time which are weighted by a known set of orthogonal random variables.
The excitations are represented exactly using the Karhunen-Loeve expansions.
Following the concept of Galerkin expansions, the response is also expanded using the
same basis random variables which are used for representing the excitation. This leads 
to a set of deterministic ordinary differential equations for the unknown functions 
which can be solved numerically. The method has been applied to the response analysis 
of Duffing’s oscillator subjected to filtered white noise excitation yielding satisfactory 
results. 
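
The statistical orthogonality that underlies such expansions can be illustrated in the simplest scalar setting. The sketch below is not the Wiener-Hermite functional expansion of the cited studies; it assumes a single standard Gaussian variable Z and the probabilists' Hermite polynomials He_n, and checks numerically that E[He_m(Z) He_n(Z)] = n! when m = n and 0 otherwise. The function names and the trapezoidal quadrature are illustrative choices.

```python
import math


def hermite_he(n, x):
    """Probabilists' Hermite polynomial He_n(x) from the three-term recurrence."""
    h_prev, h = 1.0, x
    if n == 0:
        return h_prev
    for k in range(1, n):
        # He_{k+1}(x) = x He_k(x) - k He_{k-1}(x)
        h_prev, h = h, x * h - k * h_prev
    return h


def gaussian_expectation(func, half_width=12.0, steps=24000):
    """E[func(Z)] for a standard Gaussian Z, by the trapezoidal rule."""
    dx = 2.0 * half_width / steps
    total = 0.0
    for i in range(steps + 1):
        x = -half_width + i * dx
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * func(x) * math.exp(-0.5 * x * x)
    return total * dx / math.sqrt(2.0 * math.pi)


def inner_product(m, n):
    """Statistical inner product E[He_m(Z) He_n(Z)]."""
    return gaussian_expectation(lambda x: hermite_he(m, x) * hermite_he(n, x))


def projection_coefficient(func, n):
    """Coefficient of He_n in the expansion of func(Z): E[func He_n] / n!."""
    moment = gaussian_expectation(lambda x: func(x) * hermite_he(n, x))
    return moment / math.factorial(n)
```

For instance, projecting the cubic z**3 onto He_1 and He_3 recovers z^3 = 3 He_1(z) + He_3(z); the He_1 term is the Gaussian part and the He_3 term is the non-Gaussian correction, which mirrors the role of the higher order terms described above.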


9. Digital simulation technique 

For problems which are beyond the reach of exact or approximate analysis, the digital 
simulation technique forms the only means of solution. This technique is also useful in 
checking the validity of approximate analysis procedures. Here one follows a sample 
function approach in which the problem is handled largely in a time domain deterministic
framework. The application of this method consists of three steps: (i) simulation of
random inputs, (ii) discretization of the stochastic model and generation of response, 
and (iii) statistical processing of samples of response. Thus, at every stage, the method 
requires the availability of efficient computers. The basic tool for generating random 
inputs is the pseudo-random number generator which is a deterministic algorithm that 
produces a set of numbers which are statistically indistinguishable from uniformly 
distributed random numbers (Chambers 1967). Scalar and vector random variables of 
specified distributions can be obtained by suitably transforming the uniformly distributed
random numbers (Ripley 1987). Comprehensive reviews on generation of random
processes and fields using spectral representations and discrete time series models are 
available respectively, in the works of Shinozuka & Deodatis (1991) and Spanos & 
Mignolet (1989). In the response analysis numerical schemes such as Runge-Kutta or 
predictor-corrector algorithms are used to solve the equations of motion. The method 
has been widely used in nonlinear response analysis (see, for example, Lutes & Shah
1973; Vaicaitis et al 1974; Spanos 1980a, 1981b; Zhu et al 1993; Manohar & Iyengar
1990) and in first passage problems (Crandall et al 1966; Roberts 1976; Pi et al 1971;
Spanos 1983; Iyengar & Manohar 1991). It has also been extensively used in problems 
of system stochasticity, parametric excitations and stochastic stability analyses 
(Shinozuka & Deodatis 1991). 
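
The three steps listed above can be sketched for the simplest test case. The example below is an illustration only: it assumes a linear sdof oscillator x'' + 2*zeta*omega*x' + omega^2*x = sigma*xi(t) driven by unit-intensity white noise, a semi-implicit Euler discretization, and ergodic time averaging of one long sample. The function names, step size and run length are hypothetical choices, and the exact stationary mean square sigma^2/(4*zeta*omega^3) is used only as a check.

```python
import math
import random


def simulate_mean_square(omega=1.0, zeta=0.5, sigma=1.0, dt=0.01,
                         n_steps=300_000, burn_in=20_000, seed=1):
    """Estimate the stationary E[x^2] of a linear oscillator by simulation."""
    rng = random.Random(seed)
    sqrt_dt = math.sqrt(dt)
    x, v = 0.0, 0.0
    sum_sq, count = 0.0, 0
    for step in range(n_steps):
        # Step (i): simulate the random input as a Wiener increment.
        d_w = rng.gauss(0.0, 1.0) * sqrt_dt
        # Step (ii): advance the discretized equation of motion.
        v += (-2.0 * zeta * omega * v - omega * omega * x) * dt + sigma * d_w
        x += v * dt
        # Step (iii): statistical processing (time average after a transient).
        if step >= burn_in:
            sum_sq += x * x
            count += 1
    return sum_sq / count


def exact_mean_square(omega, zeta, sigma):
    """Exact stationary E[x^2] for the linear oscillator under white noise."""
    return sigma ** 2 / (4.0 * zeta * omega ** 3)
```

With the default parameters the exact value is 0.5, and the simulated estimate should agree to within the statistical scatter of a single run of this length.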

The simulation technique has vast scope and is uniformly applicable to nonlinear 
and parametric response analysis of sdof and mdof systems. However, in order to
obtain reliable estimates of response variables, sufficiently large samples
should be used in the analysis. This makes the method computationally expensive.
This is particularly true in response analysis involving estimation of rare events 
and in the study of large mdof systems. The method is well suited for stationary 
response analysis where assumption of ergodicity is admissible. Spanos (1981a) has 
estimated that the cost of simulation studies is typically 100 to 1000 times that of an 
approximate analysis using equivalent linearization. It has been found that the cost of 
simulation increases linearly with sample size while the accuracy improves in proportion
to the square root of sample size (Spanos & Lutes 1987). Nevertheless, given the
strides made over the last few years in computer technology, one can readily foresee the
broadening of the scope of digital simulation techniques in engineering stochastic 
analyses. 
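
The square-root relation between sample size and accuracy can be seen in a small experiment. The sketch below assumes the quantity being estimated is the mean square of a standard Gaussian variable and measures the scatter of repeated estimates; quadrupling the sample size should roughly halve the scatter. The function name, replication count and seeds are illustrative assumptions.

```python
import random
import statistics


def spread_of_estimates(sample_size, replications=300, seed=0):
    """Standard deviation of repeated Monte Carlo estimates of E[Z^2]."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replications):
        total = 0.0
        for _ in range(sample_size):
            z = rng.gauss(0.0, 1.0)
            total += z * z
        estimates.append(total / sample_size)
    return statistics.stdev(estimates)


def scatter_ratio(small=100, factor=4):
    """Ratio of scatter at sample size `small` to scatter at `factor * small`."""
    return spread_of_estimates(small, seed=1) / spread_of_estimates(factor * small, seed=2)
```

A call to `scatter_ratio()` should return a value near 2, that is, four times the cost buys about twice the accuracy.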


10. Summary and conclusions

Various methods for stochastic response analysis have been outlined in the previous 
sections. Exact solutions are obtainable from the FPK equation approach but are 
scarce. It is generally necessary to take recourse to one of the several approximate 
procedures available. Many of the approximations are based on the assumption that 
the response process is nearly Gaussian distributed and/or is Markovian or nearly 
Markovian in nature. The approximate techniques based on the FPK equation are 
applicable to Markovian responses and are largely confined to lower order systems. 
Perturbation methods are applicable to weakly nonlinear systems under weak
stochastic inputs. They are useful in getting non-Gaussian estimates for response
moments but are cumbersome and relatively inefficient. Equivalent linearization is
useful for systems with Gaussian inputs and is applicable over a wide range of 
nonlinearity. The method leads to Gaussian estimates for the response and hence is 
suited for problems where the excitation and the system are such that the response is 
unimodal and nearly Gaussian. When applied to systems with multimodal response 
probability densities, the method leads to nonunique response statistics and requires 
careful interpretation. The method is not applicable for evaluating parametric
responses. Equivalent nonlinearization methods are based on the class of exactly solvable
FPK equations and are limited to systems driven by white noise excitations. The 
closure schemes are an improvement over equivalent linearization as they can take into 
account parametric excitations and can lead to non-Gaussian estimates for the 
response. The methods are, however, not mathematically well founded. The perturbation,
equivalent linearization and closure methods are all fairly general and are
applicable to both transient and steady state analysis of sdof and mdof systems.
Stochastic averaging methods are applicable to lightly damped systems with parametric
and external broad band inputs. They are mathematically well founded and in the
case of systems under white noise for which the FPK equations are solvable, the 
methods lead to the exactly known solutions. These methods reduce the dimensionality 
of the problem and widen the scope of the FPK equation approach. Their applicability, 
however, is largely limited to sdof systems. The digital simulation technique is universally
applicable and leads to estimates of the response to any desired level of accuracy. The
method relies on the availability of a fast computer and is quite expensive, especially in
the study of large scale systems and in the analysis of rare events.

The above methods have been applied in the past to study a variety of nonlinear 
problems such as structures undergoing large amplitude vibrations, yielding systems, 
self-excited systems, hysteretic systems, vibroimpact systems and rocking of blocks. 
The development of these methods is characterized by two conflicting objectives.
On the one hand, the methods are expected to be viable when applied to large-scale
engineering structures, while, on the other hand, they need to capture correctly the
qualitative behaviour of nonlinear systems. The linearization and closure methods are,
perhaps, the only feasible analytical methods which can be used in conjunction with
computational structural models for studying large scale mdof systems. One of the major
drawbacks of these methods, however, lies in their inability to capture correctly the 
interactions between equilibrium states of the unforced system and external random 
excitations. It has been noted by Andronov and others as far back as in 1933 (Bolotin 
1967; Kozin 1969) that, for systems under white noise excitations, the most probable 
response states correspond to the stable equilibrium states of the unforced system (also
see Kapitaniak 1986). Similar behaviour can be expected in the study of the random
response of systems exhibiting complicated bifurcation patterns such as those associated
with jumps, limit cycles, nonlinear resonances and entrainments. It is important
to recognize that corresponding to multiple stable solutions of the unforced system, the 
stochastic response probability density functions can be multimodal. Evidently, the 
linearization and closure techniques are ill equipped to model these nonlinear features 
satisfactorily. On the other hand, the averaging and FPK equation based approaches 
are mathematically well founded and perform well when applied to simple systems 
displaying the above mentioned complicated response patterns, but are, however, of 
limited use in analysing large scale structures. Thus, methods to overcome these 
limitations still need to be developed.


Higher order averaging method of coefficients in Fokker-Planck equation

NGUYEN DONG ANH 

Institute of Mechanics, Vien Co, 224 Doi Can, Hanoi, Vietnam 

Abstract. Two methods of integrating the first-order averaged Fokker-Planck
(FP) equation used in the theory of random oscillations are
proposed for the non-autonomous cases. Further, since the effect of some
nonlinear terms is lost during the first-order averaging procedure, the
procedures for obtaining higher approximate solutions to the FP equation
are developed. It is shown that these procedures involve the classical
first-order averaging method of coefficients in the FP equation. The Duffing and
Van der Pol oscillators are considered.

Keywords. Averaging; FP equation; random vibration; white noise. 


1. Introduction 

Over many years the well-known averaging method, originally given by Krylov and 
Bogoliubov and then developed by Mitropolskii (Bogoliubov & Mitropolskii 1963), 
has proved to be a very useful tool for investigating deterministic weakly nonlinear 
vibration problems. The advantage of this method is that it reduces the dimension 
of the response coordinates. In the field of random vibration the averaging method
was extended by Stratonovich (1963) and was given a mathematically rigorous proof by
Khasminskii (1963).

It should be noted that although the general higher order averaging procedure was
already described for deterministic differential equations (Mitropolskii 1971),
principally only the first-order averaging method has been applied in practice. Recently,
a systematic application of the first-order averaging method to arbitrary order
differential equations subjected to deterministic and random excitations was presented
(Mitropolskii et al 1992). It is well-known, however, that the effect of some nonlinear
terms such as cubic stiffness is lost during the first-order averaging procedure. Thus, the
latter is not sufficient to describe the effect of those nonlinear terms. In order to
overcome this insufficiency, the procedures for obtaining higher approximate solutions
to the Fokker-Planck (FP) equation are developed in the paper. It is shown that the
presented procedures involve, in particular, the first-order stochastic averaging method
(SAM) of coefficients in FP equation.


2. SAM of coefficients in FP equation


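Consider the equation of motion of a single-degree-of-freedom mechanical system

$$\ddot{x} + \omega^2 x = \varepsilon f(x, \dot{x}, \omega t, \varepsilon) + \sqrt{\varepsilon}\,\sigma\,\xi(t), \tag{1}$$

wherein the symbols have their customary meaning, $\sigma$ is a constant, $f$ is a nonlinear
function of $x$ and $\dot{x}$ that is periodic in $\omega t$ with period $2\pi$, and $\varepsilon$ is a small
positive parameter. The random excitation $\xi(t)$ is a Gaussian white noise process of unit intensity

$$E(\xi(t)\,\xi(t+\tau)) = \delta(\tau), \tag{2}$$

where $E(\cdot)$ denotes the expectation operator. Equation (1) may be considered as the
following system of Ito stochastic differential equations,

$$dx(t) = \dot{x}\,dt, \tag{3}$$

$$d\dot{x}(t) = [\varepsilon f - \omega^2 x]\,dt + \sqrt{\varepsilon}\,\sigma\,d\eta(t), \tag{4}$$

where $\eta(t)$ is the Wiener process and

$$E(\eta(t)\,\eta(t')) = \min\{t, t'\}. \tag{5}$$

The solution of the corresponding linear system (1), ($\varepsilon = 0$), has the form

$$x(t) = a\cos\varphi, \tag{6}$$

$$\dot{x}(t) = -a\omega\sin\varphi, \qquad \varphi = \omega t + \theta, \tag{7}$$

where $a$ and $\theta$ are constants. In the case where $\varepsilon \neq 0$, according to the classical approach
of the averaging method the state coordinates $(x, \dot{x})$ are to be transformed to the pair
$(a, \theta)$ by the change (6) and (7).

In order to obtain the amplitude and phase Ito differential equations let these
equations be written in the form

$$da(t) = K_1(a, \varphi, \omega t, \varepsilon)\,dt + \beta(a, \varphi, \omega t, \varepsilon)\,d\eta(t),$$

$$d\theta(t) = K_2(a, \varphi, \omega t, \varepsilon)\,dt + \gamma(a, \varphi, \omega t, \varepsilon)\,d\eta(t), \tag{8}$$

where $K_1, \beta, K_2, \gamma$ are unknown functions of $a$, $\varphi$, $\omega t$ and $\varepsilon$. For a given function
$F(t, a, \varphi)$ the Ito differential rule (Gikhman et al 1982) gives

$$dF = \left[\frac{\partial F}{\partial t} + (l_1 + l_2)F\right]dt + l_3 F\,d\eta(t), \tag{9}$$

wherein the operators $l_1$, $l_2$, and $l_3$ denote

$$l_1 = K_1\frac{\partial}{\partial a} + K_2\frac{\partial}{\partial\varphi}, \qquad l_3 = \beta\frac{\partial}{\partial a} + \gamma\frac{\partial}{\partial\varphi},$$

$$l_2 = \frac{1}{2}\beta^2\frac{\partial^2}{\partial a^2} + \beta\gamma\frac{\partial^2}{\partial a\,\partial\varphi} + \frac{1}{2}\gamma^2\frac{\partial^2}{\partial\varphi^2}. \tag{10}$$

Differentiating (6) with respect to time and using (9) yields

$$dx(t) = [-a\omega\sin\varphi + (l_1 + l_2)(a\cos\varphi)]\,dt + l_3(a\cos\varphi)\,d\eta(t). \tag{11}$$

On the other hand, substituting (7) into (3) gives

$$dx(t) = [-a\omega\sin\varphi]\,dt. \tag{12}$$

Further, differentiating (7) with respect to $t$ and using (9) yields

$$d\dot{x}(t) = [-a\omega^2\cos\varphi + (l_1 + l_2)(-a\omega\sin\varphi)]\,dt + l_3(-a\omega\sin\varphi)\,d\eta(t). \tag{13}$$

Comparing (11) with (12) and (13) with (4) gives the following relations

$$(l_1 + l_2)(a\cos\varphi) = 0, \tag{14}$$

$$l_3(a\cos\varphi) = 0, \tag{15}$$

and

$$(l_1 + l_2)(-a\omega\sin\varphi) = \varepsilon f(a, \varphi, \omega t, \varepsilon), \tag{16}$$

$$l_3(-a\omega\sin\varphi) = \sqrt{\varepsilon}\,\sigma, \tag{17}$$

where

$$f(a, \varphi, \omega t, \varepsilon) = f(x, \dot{x}, \omega t, \varepsilon), \quad \text{with } x = a\cos\varphi, \quad \dot{x} = -a\omega\sin\varphi. \tag{18}$$

Noting (10), the explicit forms of (15), (17) and (14), (16) are, respectively,

$$\beta\cos\varphi - \gamma a\sin\varphi = 0,$$

$$-\beta\omega\sin\varphi - \gamma a\omega\cos\varphi = \sqrt{\varepsilon}\,\sigma, \tag{19}$$

and

$$K_1\cos\varphi - K_2 a\sin\varphi = \beta\gamma\sin\varphi + \frac{1}{2}\gamma^2 a\cos\varphi,$$

$$K_1\sin\varphi + K_2 a\cos\varphi = -\beta\gamma\cos\varphi + \frac{1}{2}\gamma^2 a\sin\varphi - \frac{\varepsilon}{\omega} f(a, \varphi, \omega t, \varepsilon). \tag{20}$$

Therefore, one gets from (19) and then from (20)

$$\beta = -\frac{\sqrt{\varepsilon}\,\sigma}{\omega}\sin\varphi, \qquad \gamma = -\frac{\sqrt{\varepsilon}\,\sigma}{a\omega}\cos\varphi,$$

$$K_1 = -\frac{\varepsilon}{\omega} f(a, \varphi, \omega t, \varepsilon)\sin\varphi + \frac{\varepsilon\sigma^2}{2a\omega^2}\cos^2\varphi,$$

$$K_2 = -\frac{\varepsilon}{a\omega} f(a, \varphi, \omega t, \varepsilon)\cos\varphi - \frac{\varepsilon\sigma^2}{a^2\omega^2}\sin\varphi\cos\varphi. \tag{21}$$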
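
The algebra leading from (19) and (20) to (21) can be checked numerically. The sketch below solves the two 2-by-2 linear systems at a sample point and compares the result with closed-form expressions obtained by solving them by hand. It assumes the Ito operators of (10) in their standard form, so that the right-hand sides of (20) contain the terms beta*gamma and gamma^2/2 as written in the comments; the function names and the sample numbers are hypothetical.

```python
import math


def solve_2x2(a11, a12, a21, a22, b1, b2):
    """Solve [[a11, a12], [a21, a22]] [u, v]^T = [b1, b2]^T by Cramer's rule."""
    det = a11 * a22 - a12 * a21
    return (b1 * a22 - a12 * b2) / det, (a11 * b2 - a21 * b1) / det


def coefficients_numeric(a, phi, f_value, eps, sigma, omega):
    """Solve (19) for beta, gamma and then (20) for K1, K2."""
    c, s = math.cos(phi), math.sin(phi)
    # (19): beta*c - gamma*a*s = 0,  -beta*omega*s - gamma*a*omega*c = sqrt(eps)*sigma
    beta, gamma = solve_2x2(c, -a * s, -omega * s, -a * omega * c,
                            0.0, math.sqrt(eps) * sigma)
    # (20): K1*c - K2*a*s = beta*gamma*s + gamma^2*a*c/2
    #       K1*s + K2*a*c = -beta*gamma*c + gamma^2*a*s/2 - (eps/omega)*f
    rhs1 = beta * gamma * s + 0.5 * gamma ** 2 * a * c
    rhs2 = -beta * gamma * c + 0.5 * gamma ** 2 * a * s - eps * f_value / omega
    k1, k2 = solve_2x2(c, -a * s, s, a * c, rhs1, rhs2)
    return k1, k2, beta, gamma


def coefficients_closed_form(a, phi, f_value, eps, sigma, omega):
    """Closed-form solution of the same two systems."""
    c, s = math.cos(phi), math.sin(phi)
    beta = -math.sqrt(eps) * sigma * s / omega
    gamma = -math.sqrt(eps) * sigma * c / (a * omega)
    k1 = -eps * f_value * s / omega + eps * sigma ** 2 * c * c / (2.0 * a * omega ** 2)
    k2 = -eps * f_value * c / (a * omega) - eps * sigma ** 2 * s * c / (a * omega) ** 2
    return k1, k2, beta, gamma
```

Here `f_value` stands for the value of f at the chosen state; any number can be used, since the check is purely algebraic.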

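Hence, (3), (4) are transformed into the amplitude and phase differential equations (8)
where $K_1, \beta, K_2, \gamma$ are given in (21). The FP equation, written for the probability density
function $W(a, \theta, t)$ corresponding to the system (8), has the form

$$\frac{\partial W}{\partial t} = -\frac{\partial}{\partial a}(K_1 W) - \frac{\partial}{\partial\theta}(K_2 W) + \frac{\varepsilon}{2}\left[\frac{\partial^2}{\partial a^2}(K_{11}W) + 2\frac{\partial^2}{\partial a\,\partial\theta}(K_{12}W) + \frac{\partial^2}{\partial\theta^2}(K_{22}W)\right], \tag{22}$$

where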


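$$K_{11}(a, \varphi) = \frac{\sigma^2}{\omega^2}\sin^2\varphi, \qquad K_{12}(a, \varphi) = \frac{\sigma^2}{a\omega^2}\sin\varphi\cos\varphi, \qquad K_{22}(a, \varphi) = \frac{\sigma^2}{a^2\omega^2}\cos^2\varphi. \tag{23}$$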


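Khasminskii (1963) proved that when $\varepsilon \to 0$ the solution of Cauchy's problem of (22)
approaches the solution of the following averaged FP equation

$$\frac{\partial W}{\partial t} = -\frac{\partial}{\partial a}(\langle K_1\rangle W) - \frac{\partial}{\partial\theta}(\langle K_2\rangle W) + \frac{\varepsilon}{2}\left[\frac{\partial^2}{\partial a^2}(\langle K_{11}\rangle W) + 2\frac{\partial^2}{\partial a\,\partial\theta}(\langle K_{12}\rangle W) + \frac{\partial^2}{\partial\theta^2}(\langle K_{22}\rangle W)\right], \tag{24}$$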


where $\langle\cdot\rangle$ is the averaging operator with respect to time or to phase $\varphi$,

$$\langle\cdot\rangle = \frac{\omega}{2\pi}\int_0^{2\pi/\omega}(\cdot)\,dt = \frac{1}{2\pi}\int_0^{2\pi}(\cdot)\,d\varphi. \tag{25}$$

In the non-autonomous case, where the function $f$ in (1) contains explicitly the time
$\omega t$, the averaged coefficients $\langle K_1\rangle$, $\langle K_2\rangle$ are functions of $a$ and $\theta$. Thus, the solution to
(24) is still a difficult problem. In Mitropolskii et al (1992) and Anh (1985), two methods
for investigating the stationary solution $W(a, \theta)$ of (24) are proposed.


2.1 Method of auxiliary function 


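The method transforms the FP equation (24) into a partial differential equation for an
auxiliary function $u(a, \theta)$

$$\frac{\partial}{\partial\theta}M\!\left(a, \theta, \frac{\partial u}{\partial a}, \frac{\partial u}{\partial\theta}, u\right) = \frac{\partial}{\partial a}N\!\left(a, \theta, \frac{\partial u}{\partial a}, \frac{\partial u}{\partial\theta}, u\right), \tag{26}$$

where $M$ and $N$ are known functions of indicated arguments. After finding the function
$u(a, \theta)$, the stationary probability density function $W(a, \theta)$ can be found by the
quadrature

$$W(a, \theta) = \exp\left\{\int M\!\left(a, \theta, \frac{\partial u}{\partial a}, \frac{\partial u}{\partial\theta}, u\right)da + \int N\!\left(a, \theta, \frac{\partial u}{\partial a}, \frac{\partial u}{\partial\theta}, u\right)d\theta\right\}. \tag{27}$$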


However, the effectiveness of the use of (26) is that its trivial or simple solutions
correspond to the nontrivial exact solution (27) of the FP equation (24). Hence, one gets
a method to determine sufficient conditions for the integrability of the last equation: to
impose sufficient conditions on the coefficients $\langle K_i\rangle$ and $\langle K_{ij}\rangle$, $i, j = 1, 2$, so that (26)
admits some definite solution $u(a, \theta)$, under which the particular exact solution (27) of
the FP equation (24) can be found in quadratures.

Consider, for example, the following simple auxiliary function

$$u(a, \theta) = 0.$$

Equation (26) takes the form


a f 2 

d0l<K n ) 


/K \ ' d < K "> 


11 2 ( 1 s<K 22 > < k 12 > \ 

da\(K 22 yy 2? 2 80 da J 

4<k 12 > ( i a<K tl > \] 

<Kn><K 2 2>V 2 5a /!' 


(28) 


Thus, if the coefficients $\langle K_i\rangle$, $\langle K_{ij}\rangle$, $i, j = 1, 2$, of drift and diffusion satisfy the condition
(28), then the FP equation (24) has the exact solution


fF(a, 0) = exp < 2 


w 

T <^i> 

1 


L<^n> 

2<K n > 

da 


da 


+ 2 


1 


,<K 2 2> 


<k 2 >- 


1 8 < \K 22 > 

2 80 


8<K 12 ) \ 
da ) 


4<£ 12 > . l g<^n> \ 

<^n><^22>\ 1 2 5a ^ 



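In the case of external white noise excitation, one has

$$\langle K_{11}\rangle = \left\langle\frac{\sigma^2}{\omega^2}\sin^2\varphi\right\rangle = \frac{\sigma^2}{2\omega^2}, \qquad \langle K_{12}\rangle = \left\langle\frac{\sigma^2}{a\omega^2}\sin\varphi\cos\varphi\right\rangle = 0, \qquad \langle K_{22}\rangle = \left\langle\frac{\sigma^2}{a^2\omega^2}\cos^2\varphi\right\rangle = \frac{\sigma^2}{2a^2\omega^2}. \tag{30}$$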
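
The phase averages in (30) can be reproduced with the averaging operator (25) evaluated numerically. The sketch below uses an equally spaced rectangle rule over one period, which is exact for low-order trigonometric polynomials; the function names and the number of quadrature points are illustrative assumptions.

```python
import math


def phase_average(func, n_points=720):
    """Average of func(phi) over one period, as in the operator (25)."""
    return sum(func(2.0 * math.pi * k / n_points) for k in range(n_points)) / n_points


def averaged_diffusion(a, sigma, omega):
    """Numerical phase averages of the diffusion coefficients (23)."""
    k11 = phase_average(lambda p: sigma ** 2 * math.sin(p) ** 2 / omega ** 2)
    k12 = phase_average(lambda p: sigma ** 2 * math.sin(p) * math.cos(p) / (a * omega ** 2))
    k22 = phase_average(lambda p: sigma ** 2 * math.cos(p) ** 2 / (a * omega) ** 2)
    return k11, k12, k22
```

For any positive a, sigma and omega the three returned values should equal sigma^2/(2 omega^2), 0 and sigma^2/(2 a^2 omega^2) up to rounding.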

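Then the condition (28) is simplified

$$\frac{\partial}{\partial\theta}\langle K_1\rangle = \frac{\partial}{\partial a}\left(a^2\langle K_2\rangle\right), \tag{31}$$

and the corresponding solution (29) takes the form

$$W(a, \theta) = \exp\left\{\frac{4\omega^2}{\sigma^2}\int\left[\langle K_1\rangle\,da + a^2\langle K_2\rangle\,d\theta\right]\right\}. \tag{32}$$

A class of mechanical systems for which the condition (28) is fulfilled has been
indicated in Mitropolskii et al (1992).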


2.2 Method of expansion in a quasicyclic coordinate 

First, the FP equation (24) is transformed to the form 

dJK Q djKQ 1 a 2 <X n > a 2 <-K, 2 > 

da dd 2 da 2 dadd 





where 


l d 2 CK 22 ) 

2 dd 2 + 


<Ki> 


d<K xl ) 

dd 



d® 

da 


+ Uk 2 '>- 


d(tc 21 y £<£ u >\a®_<£ li > 

da J dd 2 


dd 2 


d<D\ 2 d 2 <$ 

da) da 2 


~<K 12 > 


S 2 0 SOS®' 
dadd da dd 


<K 22 y 


s 2 ® /s® 
50 r + Vae 


= o, 


( 33 ) 


$$\Phi(a, \theta) = \ln W(a, \theta). \tag{34}$$


In many cases, the coefficients of drift and diffusion are polynomials in the amplitude
$a$ with integral powers and with coefficients depending on $\theta$. The amplitude $a$ may thus
play the role of a quasicyclic coordinate. According to the method of expansion in series
in terms of the quasicyclic coordinate, one can also find the solution of (33) in $\partial\Phi/\partial a$ in
the form of a polynomial containing integral powers of the amplitude $a$:


d(f) 

da 


= a 


+ 


CG 


X 1 - 


Then one has 


(35) 


W(a, d) = Cad exp 


I 


(36) 


Substituting (35) into (33), and equating the coefficients of the same powers of $a$, one
obtains a closed system of separable differential equations for the successive determination
of all the unknown coefficients $g_i(\theta)$. The arbitrary integration constants must be
chosen from the condition for the functions $g_i(\theta)$ to be periodic. The problem of the
convergence of the series (36) is not easy; however, it does not arise in cases when one
admits an exact solution of the form


i = 0,2, • • • JV; /q(0) = O, l^N+l. 

Then we obtain 

W(a, d) = Cu'exp j £ 'F,(0)a' j . 


i = 0 


(37) 


(38) 


This property will be shared by many mechanical systems (Mitropolskii et al 1992).

Example 1. The Van der Pol system subjected to periodic and white noise excitations

$$\ddot{x} + \omega^2 x = \varepsilon(1 - \gamma x^2)\dot{x} + \varepsilon P\cos\omega t + \sqrt{\varepsilon}\,\sigma\,\xi(t). \tag{39}$$

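The corresponding coefficients of drift and diffusion are found to be

$$K_1 = a\sin^2\varphi - \gamma a^3\cos^2\varphi\sin^2\varphi - \frac{P}{\omega}\cos\omega t\,\sin\varphi + \frac{\sigma^2}{2\omega^2}\cos^2\varphi\;a^{-1},$$

$$K_2 = \sin\varphi\cos\varphi - \gamma a^2\cos^3\varphi\sin\varphi - \frac{P}{a\omega}\cos\omega t\,\cos\varphi - \frac{\sigma^2}{\omega^2}\sin\varphi\cos\varphi\;a^{-2},$$

$$K_{11} = \frac{\sigma^2}{\omega^2}\sin^2\varphi, \qquad K_{12} = \frac{\sigma^2}{\omega^2}\sin\varphi\cos\varphi\;a^{-1}, \qquad K_{22} = \frac{\sigma^2}{\omega^2}\cos^2\varphi\;a^{-2}. \tag{40}$$

Hence one gets, noting $\omega t = \varphi - \theta$,

$$\langle K_1\rangle = \frac{a}{2} - \frac{\gamma a^3}{8} - \frac{P}{2\omega}\sin\theta + \frac{\sigma^2}{4\omega^2}a^{-1}, \qquad \langle K_2\rangle = -\frac{P}{2\omega}\cos\theta\;a^{-1},$$

$$\langle K_{11}\rangle = \frac{\sigma^2}{2\omega^2}, \qquad \langle K_{12}\rangle = 0, \qquad \langle K_{22}\rangle = \frac{\sigma^2}{2\omega^2}a^{-2}. \tag{41}$$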


It is easy to show that the averaged coefficients (41) satisfy the condition of integrability
(31). Thus, substituting (41) into (32) yields the following probability density function
$W(a, \theta)$ of the Van der Pol system (39)

$$W(a, \theta) = Ca\exp\left\{-\frac{2\omega P}{\sigma^2}\sin\theta\;a + \frac{\omega^2}{\sigma^2}\left(a^2 - \frac{\gamma}{8}a^4\right)\right\}, \tag{42}$$

where $C$ is a normalization constant.
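
Both claims in this example, that the averaged coefficients satisfy the integrability condition (31) and that the resulting density has the exponential form obtained from (32), can be checked by finite differences. The sketch below assumes the averaged Van der Pol drift coefficients a/2 - gamma a^3/8 - P sin(theta)/(2 omega) + sigma^2/(4 omega^2 a) and -P cos(theta)/(2 omega a), which follow from phase averaging with omega t = phi - theta; the function names, parameter values and step size are illustrative.

```python
import math


def avg_drift(a, theta, gamma, p, omega, sigma):
    """Averaged drift coefficients <K1>, <K2> of the forced Van der Pol system."""
    k1 = (0.5 * a - gamma * a ** 3 / 8.0 - p * math.sin(theta) / (2.0 * omega)
          + sigma ** 2 / (4.0 * omega ** 2 * a))
    k2 = -p * math.cos(theta) / (2.0 * omega * a)
    return k1, k2


def log_density(a, theta, gamma, p, omega, sigma):
    """ln(W/C) for the density of the form (42)."""
    return (math.log(a) - 2.0 * omega * p * a * math.sin(theta) / sigma ** 2
            + omega ** 2 * (a ** 2 - gamma * a ** 4 / 8.0) / sigma ** 2)


def residuals(a, theta, gamma=1.0, p=0.3, omega=2.0, sigma=0.7, step=1e-5):
    """Residuals of the two gradient relations behind (32) and of condition (31)."""
    args = (gamma, p, omega, sigma)
    scale = 4.0 * omega ** 2 / sigma ** 2
    k1, k2 = avg_drift(a, theta, *args)
    dlog_da = (log_density(a + step, theta, *args)
               - log_density(a - step, theta, *args)) / (2.0 * step)
    dlog_dtheta = (log_density(a, theta + step, *args)
                   - log_density(a, theta - step, *args)) / (2.0 * step)
    dk1_dtheta = (avg_drift(a, theta + step, *args)[0]
                  - avg_drift(a, theta - step, *args)[0]) / (2.0 * step)
    d_a2k2_da = ((a + step) ** 2 * avg_drift(a + step, theta, *args)[1]
                 - (a - step) ** 2 * avg_drift(a - step, theta, *args)[1]) / (2.0 * step)
    return dlog_da - scale * k1, dlog_dtheta - scale * a ** 2 * k2, dk1_dtheta - d_a2k2_da
```

All three residuals should vanish to within finite-difference error at any point with a > 0.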

The solution (42) can also be obtained by using the method of expansion in
a quasicyclic coordinate. Thus, let

$$W(a, \theta) = Ca\exp\{g_0(\theta) + g_1(\theta)a + g_2(\theta)a^2 + \cdots\}. \tag{43}$$

Substituting (43), (41) into FP equation (24) and equating the coefficients of
$a^{-2}, a^{-1}, a^0, \cdots$ we obtain the following system of separable equations for the unknown
$g_0(\theta), g_1(\theta), g_2(\theta), \cdots$ (a prime denotes differentiation with respect to $\theta$),

Po + Po = Pi + Pi ~ 


„ . 4co 2 2 , 2 2 (OP 

P 2 + 4 p 2 — — 2 — (Pi + Pi ) - T~ 

a o 


(, fi l sin 9 + fi\ cos 9), 


Pi + = - 2(2 pip 2 + fi\ fi 2 ) - (2g 2 sin 9 + g ' 2 cos 9) + 


Pi + 16^4 = “ ( 4 pI + P 2 2 ) -2(3 ^3 + fi\g 2 ) - 


2 (o 2 y 


2(oP . .. 4co 2 

—2 —(3/i 3 sin 9 + /i 3 cos 0) H-j-• 

It can be shown that the system (44) has the solution

$$g_0(\theta) = 0, \qquad g_3(\theta) = 0, \qquad g_n(\theta) = 0, \quad n \geq 5,$$

$$g_1(\theta) = -\frac{2\omega P}{\sigma^2}\sin\theta, \qquad g_2(\theta) = \frac{\omega^2}{\sigma^2}, \qquad g_4(\theta) = -\frac{\gamma\omega^2}{8\sigma^2}. \tag{45}$$

Substituting (45) into (43) yields (42) again.

The probability density function (42) has extrema at the points $(a, \theta)$, where $\partial W/\partial a = 0$ and $\partial W/\partial\theta = 0$, that is,

$$\frac{\sigma^2}{a} + 2\omega^2 a - \frac{\gamma\omega^2}{2}a^3 - 2\omega P\sin\theta = 0,$$

$$-2\omega Pa\cos\theta = 0. \tag{46}$$

Hence, we get the most probable value of the amplitude

$$\left[\frac{\sigma^2}{a} + 2\omega^2 a - \frac{\gamma\omega^2}{2}a^3\right]^2 = 4\omega^2 P^2. \tag{47}$$

For $\sigma \to 0$, $P \to 0$ the value (47) tends to the known value of the amplitude of
self-oscillations of the Van der Pol system.
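
The limit quoted above can be checked by solving the amplitude relation numerically. The sketch below assumes that the bracketed expression in (47) is sigma^2/a + 2 omega^2 a - gamma omega^2 a^3/2 and finds a root of "bracket = ±2 omega P" by bisection on an assumed bracket [1/sqrt(gamma), 10/sqrt(gamma)]; the function names, the bracket and the tolerance are hypothetical choices. For vanishing noise and forcing, the root should approach the classical self-oscillation amplitude 2/sqrt(gamma).

```python
import math


def amplitude_balance(a, gamma, omega, sigma):
    """Bracketed expression of (47): sigma^2/a + 2 omega^2 a - gamma omega^2 a^3 / 2."""
    return sigma ** 2 / a + 2.0 * omega ** 2 * a - 0.5 * gamma * omega ** 2 * a ** 3


def most_probable_amplitude(gamma, omega, sigma, p, sign=1.0, tol=1e-12):
    """Root of amplitude_balance(a) = sign * 2 omega P by bisection."""
    target = sign * 2.0 * omega * p
    lo, hi = 1.0 / math.sqrt(gamma), 10.0 / math.sqrt(gamma)
    g_lo = amplitude_balance(lo, gamma, omega, sigma) - target
    g_hi = amplitude_balance(hi, gamma, omega, sigma) - target
    if g_lo * g_hi > 0.0:
        raise ValueError("no sign change in the assumed bracket")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        g_mid = amplitude_balance(mid, gamma, omega, sigma) - target
        if g_lo * g_mid <= 0.0:
            hi = mid
        else:
            lo, g_lo = mid, g_mid
    return 0.5 * (lo + hi)
```

The `sign` argument selects sin(theta) = +1 or -1, the two phase values allowed by the second extremum condition.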

If the mechanical system is autonomous, the corresponding averaged coefficients of
drift $\langle K_1\rangle$, $\langle K_2\rangle$ depend only on $a$. Thus, the stationary probability density function of
amplitude $W_0(a)$ can be found from the equation

$$\frac{\partial}{\partial a}\left[\langle K_1\rangle W_0(a)\right] - \frac{1}{2}\frac{\partial^2}{\partial a^2}\left[\langle K_{11}\rangle W_0(a)\right] = 0. \tag{48}$$

It is known that in many cases of interest the averaged FP equation (48) is not sufficient
for analysis of nonlinear terms in the original equation (1).



3. Higher approximate solutions to FP equation 


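We consider only the autonomous case of (1) and suppose the considered system has
a stationary response. Using the transformation

$$x = a\cos\varphi, \qquad \dot{x} = -a\omega\sin\varphi, \tag{49}$$

and the Ito formula we reduce (1) to the form

$$da = \varepsilon K_1(a, \varphi, \varepsilon)\,dt - \frac{\sqrt{\varepsilon}\,\sigma}{\omega}\sin\varphi\,d\eta(t),$$

$$d\varphi = [\omega + \varepsilon K_2(a, \varphi, \varepsilon)]\,dt - \frac{\sqrt{\varepsilon}\,\sigma}{a\omega}\cos\varphi\,d\eta(t). \tag{50}$$

The corresponding FP equation for the stationary probability density function $W(a, \varphi)$
takes the form

$$\omega\frac{\partial W}{\partial\varphi} = -\varepsilon\left[\frac{\partial}{\partial a}(K_1 W) + \frac{\partial}{\partial\varphi}(K_2 W)\right] + \frac{\varepsilon}{2}\left[\frac{\partial^2}{\partial a^2}(K_{11}W) + 2\frac{\partial^2}{\partial a\,\partial\varphi}(K_{12}W) + \frac{\partial^2}{\partial\varphi^2}(K_{22}W)\right], \tag{51}$$

where the coefficients of drift and diffusion are given in (21) and (23), with the factor $\varepsilon$
of the drift coefficients now written explicitly. Denote

$$\Phi(a, \varphi) = \ln W(a, \varphi), \qquad \Phi_0(a) = \ln W_0(a). \tag{52}$$

It is easy to show

$$\frac{\partial W}{\partial a} = \frac{\partial\Phi}{\partial a}W, \qquad \frac{\partial^2 W}{\partial a^2} = \left[\frac{\partial^2\Phi}{\partial a^2} + \left(\frac{\partial\Phi}{\partial a}\right)^2\right]W, \qquad \frac{\partial^2 W}{\partial a\,\partial\varphi} = \left[\frac{\partial^2\Phi}{\partial a\,\partial\varphi} + \frac{\partial\Phi}{\partial a}\frac{\partial\Phi}{\partial\varphi}\right]W. \tag{53}$$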

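For the given functions $K_i(a, \varphi)$, $K_{ij}(a, \varphi)$, $K_{ij} = K_{ji}$, $i, j = 1, 2$, the operators
$[K_i, K_{ij}]\mathcal{L}$ and $[K_i, K_{ij}]\ell$ are defined as follows

$$[K_i, K_{ij}]\mathcal{L}(W) = \frac{\partial}{\partial a}(K_1 W) + \frac{\partial}{\partial\varphi}(K_2 W) - \frac{1}{2}\left[\frac{\partial^2}{\partial a^2}(K_{11}W) + 2\frac{\partial^2}{\partial a\,\partial\varphi}(K_{12}W) + \frac{\partial^2}{\partial\varphi^2}(K_{22}W)\right],$$

$$[K_i, K_{ij}]\ell(\Phi) = \frac{\partial K_1}{\partial a} + \frac{\partial K_2}{\partial\varphi} - \frac{1}{2}\frac{\partial^2 K_{11}}{\partial a^2} - \frac{\partial^2 K_{12}}{\partial a\,\partial\varphi} - \frac{1}{2}\frac{\partial^2 K_{22}}{\partial\varphi^2} + \left(K_1 - \frac{\partial K_{11}}{\partial a} - \frac{\partial K_{12}}{\partial\varphi}\right)\frac{\partial\Phi}{\partial a} + \left(K_2 - \frac{\partial K_{22}}{\partial\varphi} - \frac{\partial K_{12}}{\partial a}\right)\frac{\partial\Phi}{\partial\varphi}$$

$$\qquad - \frac{1}{2}\left\{K_{11}\left[\frac{\partial^2\Phi}{\partial a^2} + \left(\frac{\partial\Phi}{\partial a}\right)^2\right] + 2K_{12}\left[\frac{\partial^2\Phi}{\partial a\,\partial\varphi} + \frac{\partial\Phi}{\partial a}\frac{\partial\Phi}{\partial\varphi}\right] + K_{22}\left[\frac{\partial^2\Phi}{\partial\varphi^2} + \left(\frac{\partial\Phi}{\partial\varphi}\right)^2\right]\right\}. \tag{54}$$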

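Calculations show that the operators $[K_i, K_{ij}]\mathcal{L}$, $[K_i, K_{ij}]\ell$ have the following properties

$$[K_i, K_{ij}]\mathcal{L}(\alpha W + \beta W_1) = \alpha[K_i, K_{ij}]\mathcal{L}(W) + \beta[K_i, K_{ij}]\mathcal{L}(W_1), \tag{55a}$$

$$\alpha[K_i, K_{ij}]\mathcal{L}(W) = [\alpha K_i, \alpha K_{ij}]\mathcal{L}(W) = [K_i, K_{ij}]\mathcal{L}(\alpha W), \tag{55b}$$

$$[K_i + R_i, K_{ij} + R_{ij}]\mathcal{L}(W) = [K_i, K_{ij}]\mathcal{L}(W) + [R_i, R_{ij}]\mathcal{L}(W), \tag{55c}$$

$$[K_i, K_{ij}]\mathcal{L}(W_1 W) = [K_i W_1, K_{ij}W_1]\mathcal{L}(W), \tag{55d}$$

$$[K_i, K_{ij}]\mathcal{L}(W) = W[K_i, K_{ij}]\ell(\Phi), \tag{55e}$$

$$\langle[K_i, K_{ij}]\mathcal{L}(W_0(a))\rangle = [\langle K_i\rangle, \langle K_{ij}\rangle]\mathcal{L}(W_0(a)), \tag{55f}$$

where $W(a, \varphi)$, $W_1(a, \varphi)$, $R_i(a, \varphi)$, $R_{ij}(a, \varphi)$, $R_{ij} = R_{ji}$, $i, j = 1, 2$,
are arbitrary functions of $a$ and $\varphi$, $W_0(a)$ is a function of $a$, and $\alpha$ and $\beta$ are constants. Using
now the operator $[K_i, K_{ij}]\mathcal{L}$, the FP equation (51) can be rewritten in the form

$$\omega\frac{\partial W}{\partial\varphi} = -\varepsilon[K_i, K_{ij}]\mathcal{L}(W). \tag{56}$$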

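Suppose that

$$K_i(a, \varphi, \varepsilon) = K_{i0}(a, \varphi) + R_{i1}(a, \varphi)\varepsilon + R_{i2}(a, \varphi)\varepsilon^2 + \cdots, \qquad i = 1, 2. \tag{57}$$

We seek the solution of (56) in the form

$$W = W_0 + \varepsilon W_1 + \varepsilon^2 W_2 + \cdots. \tag{58}$$

Substituting (57), (58) into (56) and noting the properties (55) yields

$$\omega\left[\frac{\partial W_0}{\partial\varphi} + \varepsilon\frac{\partial W_1}{\partial\varphi} + \varepsilon^2\frac{\partial W_2}{\partial\varphi} + \cdots\right] = -\varepsilon\left\{[K_{i0}, K_{ij}]\mathcal{L}(W_0) + \varepsilon\left([K_{i0}, K_{ij}]\mathcal{L}(W_1) + [R_{i1}, 0]\mathcal{L}(W_0)\right) + \cdots\right\}.$$

Comparing the coefficients of like powers of $\varepsilon$ one obtains

$$\varepsilon^0:\quad \omega\frac{\partial W_0}{\partial\varphi} = 0, \tag{59}$$

$$\varepsilon^1:\quad \omega\frac{\partial W_1}{\partial\varphi} = -[K_{i0}, K_{ij}]\mathcal{L}(W_0), \tag{60}$$

$$\varepsilon^2:\quad \omega\frac{\partial W_2}{\partial\varphi} = -[K_{i0}, K_{ij}]\mathcal{L}(W_1) - [R_{i1}, 0]\mathcal{L}(W_0). \tag{61}$$

From (59) it follows

$$W_0 = W_0(a), \tag{62}$$

then (60) yields

$$\omega\frac{\partial W_1}{\partial\varphi} = -[K_{i0}, K_{ij}]\mathcal{L}(W_0(a)). \tag{63}$$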

The arbitrary integration function $W_0(a)$ must be chosen from the condition for the
function $W_1$ to be periodic. Thus, one gets from (63)

$$\langle[K_{i0}, K_{ij}]\mathcal{L}(W_0(a))\rangle = 0, \tag{64}$$

$$\omega\frac{\partial W_1}{\partial\varphi} = -\left\{[K_{i0}, K_{ij}]\mathcal{L}(W_0(a)) - \langle[K_{i0}, K_{ij}]\mathcal{L}(W_0(a))\rangle\right\}. \tag{65}$$

Taking into account the property (55f), one can rewrite (64) in the form

$$[\langle K_{i0}\rangle, \langle K_{ij}\rangle]\mathcal{L}(W_0(a)) = 0, \tag{66}$$

which is identical to (48). Hence, it is shown that the averaged FP equation (48) may be
obtained by equating to zero the averaged value of the coefficient of power $\varepsilon$ in the
original FP equation (51). Further, using (66) and (55e) one gets from (65)

$$\omega\frac{\partial W_1}{\partial\varphi} = -W_0(a)[K_{i0}, K_{ij}]\ell(\Phi_0(a)). \tag{67}$$


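Expand $[K_{i0}, K_{ij}]\ell(\Phi_0(a))$ into a Fourier series, as $K_{i0}$, $K_{ij}$ are periodic functions in
$\varphi$ with period $2\pi$ (see (21)),

$$[K_{i0}, K_{ij}]\ell(\Phi_0(a)) = 2\sum_{n=1}^{\infty}\left\{\langle[K_{i0}, K_{ij}]\ell(\Phi_0(a))\sin n\varphi\rangle\sin n\varphi + \langle[K_{i0}, K_{ij}]\ell(\Phi_0(a))\cos n\varphi\rangle\cos n\varphi\right\}. \tag{68}$$

Substituting (68) into (67) yields

$$W_1(a, \varphi) = W_0(a)\left(W_{11}(a, \varphi) + W_{10}(a)\right), \tag{69}$$

where $W_{10}(a)$ is an arbitrary integration function of $a$, and

$$W_{11}(a, \varphi) = -\frac{2}{\omega}\sum_{n=1}^{\infty}\frac{1}{n}\left\{\langle[K_{i0}, K_{ij}]\ell(\Phi_0(a))\cos n\varphi\rangle\sin n\varphi - \langle[K_{i0}, K_{ij}]\ell(\Phi_0(a))\sin n\varphi\rangle\cos n\varphi\right\}. \tag{70}$$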









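Substituting (69) into (61) and noting (55a), (55d) yields

$$\omega\frac{\partial W_2}{\partial\varphi} = -[K_{i0}W_{11}, K_{ij}W_{11}]\mathcal{L}(W_0(a)) - [K_{i0}W_0(a), K_{ij}W_0(a)]\mathcal{L}(W_{10}(a)) - \frac{\partial}{\partial a}(R_{11}W_0(a)) - \frac{\partial}{\partial\varphi}(R_{21}W_0(a)). \tag{71}$$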

Therefore it follows 

< [K, 0 W,,,K, t W u ]Jf( W 0 (a)) + IK, 0 fT 0 (a), VT 0 ( a )] 

x i?( W l0 (a)) - ^<R„ W 0 (a)) -^(R 21 IT 0 (a))> = 0 (72) 

or noting (55f), 

[<R ,0 W'l 1 >. <R»^, >]^(W 0 )(a)) + [<K i0 IT 0 (a)>, <K,J fT 0 (a)>] 


xi?(W' IO ( £ t))-^«R 11 >lT 0 (a)) = 0. (73) 


In the explicit form, (73) takes the form 


<K,o W'i . > w o(“) - <*i, W, , > TT 0 (o)} + <R 10 IT 0 (a)> IT 0 (a) 

- ~{ < K 1 r W'oW) W'toW} - J a ~ > »o(a)) = 0- 


Using (48) one gets from (74), 


<R,0 W.. > - <*. t *^t, > -5<Kt, W'u > 

_ 2 < ^ u>_ 5a-3^- <Rll> ^ ° 


So one obtains 


WioW 


- rr 1 

L<*n 






Substituting (75) into (71) and noting that 

[*io W 0 ,K U W 0 1 J?(W 10 ) = [K i0 W l0 ,K u W 0 ~] if ( W 0 ), 
one gets from (71) 

W 2 (a, (f>) = W Q (a) W 22 (a, <j)), 


where 


W 22 (a,(f>) = 


lK i0 W tl + K i0 W w ,K ij W 11 + K u W l0 V{O 0 (a)) 


dR-u d S% dR 2l 


R ^- wV *- < 77 > 
According to SAM, the second approximate solution to the FP equation (51) is
defined as

$$W(a, \varphi) = W_0(a)\{1 + \varepsilon(W_{10}(a) + W_{11}(a, \varphi))\}, \tag{78}$$

and the improved second approximate solution is

$$W(a, \varphi) = W_0(a)\{1 + \varepsilon(W_{10}(a) + W_{11}(a, \varphi)) + \varepsilon^2 W_{22}(a, \varphi)\}, \tag{79}$$

where $W_0(a)$, $W_{10}(a)$, $W_{11}(a, \varphi)$ and $W_{22}(a, \varphi)$ are found from (48), (75), (70) and (77),
respectively. It is seen that the solution (79) might be negative; however, this difficulty
arises very often in truncated series expansions of the probability density function.

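Example 2. Duffing oscillator: Apply now the proposed procedure to the Duffing
oscillator subjected to white noise excitation

$$\ddot{x} + \omega^2 x = -2\varepsilon h\dot{x} - \varepsilon\gamma x^3 + \sqrt{\varepsilon}\,\sigma\,\xi(t), \qquad h, \gamma > 0. \tag{80}$$

From (80) and (21) one gets

$$K_1(a, \varphi) = \frac{\sigma^2}{4a\omega^2} - ha + \left(ha + \frac{\sigma^2}{4a\omega^2}\right)\cos 2\varphi + \frac{\gamma}{4\omega}a^3\sin 2\varphi + \frac{\gamma}{8\omega}a^3\sin 4\varphi,$$

$$K_2(a, \varphi) = \frac{3\gamma}{8\omega}a^2 - \left(h + \frac{\sigma^2}{2a^2\omega^2}\right)\sin 2\varphi + \frac{\gamma}{2\omega}a^2\cos 2\varphi + \frac{\gamma}{8\omega}a^2\cos 4\varphi,$$

$$K_{11}(a, \varphi) = \frac{\sigma^2}{2\omega^2} - \frac{\sigma^2}{2\omega^2}\cos 2\varphi, \qquad K_{12}(a, \varphi) = \frac{\sigma^2}{2a\omega^2}\sin 2\varphi,$$

$$K_{22}(a, \varphi) = \frac{\sigma^2}{2a^2\omega^2} + \frac{\sigma^2}{2a^2\omega^2}\cos 2\varphi. \tag{81}$$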

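The corresponding averaged FP equation (48) is

$$\frac{\partial}{\partial a}\left[\left(\frac{\sigma^2}{4a\omega^2} - ha\right)W_0(a)\right] - \frac{1}{2}\frac{\partial^2}{\partial a^2}\left[\frac{\sigma^2}{2\omega^2}W_0(a)\right] = 0, \tag{82}$$

which gives the solution

$$W_0(a) = Ca\exp\left\{-\frac{2h\omega^2}{\sigma^2}a^2\right\}. \tag{83}$$

It is seen that the solution (83) corresponds to the linear case of the system (80), ($\gamma = 0$),
and does not include the effect of the nonlinear term $\gamma x^3$.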


Further, substituting (81), (83), (54) into (68) yields

$$[K_{i0}, K_{ij}]\ell(\Phi_0(a)) = -\frac{\gamma h\omega}{2\sigma^2}a^4\,(2\sin 2\varphi + \sin 4\varphi). \tag{84}$$

Then, from (67) one gets

$$W_{11}(a, \varphi) = -\frac{\gamma h a^4}{2\sigma^2}\left(\cos 2\varphi + \frac{1}{4}\cos 4\varphi\right). \tag{85}$$

Further, using (81), (83), and (85) one gets from (75)

$$W_{10}(a) = -\frac{3\gamma h a^4}{8\sigma^2}. \tag{86}$$

Hence, the second approximate solution to the FP equation (51) for the Duffing case
(80) takes the form

$$W(a, \varphi) = Ca\exp\left\{-\frac{2h\omega^2}{\sigma^2}a^2\right\}\left[1 - \frac{\varepsilon\gamma h}{8\sigma^2}a^4\,(3 + 4\cos 2\varphi + \cos 4\varphi)\right]. \tag{87}$$

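The condition

$$\int_0^{\infty}\int_0^{2\pi}W(a, \varphi)\,da\,d\varphi = 1 \tag{88}$$

gives

$$C = \frac{2h\omega^2}{\pi\sigma^2}\left(1 + \frac{3\varepsilon\gamma\sigma^2}{16h\omega^4}\right). \tag{89}$$

From (87) one gets the probability density functions $W(a)$, $W(\varphi)$, respectively,

$$W(a) = \frac{4h\omega^2}{\sigma^2}\left(1 + \frac{3\varepsilon\gamma\sigma^2}{16h\omega^4}\right)\left(a - \frac{3\varepsilon\gamma h}{8\sigma^2}a^5\right)\exp\left\{-\frac{2h\omega^2}{\sigma^2}a^2\right\},$$

$$W(\varphi) = \frac{1}{2\pi}\left(1 + \frac{3\varepsilon\gamma\sigma^2}{16h\omega^4}\right)\left[1 - \frac{\varepsilon\gamma\sigma^2}{16h\omega^4}(3 + 4\cos 2\varphi + \cos 4\varphi)\right]. \tag{90}$$

Thus, in particular, one has

$$\langle a^2\rangle = \frac{\sigma^2}{2h\omega^2} - \varepsilon\frac{3\gamma\sigma^4}{16h^2\omega^6}. \tag{91}$$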

Hence, using the second approximate solution, the effect of the nonlinear term $\gamma x^3$ is
obtained in the formulae (87), (90), and (91). It is seen from (91) that the nonlinear term
$\gamma x^3$ reduces the mean square of the amplitude. This effect cannot be found from the first
stochastic averaging.
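
The normalization constant and the mean square amplitude can be cross-checked by direct numerical integration of the second approximation. The sketch below assumes the density shape a exp(-2 h omega^2 a^2 / sigma^2) [1 - epsilon gamma h a^4 (3 + 4 cos 2 phi + cos 4 phi) / (8 sigma^2)] and compares the numerical results with the first-order formulas C = (2 h omega^2 / (pi sigma^2)) (1 + 3 epsilon gamma sigma^2 / (16 h omega^4)) and <a^2> = sigma^2 / (2 h omega^2) - 3 epsilon gamma sigma^4 / (16 h^2 omega^6). Agreement is expected only up to terms of order epsilon squared. Function names, grid sizes and parameter values are hypothetical.

```python
import math


def density_shape(a, phi, eps, gamma, h, sigma, omega):
    """Second approximation of the density, without the constant C."""
    kappa = 2.0 * h * omega ** 2 / sigma ** 2
    corr = (eps * gamma * h * a ** 4
            * (3.0 + 4.0 * math.cos(2.0 * phi) + math.cos(4.0 * phi)) / (8.0 * sigma ** 2))
    return a * math.exp(-kappa * a * a) * (1.0 - corr)


def integrate(weight, params, n_a=4000, n_phi=64):
    """Integral of weight(a) * density_shape over a >= 0 and one period of phi."""
    eps, gamma, h, sigma, omega = params
    kappa = 2.0 * h * omega ** 2 / sigma ** 2
    a_max = 8.0 / math.sqrt(kappa)      # the Gaussian factor is negligible beyond this
    da = a_max / n_a
    dphi = 2.0 * math.pi / n_phi
    total = 0.0
    for i in range(n_a + 1):
        a = i * da
        edge = 0.5 if i in (0, n_a) else 1.0   # trapezoidal end weights in a
        row = sum(density_shape(a, j * dphi, *params) for j in range(n_phi))
        total += edge * weight(a) * row
    return total * da * dphi


def numerical_moments(params):
    """Normalization constant and mean square amplitude by quadrature."""
    norm = integrate(lambda a: 1.0, params)
    second = integrate(lambda a: a * a, params)
    return 1.0 / norm, second / norm


def first_order_formulas(params):
    """First-order expressions for C and <a^2>."""
    eps, gamma, h, sigma, omega = params
    delta = 3.0 * eps * gamma * sigma ** 2 / (16.0 * h * omega ** 4)
    c = 2.0 * h * omega ** 2 / (math.pi * sigma ** 2) * (1.0 + delta)
    mean_sq = sigma ** 2 / (2.0 * h * omega ** 2) - eps * 3.0 * gamma * sigma ** 4 / (16.0 * h ** 2 * omega ** 6)
    return c, mean_sq
```

With epsilon = 0.01, gamma = 1, h = 0.5, sigma = 1 and omega = 1 the linear value of the mean square amplitude is 1, and both routes give a slightly smaller value, in line with the remark that the cubic stiffness reduces the mean square of the amplitude.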




4. A modified procedure 


A similar procedure can be applied to the function <D(a, 0) by imposing 

0) = d> 0 + £<D 1 + £ 2 0 2 + •••. (92) 

Noting (52), (55e) one gets from (56) the following equation for d>: 


co 


8 <D 

<30 


-el 


(93) 


Substituting (57), (92) into (93) and comparing the coefficients of like powers of $\varepsilon$ yields

$$ \omega\,\frac{\partial \Phi_0}{\partial \varphi} = 0, \qquad (94) $$

®^=-[x,„,k 0 .] n %).


( 95 ) 


co 


5<d 2 
5c p 


dK n dK 12 \d ^ / 5k 22 ^ 12 yo t 

10 5a 50 ) da \ 20 50 da ) 50 

1 . / 5 2 ^» (d 2 ®, d% 50 ,\ 

2 u \5a 2 5a da ) 12 \5a50 5a 50 ) 


Jr 

+ 2 * 22 50 2 5a 50 11 5a’ ' 


(96) 


As an illustration apply the procedure to the autonomous Van der Pol system (39),
($P = 0$). In this case the coefficients $K_1$, $K_2$ of drift do not depend on $\varepsilon$ and are obtained
from (40) by putting $P = 0$. Equation (94) yields $\Phi_0 = \Phi_0(a)$ and (95) gives, in particular,


[<*«>>. <K 0 >m(a))=o. 

Substituting (40) (P = 0) and (54) into (97) yields 

2 2 

O 0 (a) = In C + lna + ^y a 2 — ^y > 
a 8a 

1V 0 (a) = Ca exp a 2 - a 4 J. 

Substituting (40) (P = 0) and (99) into (54) one gets from (95) 


(97) 

(98) 

(99) 


$1 (a,0) 


32coo" 


;(12cra 2 + 4ar a 4 —yar a 6 ) sin 2 0 


+ ^2(7? 6 - 4 a 4 ) sin # + <t) 1 o(4 (100) 

Substituting (100) and (40) ($P = 0$) into (96) and averaging, a particular solution for
$\Phi_{10}(a)$ is obtained,

$$ \Phi_{10}(a) = 0. \qquad (101) $$

Thus, the probability density function $W(a, \varphi)$ of amplitude and phase of the autonomous
Van der Pol system (39), ($P = 0$), takes the form

W(a, 0) = Ca exp j^y a 2 - n 4 + - ~^ 2 (12a 2 a 2 + 4co 2 a 4 - yea 2 a 6 ) 

x sin 2 0 4- —^ (ya 6 - 4a 4 ) sin 40 + e 2 • • • j, (102) 

where C is a normalization constant. 


5. Conclusion 

The stochastic averaging method is a useful tool for analysis of nonlinear random 
vibrations. Therefore, the development of SAM has attracted interest from investigators. 







Furthermore, it has been observed that the effect of many nonlinear perturbations 
cannot be described by using the first approximate solution of SAM. It is shown in this 
paper that this difficulty may be overcome by considering higher approximate solutions
of SAM.


Support from the fundamental research project in natural sciences is gratefully
acknowledged.

Transient and stationary response statistics of van der Pol
oscillators subjected to broad band random excitation

A NAESS 1 and B K HEGSTAD 2

1 Faculty of Civil Engineering, The Norwegian Institute of Technology,
Rich. Birkelands v 1a, N-7034 Trondheim, Norway

2 Faculty of Physics and Mathematics, The Norwegian Institute of Technology,
A. Getz v 1, N-7034 Trondheim, Norway

Abstract. The joint probability density function of the state space vector
of a white noise excited van der Pol oscillator satisfies a
Fokker-Planck-Kolmogorov (FPK) equation. The paper describes a numerical procedure
for solving the transient FPK equation based on the path integral solution
(PIS) technique. It is shown that by combining the PIS with a cubic B-spline
interpolation method, numerical solution algorithms can be implemented
giving solutions of the FPK equation that can be made accurate down to
very low probability levels. The method is illustrated by application to two
specific examples of a van der Pol oscillator.

Keywords. Nonlinear vibration; stochastic excitation; random response; 
van der Pol oscillator. 


1. Introduction 

The van der Pol oscillator has been extensively studied over the last several decades, 
and it has come to play the role of one of the archetypes of a nonlinear oscillator. Its
dynamic properties relative to deterministic harmonic excitation are by now well 
mapped out. The available literature on the van der Pol oscillator is voluminous. 
A guide to this literature is provided by Hayashi (1964), Andronov et al (1987) and 
Hagedorn (1988). 

While the reported work on the van der Pol oscillator is extensive, only a very small 
part of it deals with the probabilistic aspects of this oscillator, in particular the response 
statistics when the oscillator is subjected to stochastic excitation, see e.g. Caughey 
(1959), Stratonovich (1967), Piszczek (1977), Spanos (1979), Zhu & Yu (1987). The 
purpose of the present paper is to contribute to the study of the response statistics of the 
van der Pol oscillator when it is subjected to stationary Gaussian white noise. 

Due to its particular nonlinear properties, the dynamic analysis of the van der Pol
oscillator does not easily lend itself to the various perturbation, linearization or similar
techniques. At present there seem to be basically two methodologies available for
studying the response statistics of substantially nonlinear dynamic systems subjected to
random forcing. One is the time domain Monte Carlo simulation method. The other is
based on the theory of Markov processes. In this paper we shall deal exclusively with
methods of the second class. In this class, the methods are based on the assumption that
the forcing term can be obtained by filtering white noise processes. It is then often
possible to rewrite the equation of motion in terms of a Markov vector process whose
probability density function satisfies the Fokker-Planck-Kolmogorov (FPK) equation,
also referred to as the forward Kolmogorov equation, and the backward
Kolmogorov equation (Risken 1989; Wong & Hajek 1985). Solving one of these
equations would provide an extensive probabilistic characterization of the response
process. However, it has turned out that these partial differential equations are in fact
quite difficult to solve. Considerable effort has been invested in setting up numerical
procedures for this purpose, see for instance Wehner & Wolfer (1983), Langley (1985)
and Bergman & Spencer (1992).


2. The dynamic model 

The equation of motion for a van der Pol oscillator driven by white noise can be written 
as follows, after suitable rescaling if necessary, 

$$ \ddot{X}(t) + 2\zeta\left[X(t)^2 - 1\right]\dot{X}(t) + X(t) = (2D)^{1/2} N(t). \qquad (1) $$

Here $\zeta$ and $D$ are positive parameters, determining specific versions of the oscillator.
$N(t)$ denotes a standard, stationary Gaussian white noise, that is, $E[N(t)] = 0$ and
$E[N(t)N(t + \tau)] = \delta(\tau)$.
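
For orientation, the time domain Monte Carlo alternative mentioned in the introduction can be sketched for this model. The following is a minimal illustration, assuming the equation of motion has the form $\ddot{X} + 2\zeta(X^2 - 1)\dot{X} + X = (2D)^{1/2}N(t)$ and using a plain Euler-Maruyama step. It is not the method of this paper, and the function name, the step size and the seed are assumptions made for illustration only.

```python
import math
import random


def simulate_van_der_pol(zeta, D, x0, v0, dt, n_steps, seed=0):
    """Euler-Maruyama sample path of the white noise excited van der Pol oscillator.

    Returns the list of displacements X(0), X(dt), ..., X(n_steps * dt).
    """
    rng = random.Random(seed)
    x, v = x0, v0
    xs = [x]
    noise_scale = math.sqrt(2.0 * D * dt)  # standard deviation of (2D)^(1/2) dB over one step
    for _ in range(n_steps):
        accel = -2.0 * zeta * (x * x - 1.0) * v - x
        x_new = x + v * dt
        v_new = v + accel * dt + noise_scale * rng.gauss(0.0, 1.0)
        x, v = x_new, v_new
        xs.append(x)
    return xs
```

With `D = 0.0` the routine reduces to the deterministic oscillator, and for small `zeta` the sample path should settle onto a nearly sinusoidal limit cycle of amplitude close to 2, which gives a simple sanity check before noise is switched on.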

The response behaviour of the oscillator depends decisively on the numerical values
of the parameters of the problem. Already van der Pol (1927) observed that (1) would
give rise to limit cycle oscillations that are almost sinusoidal functions of time when $\zeta$ is
small compared to 1.0, while large values of $\zeta$ would result in oscillations approaching
a square wave. These highly nonlinear oscillations are often referred to as relaxation
oscillations.

The relaxation oscillation behaviour is in fact also reflected to a certain extent in the
random response of a van der Pol oscillator to stochastic excitation. Early contributions
to the study of white noise excited van der Pol oscillators were given by
Stratonovich (1967), Caughey (1959) and Piszczek (1977). They developed approximation
methods for deriving information on the stationary response statistics.

Numerical simulations have also been used to study the random response of the van
der Pol oscillator, see e.g. Spanos (1979). Applying a generalized cell mapping technique
(Hsu 1987) based on Monte Carlo simulation, Chiu & Hsu (1986) gave results for
the steady-state probability density function of the van der Pol oscillator for a range of
damping parameters $\zeta$. This technique was further developed by Sun & Hsu (1990), to
yield rather complete probabilistic information on the steady-state behaviour of the
oscillator for specific values of the parameters. Recently, Zhu & Yu (1987) presented
a critical review of previous work, pointing out that the stochastic averaging method
used by Stratonovich yields reasonable approximations of the response of a white
noise excited van der Pol oscillator provided that $\zeta \ll 1$.

The key to the calculation of the response statistics of the white noise driven van der
Pol oscillator is the fact that the joint PDF of the state space vector $[X, \dot{X}]^T$ satisfies
a Fokker-Planck-Kolmogorov (FPK) equation. However, no general closed form
solution is known for the case of the van der Pol oscillator. It is therefore necessary to
develop numerical solution strategies that apply to this particular oscillator. This has in
fact turned out to be a rather demanding task. Langtangen (1991) and Bergman
& Spencer (1992) report work on the development of finite element methods. The last
paper presents results from the application of such methods to the van der Pol
oscillator.

In the present paper an alternative solution strategy will be described. This is based 
on the concept of a path integral, which can be exploited to provide a formal solution to 
the FPK equation. The path integral solution (PIS) hinges on the property that the 
associated Markov diffusion process may be assumed to behave locally like 
a Brownian motion. This makes it possible to express the transition probability 
densities (TPD) of the Markov process in closed form for a short time step. The Markov 
character can then be invoked to build a global (in time) solution from local (in time) 
TPDs.

The PIS technique has been used as a basis for developing numerical solution 
methods for the FPK equation of nonlinear dynamic models driven by white noise by 
Wehner & Wolfer (1983) and Naess & Johnsen (1991, 1992). Naess & Johnsen have 
shown that the PIS technique can be used to provide accurate solutions down to very 
low probability levels for a wide range of nonlinear oscillators. 

3. The path integral solution 

Let the vector process $Z(t) = [Z_1(t), Z_2(t)]^T$ ($T$ as superindex denotes transposition) be
defined by $Z_1(t) = X(t)$ and $Z_2(t) = \dot{X}(t)$. The equation of motion (1) can then be
rewritten as

$$ dZ(t) = m[Z(t)]\,dt + Q\,dB(t). \qquad (2) $$

Here

$$ m[Z(t)] = \begin{bmatrix} Z_2(t) \\ -2\zeta\left[Z_1(t)^2 - 1\right]Z_2(t) - Z_1(t) \end{bmatrix}, \qquad (3) $$

$$ Q = \begin{bmatrix} 0 \\ (2D)^{1/2} \end{bmatrix}. \qquad (4) $$

$B(t)$ denotes a standard, real Brownian motion, cf. Risken (1989) or Wong & Hajek
(1985). Equation (2) will be interpreted as an Itô stochastic differential equation. The
solution $Z(t)$ to (2) is a Markov vector process whose TPD $p(z, t|z', t')$ is defined by the
equation

$$ \mathrm{Prob}\{Z(t) \in A \mid Z(t') = z'\} = \int_A p(z, t|z', t')\,dz, \qquad (5) $$


where $A \subset R^2$ is some event, $z, z' \in R^2$, $dz = dz_1\,dz_2$. Prob$\{B \mid T\}$ denotes the conditional
probability of $B$ given $T$. It can be shown that the TPD $p = p(z, t|z', t')$ $(t > t' \ge 0)$ of the
van der Pol oscillator satisfies the FPK equation,

$$ \frac{\partial p}{\partial t} = 2\zeta(z_1^2 - 1)\,p - z_2\frac{\partial p}{\partial z_1} + \left[2\zeta(z_1^2 - 1)z_2 + z_1\right]\frac{\partial p}{\partial z_2} + D\frac{\partial^2 p}{\partial z_2^2}, \qquad (6) $$

with initial condition $p(z, t'|z', t') = \delta(z - z') = \delta(z_1 - z_1')\,\delta(z_2 - z_2')$.

Since clearly Prob$\{Z(t) \in R^2 \mid Z(t') = z'\} = 1$, the TPD satisfies the following
normalization condition

$$ \int_{-\infty}^{\infty}\!\!\int_{-\infty}^{\infty} p(z, t|z', t')\,dz = 1. \qquad (7) $$

Let $f(z, t)$ denote the PDF of the random vector $Z(t)$. If $f(z, t') = w(z)$ for some initial
PDF $w(z)$, then it is recognized from (6) and the relation,

$$ f(z, t) = \int_{-\infty}^{\infty}\!\!\int_{-\infty}^{\infty} p(z, t|z', t')\,w(z')\,dz', \qquad (8) $$

that $f(z, t)$ itself is a solution of (6) satisfying the initial condition $f(z, t') = w(z)$.

The steady-state or stationary solutions $f_s(z)$ of (6), if they exist, are those solutions of
(6) which satisfy $\partial f/\partial t = 0$, or

$$ f_s(z) = \lim_{t\to\infty} f(z, t) = \lim_{t\to\infty} p(z, t|z', t'). \qquad (9) $$


To derive the PIS appropriate for the van der Pol oscillator, one may proceed in
a manner similar to the derivations in Risken (1989). It can then be shown that the TPD
for small values of $\tau$ $(= t - t')$ is given by the following expression, which is correct up to
terms of order $\tau^2$.

$$ p(z, t + \tau|z', t) = \delta(z_1 - z_1' - z_2'\tau)\,\frac{1}{2(\pi D\tau)^{1/2}}\exp\left\{-\frac{1}{4D\tau}\left[z_2 - z_2' + 2\zeta(z_1'^2 - 1)z_2'\tau + z_1'\tau\right]^2\right\}. \qquad (10) $$

It may be noted that the expression for the TPD given by (10) is not unique, cf. Risken
(1989).
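
The structure of the short-time TPD can be made concrete with a small sketch. It assumes the form described above: a delta function that advances the displacement by $z_2'\tau$, multiplied by a Gaussian in the velocity with mean $z_2' - [2\zeta(z_1'^2 - 1)z_2' + z_1']\tau$ and variance $2D\tau$. The function names are hypothetical and only serve the illustration.

```python
import math


def short_time_density_z2(z2, z1_prev, z2_prev, tau, zeta, D):
    """Gaussian factor of the short-time TPD in the velocity variable z2."""
    drift = -2.0 * zeta * (z1_prev**2 - 1.0) * z2_prev - z1_prev
    residual = z2 - z2_prev - drift * tau
    return math.exp(-residual**2 / (4.0 * D * tau)) / (2.0 * math.sqrt(math.pi * D * tau))


def propagated_z1(z1_prev, z2_prev, tau):
    """Location of the delta function in z1 after one short time step."""
    return z1_prev + z2_prev * tau
```

Integrating `short_time_density_z2` over `z2` should give one for any starting point, which is the short-time counterpart of the normalization condition (7).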

Having obtained an explicit expression for the TPD for a short time step, one can
now invoke the Markov property. This allows a TPD over a time interval of arbitrary
length to be expressed in terms of a product of short-time TPDs. By dividing a given
time interval $(t', t)$ into $N$ small time intervals of length $\tau = (t - t')/N$, it is found that
($t_j = t' + j\tau$, $t = t_N$, $z = z^{(N)}$, $t' = t_0$, $z' = z^{(0)}$)

$$ p(z, t|z', t') = \int\cdots\int \prod_{j=1}^{N} p(z^{(j)}, t_j|z^{(j-1)}, t_{j-1})\,dz^{(1)}\cdots dz^{(N-1)}. \qquad (11) $$

Similarly, with an initial PDF $f(z', t') = w(z')$, the PDF $f(z, t)$ will be given by

$$ f(z, t) = \int\cdots\int \prod_{j=1}^{N} p(z^{(j)}, t_j|z^{(j-1)}, t_{j-1})\,w(z^{(0)})\,dz^{(0)}\cdots dz^{(N-1)}. \qquad (12) $$

Hence, by combining (10) with (11) or (12), a formal (approximate) solution of the
FPK equation can be written down. Equations (11) and (12), which are often referred to
as PIS, constitute the core of the numerical solution procedure to be described
subsequently. It is realized that a numerical solution according to this method
automatically provides the evolution in time of the (conditional) PDF of the Markov
process $Z(t)$ from given start conditions in terms of an initial density $f(z', t') = w(z')$,
including the degenerate case $f(z', t') = \delta(z' - z_0)$, for some starting point $z_0$. We may
therefore study directly how the response statistics develop in time from specified
initial conditions until the oscillator reaches the steady-state response behaviour.
Obviously, the closer the initial PDF is to the stationary PDF, the faster the oscillator
settles down to the steady-state behaviour.
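
The composition of short-time TPDs can be illustrated in the simplest possible setting. The sketch below is an assumption-laden toy case, not the van der Pol problem: it takes a scalar Brownian motion with diffusion $(2D)^{1/2}$ and no drift, for which the TPD over any time step is Gaussian with variance $2D\tau$, and composes two steps of length $\tau$ by integrating over the intermediate state. The result should reproduce the TPD over $2\tau$. All function names are hypothetical.

```python
import math


def brownian_tpd(x, x_prev, tau, D):
    """TPD of a driftless scalar diffusion with variance 2*D*tau over time tau."""
    var = 2.0 * D * tau
    return math.exp(-((x - x_prev) ** 2) / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def two_step_tpd(x, x_prev, tau, D, n=4000, half_width=10.0):
    """Compose two steps of length tau by integrating over the intermediate state."""
    sigma = math.sqrt(2.0 * D * tau)
    centre = 0.5 * (x + x_prev)  # the product of the two kernels peaks here
    lower = centre - half_width * sigma
    dy = 2.0 * half_width * sigma / n
    total = 0.0
    for i in range(n):
        y = lower + (i + 0.5) * dy  # midpoint rule over the intermediate state
        total += brownian_tpd(x, y, tau, D) * brownian_tpd(y, x_prev, tau, D)
    return total * dy
```

For example, `two_step_tpd(0.3, 0.0, 0.05, 0.1)` should agree closely with `brownian_tpd(0.3, 0.0, 0.1, 0.1)`. For a nonlinear oscillator the short-time kernel is only approximate, so the same composition is applied over many small steps.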

It is also worth noting how the PIS relates to the physics of the dynamic model, 
which is expressed through the coefficient vectors $m(\cdot)$ and $Q$, cf. (2). The evolution in
time of the PDF as expressed by the PIS, is seen to be directly determined by these 
coefficients in an explicit manner. This fact is a very important advantage of the PIS 
method, and reveals its fundamental physical significance. 


4. Numerical implementation 


The Chapman-Kolmogorov equation expressed as

$$ p(z^{(j)}, t_j|z', t') = \int_{-\infty}^{\infty}\!\!\int_{-\infty}^{\infty} p(z^{(j)}, t_j|z^{(j-1)}, t_{j-1})\,p(z^{(j-1)}, t_{j-1}|z', t')\,dz_1^{(j-1)}\,dz_2^{(j-1)}, \qquad (13) $$

constitutes the basis for the iterative numerical implementation of the PIS. 

The numerical estimates of the TPDs are calculated on a discrete set of grid points in 
state space. In an effort to retain accuracy with a relatively coarse grid, the expected 
smoothness of the TPDs was invoked. This would justify the introduction of an 
interpolation and smoothing procedure. For the purpose at hand, it was found that 
application of cubic B-splines (de Boor 1978) offered the desired accuracy and 
smoothness. This procedure was used as follows. At each time step $t_{j-1} \to t_j$,
$p(z^{(j-1)}, t_{j-1}|z', t')$ is represented as a cubic B-spline series in the following manner,

$$ p(z^{(j-1)}, t_{j-1}|z', t') = \sum_{k_1=1}^{M_1}\sum_{k_2=1}^{M_2} \Gamma^{(j-1)}(k_1, k_2)\,B_{k_1}(z_1^{(j-1)})\,B_{k_2}(z_2^{(j-1)}), \qquad (14) $$

$M_i$ = number of grid points for the $i$th state variable $z_i$. Assuming that each set
$\{B_{k_i}(\cdot)\}_{k_i=1}^{M_i}$, $i = 1, 2$, is a basis of cubic B-splines associated with the knot sequence
(de Boor 1978) determined by the grid points for the $i$th variable $z_i$, it can be shown that
$\{B_{k_1}(\cdot)B_{k_2}(\cdot)\}$, $k_i = 1, \ldots, M_i$, constitutes a basis of cubic B-splines for two-dimensional problems.
$\{\Gamma^{(j-1)}(k_1, k_2)\}$, $k_i = 1, \ldots, M_i$, is the set of interpolation coefficients associated with time $t_{j-1}$.
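
A tensor-product B-spline series of this kind can be evaluated with a few lines of code. The sketch below is only an illustration under simplifying assumptions: it uses cardinal cubic B-splines on uniform grids, whereas the knot sequences in the paper follow de Boor (1978) and need not be uniform, and the function names are hypothetical. Note that the coefficients of such a series are not the function values at the grid points; obtaining them from grid values requires solving an interpolation system, which is not shown here.

```python
def cubic_bspline(u):
    """Cardinal cubic B-spline centred at 0 with unit knot spacing (support |u| < 2)."""
    u = abs(u)
    if u < 1.0:
        return 2.0 / 3.0 - u * u + 0.5 * u**3
    if u < 2.0:
        return (2.0 - u) ** 3 / 6.0
    return 0.0


def basis(k, z, z_min, spacing):
    """k-th basis function on the uniform grid z_min + k * spacing."""
    return cubic_bspline((z - z_min) / spacing - k)


def spline_series(coeff, z1, z2, grid1, grid2):
    """Evaluate sum over k1, k2 of coeff[k1][k2] * B_k1(z1) * B_k2(z2).

    grid1 and grid2 are (z_min, spacing) pairs for the two state variables.
    """
    (z1_min, h1), (z2_min, h2) = grid1, grid2
    total = 0.0
    for k1, row in enumerate(coeff):
        b1 = basis(k1, z1, z1_min, h1)
        if b1 == 0.0:
            continue  # at most four basis functions are nonzero in each variable
        for k2, c in enumerate(row):
            total += c * b1 * basis(k2, z2, z2_min, h2)
    return total
```

Away from the grid boundary the basis functions sum to one, so a series with all coefficients equal to one should evaluate to one at interior points. This is a convenient check on an implementation.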

The representation of $p(z^{(j-1)}, t_{j-1}|z', t')$ by B-splines makes it possible to retain high
numerical accuracy even with a fairly coarse basic grid if $p(z^{(j-1)}, t_{j-1}|z', t')$ is not too
singular. By substituting from (14) into (13), we obtain

$$ p(z^{(j)}, t_j|z', t') = \sum_{k_1=1}^{M_1}\sum_{k_2=1}^{M_2} \Gamma^{(j-1)}(k_1, k_2) \int_{-\infty}^{\infty}\!\!\int_{-\infty}^{\infty} p(z^{(j)}, t_j|z^{(j-1)}, t_{j-1})\,B_{k_1}(z_1^{(j-1)})\,B_{k_2}(z_2^{(j-1)})\,dz_1^{(j-1)}\,dz_2^{(j-1)}. \qquad (15) $$

It is seen from (10) that the TPDs do not depend on absolute time, but only on the
time increment. Markov processes whose TPDs have this property are called homogeneous.
It follows that

$$ p(z^{(j)}, t_j|z^{(j-1)}, t_{j-1}) = p(z^{(j)}, \tau|z^{(j-1)}, 0), \quad j = 1, 2, \ldots \qquad (16) $$

which holds for any $t_j - t_{j-1} = \tau \ge 0$.

From (15) and (16) it is seen that for a fixed value of the time increment $\tau$, each of the
integrals on the right hand side of (15) needs to be calculated only once, and can be
stored for repeated use. That is, the following parameters are calculated initially and
stored

$$ B_{k_1 k_2}^{l_1 l_2} = \int_{-\infty}^{\infty}\!\!\int_{-\infty}^{\infty} p(z_{l_1 l_2}^{(j)}, \tau|z^{(j-1)}, 0)\,B_{k_1}(z_1^{(j-1)})\,B_{k_2}(z_2^{(j-1)})\,dz_1^{(j-1)}\,dz_2^{(j-1)}. \qquad (17) $$

Here, the index $l_i$, $i = 1, 2$, refers to grid point number $l_i$ for the state space variable $z_i$. It
may be noted here that due to the properties of the TPD for small time increments $\tau$, the
tensor $B_{k_1 k_2}^{l_1 l_2}$ has a strongly banded character with the elements decreasing rapidly away
from the main diagonal $k_1 = l_1$, $k_2 = l_2$. This has important implications for the
efficiency of the computer program. Let $p_{l_1 l_2}^{(j)} = p(z_{l_1 l_2}^{(j)}, t_j|z', t')$. Then (15) can be
rewritten as

$$ p_{l_1 l_2}^{(j)} = \sum_{k_1=1}^{M_1}\sum_{k_2=1}^{M_2} \Gamma^{(j-1)}(k_1, k_2)\,B_{k_1 k_2}^{l_1 l_2}. \qquad (18) $$

Having calculated the TPD $p(z^{(j)}, t_j|z', t')$ at the grid points by using (18), a spline
interpolation is again carried out and a new set of interpolation coefficients
$\{\Gamma^{(j)}(k_1, k_2)\}$, $k_i = 1, \ldots, M_i$, is calculated. This provides an updated representation of the TPD for time step $j$, cf.
(14). For each time step, the normalization condition (7) is checked. That is, if for the
chosen grid domain $(a_1, b_1) \times (a_2, b_2)$,

$$ \int_{a_1}^{b_1}\!\!\int_{a_2}^{b_2} p(z^{(j)}, t_j|z', t')\,dz_1^{(j)}\,dz_2^{(j)} = \sum_{k_1=1}^{M_1}\sum_{k_2=1}^{M_2} \Gamma^{(j)}(k_1, k_2) \prod_{i=1}^{2}\int_{a_i}^{b_i} B_{k_i}(x)\,dx = q_j \qquad (19) $$

and $q_j \ne 1.0$ within the desired accuracy, then the following replacement is made to
restore the correct normalization.

$$ \Gamma^{(j)}(k_1, k_2)_{\mathrm{new}} = q_j^{-1}\,\Gamma^{(j)}(k_1, k_2)_{\mathrm{old}}. \qquad (20) $$

This normalization check and replacement strategy contributes to producing a very 
stable and accurate numerical procedure provided a proper choice of time step, grid 
size and grid domain has been made. 
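
The iteration described in this section can be sketched in a deliberately simplified setting. The following is not the authors' implementation: it assumes a scalar Ornstein-Uhlenbeck process $dX = -X\,dt + (2D)^{1/2}dB$ instead of the two-dimensional van der Pol model, a Gaussian short-time TPD with mean $x' - x'\tau$ and variance $2D\tau$, and plain rectangle-rule quadrature on the grid in place of the cubic B-spline representation. What it does retain is the precomputed, strongly banded one-step operator that is stored and reused, and the renormalization by $q_j$ after every step. All names and parameter values are assumptions.

```python
import math


def gaussian(x, mean, var):
    return math.exp(-((x - mean) ** 2) / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def build_kernel(grid, tau, D, drift, cutoff=1e-14):
    """Precompute the banded one-step weights (computed once, reused every step)."""
    dx = grid[1] - grid[0]
    kernel = []
    for x_new in grid:
        row = []
        for k, x_old in enumerate(grid):
            weight = gaussian(x_new, x_old + drift(x_old) * tau, 2.0 * D * tau) * dx
            if weight > cutoff:  # entries far from the diagonal are dropped
                row.append((k, weight))
        kernel.append(row)
    return kernel


def pis_step(p, kernel, dx):
    """One Chapman-Kolmogorov step followed by the normalization check."""
    p_new = [sum(weight * p[k] for k, weight in row) for row in kernel]
    q = sum(p_new) * dx  # should stay close to 1 for a well chosen grid
    return [value / q for value in p_new], q


def stationary_moments(D=0.5, tau=0.05, n_steps=200, x_start=1.5, n_grid=81, x_max=4.0):
    """Iterate from a delta function at x_start and return (mean, variance, last q)."""
    dx = 2.0 * x_max / (n_grid - 1)
    grid = [-x_max + i * dx for i in range(n_grid)]
    kernel = build_kernel(grid, tau, D, drift=lambda x: -x)
    # The first step from a delta function is the short-time TPD itself.
    p = [gaussian(x, x_start - x_start * tau, 2.0 * D * tau) for x in grid]
    q = 1.0
    for _ in range(n_steps):
        p, q = pis_step(p, kernel, dx)
    mean = sum(x * v for x, v in zip(grid, p)) * dx
    var = sum((x - mean) ** 2 * v for x, v in zip(grid, p)) * dx
    return mean, var, q
```

For this toy process the time-discretized chain has stationary variance $D/(1 - \tau/2)$, slightly above the exact value $D$, so `stationary_moments()` should return a mean near zero and a variance near 0.513 for the default parameters. The difference from $D$ is a reminder that the time step has to be chosen with care.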


5. Numerical examples 

In the numerical calculations, appropriate choices have to be made for the grid domains 
and their discretizations as well as the corresponding time increments. At present such 
choices are made mostly on the basis of experience and trial and error. We shall not 
enter a detailed discussion of this part of the subject, but rather concentrate on the 
results themselves. More detailed discussions of the numerics of some of the case studies 
presented here are given in Naess & Hegstad (1994). 

As previously stated, the parameter $\zeta$ plays a crucial role in determining the dynamic
behaviour of the van der Pol oscillator. For small values of $\zeta$ (compared to 1.0), which
correspond to an almost linear oscillator with natural frequency close to unity, the
time history of the response to harmonic excitation is almost sinusoidal when the
driving frequency is in the neighbourhood of unity (harmonic entrainment), cf. Hayashi
(1964). In contrast, for large values of $\zeta$, the response time history approaches a square
wave. To study how this property is carried through when the excitation is random, we
have done computations for a set of conditions with a constant excitation level,
specifically $D = 0.1$, and with increasing $\zeta$-values from 0.05 to 1.0.

Figure 1. Contour plots of the PDF of the state space vector of oscillator response
for $D = 0.1$ and various $\zeta$-values. $\zeta$ = 0.05 (a), 0.1 (b), 0.5 (c), 1.0 (d).

Figure 2. Three-dimensional plots of the PDF of the state space vector of oscillator
response for $D = 0.1$ and various $\zeta$-values. $\zeta$ = 0.05 (a), 0.1 (b), 0.5 (c), 1.0 (d).

To get a clear qualitative picture of how the response statistics change with increasing
values of the $\zeta$-parameter, the results for the stationary PDFs from the calculations are
presented both as contour plots in figure 1 and as three-dimensional plots in figure 2.
From these figures it is clearly recognized how the response changes from a somewhat
diffuse sinusoidal behaviour for small $\zeta$, to a much more focussed limit cycle behaviour
approaching that of a square wave. To wit, from figures 1d and 2d it is seen that for
$\zeta = 1.0$, the oscillator spends most of its time at large excursions and small velocities,
which is precisely what is to be expected for an almost square wave.

To provide more quantitative results for the stationary response statistics of the van
der Pol oscillator, plots of the marginal PDFs for displacement and velocity are given
in figures 3 and 4, respectively. As a partial verification, the calculated PDF of
displacement response for the van der Pol oscillator is compared to the results of
Monte Carlo simulations for two case studies by Naess & Hegstad (1994), and the
agreement was found to be good.

Figure 3. (a) Linear and (b) logarithmic plots of the marginal PDFs of oscillator
displacement response for $D = 0.1$ and various $\zeta$-values. $\zeta$ = 0.05 (solid), 0.1 and 0.5
(dashed), 1.0 (dotted).

Figure 4. (a) Linear and (b) logarithmic plots of the marginal PDFs of oscillator
velocity response for $D = 0.1$ and various $\zeta$-values. $\zeta$ = 0.05 (solid), 0.1 and 0.5
(dashed), 1.0 (dotted).

As previously stated, one of the remarkable features of the PIS technique is the high 
accuracy that can be retained down to very low probability levels. This property has 
been substantiated in several earlier papers, cf. Naess & Johnsen (1991,1992 and 1993). 
A principal experience from this work is that if proper numerical procedures are 
followed, the obtained results are very reliable. To the extent that this conclusion has 
general validity, it is very important. It allows us to provide accurate results relevant for 
extreme response analyses of nonlinear oscillators where no a priori results are 
available. This cannot easily be done by Monte Carlo simulations since it requires at 
present either extrapolation or a prohibitive amount of CPU time. 

It is also interesting to have a look at the transient behaviour of the oscillator. We
have chosen to study the case with parameter values $\zeta = 1.0$ and $D = 0.1$. In figure 5 it is
shown how the PDF of the state space vector develops in time after being released from
the point (2.0, 2.0). That is, the initial PDF is a delta function located at this starting
point. It is seen that the oscillator almost immediately locks on to the limit cycle
oscillations. Numerical studies of situations where the transient behaviour is known
indicate that the PIS method provides accurate estimates also of the transient
response statistics.

Figure 5. Transient behaviour of the PDF of the state space vector of oscillator
response for $D = 0.1$ and $\zeta = 1.0$.

In the last series of numerical results, the effect of increasing excitation level is
illustrated. The $\zeta$-parameter is kept constant at the value 0.1, while the excitation level
runs through the values $D$ = 0.01, 0.1, 1.0 and 10.0. The results are given in figures 6 and
7. As could be expected, when the excitation level is very low, the oscillator displays an
almost linear behaviour. However, as the excitation level increases, the nonlinear,
square wave character of the system comes more to the forefront, although the
oscillations are much less focussed on a limit cycle behaviour than would be the case
with larger values of the $\zeta$-parameter.


6. Conclusions 

The paper has described a general numerical technique for calculating the response
statistics of nonlinear oscillators subjected to broad band random excitation. By
application to a white noise excited van der Pol oscillator, it has been demonstrated
that detailed information about the statistics of this system can be obtained by
implementing the so-called path integral solution (PIS) method. It is pointed out that
the PIS method provides a very robust numerical solution technique that allows
accurate calculation of both transient and stationary response statistics up to response
levels beyond reach for most other solution procedures.

Figure 6. Contour plots of the PDF of the state space vector of oscillator response
for $\zeta = 0.1$ and various $D$-values. $D$ = 0.01 (a), 0.1 (b), 1.0 (c).

Random excitation of nonlinear elastic structures with internal
resonances

R A IBRAHIM and B H LEE

Department of Mechanical Engineering, Wayne State University, Detroit,
MI 48202, USA

Abstract. The influence of random vibration on the design of mechanical 
components has been restricted to the linear theory of small oscillations. 
However, this theory is inadequate and fails to predict the complex 
response characteristics which may be observed experimentally and can 
only be predicted by employing the nonlinear theory. This paper presents 
a brief overview of the basic nonlinear phenomena associated with
nonlinear random vibration.

An example of a clamped-clamped beam under filtered white noise 
excitation in the neighbourhood of 1:1 internal resonance condition is 
considered. Three approaches are employed to examine the response and 
stochastic bifurcation of the beam coupled modes. These are the
Fokker-Planck equation together with closure schemes, Monte Carlo simulation,
and experimental testing. The analytical results are compared with those 
determined by Monte Carlo simulation. It is found that above a critical 
static buckling load the analytical results fail to predict the snap-through 
phenomenon, while both Monte Carlo simulation and experimental results 
reveal the occurrence of snap-through. The bifurcation of second mode is 
studied in terms of excitation level, internal detuning and damping ratios. It 
is found that below the critical load parameter, the response statistics do 
not significantly deviate from normality. Above the critical value, where 
snap-through takes place, the response is strongly non-Gaussian. 

Keywords. Random excitation; nonlinear elastic structures; internal
resonances; complex response characteristics.


1. Introduction 

The nonlinear random vibration of structural elements is one of the serious problems 
of great concern to the designers of mechanical systems. This problem mainly deals 
with the stochastic stability/bifurcation and response statistics. The stochastic stability 
analysis of the equilibrium position is usually carried out on the basis of a linearized 
approximation to the system equations of motion. If the equilibrium is unstable in 
a stochastic sense, the linearized equations do not provide a unique bounded solution.
On the other hand, if the system's inherent nonlinearities are included in the
mathematical modelling, the solution trajectories which emanate from an unstable equilibrium
often end up in bounded limit cycles. Moreover, the nonlinear modelling also allows 
the designer to predict a wide range of complex response characteristics such as 
multiple solutions, jump phenomena, internal resonance, and chaotic motion. The 
system nonlinearity is dictated by several factors including the system’s geometry, 
boundary conditions, and material characteristics. The geometry and boundary 
conditions can result in inertia and stiffness nonlinearities. For example, in a
clamped-clamped beam the nonlinearity is mainly governed by the mid-plane stretching, while
for a cantilever beam the nonlinearity can include inertia and curvature depending on 
the mode in question. These continuous elastic elements are usually described by 
partial differential equations of the form

$$ \mathcal{L}(w(z, t)) = U(z, t), \qquad (1) $$

together with the appropriate boundary conditions, where $\mathcal{L}$ is a linear/nonlinear
operator, $w(z, t)$ is the structure deflection as a function of space coordinate $z$ and time $t$,
and $U(z, t)$ is a random excitation field. To develop methods for evaluating the
nonlinear response to stochastic excitation, an approximate solution of the nonlinear 
partial differential equation can be obtained by expanding the response coordinate 
$w(z, t)$ in terms of a finite number of eigenfunctions

$$ w(z, t) = \sum_{j=1}^{n} \phi_j(z)\,q_j(t), \qquad (2) $$

where $\phi_j(z)$ is the eigenfunction of mode $j$ of the eigenvalue problem obtained by setting
nonlinearities and nonconservative forces to zero, and $q_j(t)$ is the corresponding
generalized coordinate. The partial differential equation (1) can now be reduced to
a system of $n$ nonlinear coupled ordinary differential equations in $q_j(t)$ using the
Ritz-Galerkin approach (Kantorovich & Krylov 1958). The random excitation field
$U(z, t)$ can also be expanded in terms of the system normal modes $\phi_j(z)$

$$ U(z, t) = \sum_{j=1}^{n} \phi_j(z)\,\xi_j(t), \qquad (3) $$

where $\xi_j(t)$ is the generalized force associated with mode $j$. Introducing relations (2) and
(3) into (1) and applying Galerkin's method results in a set of nonlinear ordinary
differential equations of the general form

w++[[K]+Ko mm +(4) 

where [C(r)] and [K] are the linear damping and stiffness matrix coefficients, $\xi_0$
represents the parametric random component, and the vector $\{\psi\}$ represents all
nonlinear terms due to material and structural nonlinearities. K is a constant coefficient.

In problems concerning structure failure, one of the most important questions is
whether the solution, which is the system response to a random perturbation, remains
bounded for all time. In the case of parametric excitation, a more refined question is
whether the equilibrium solutions, if they exist, are asymptotically stable. The method
of stochastic Liapunov functions has proven effective for stochastic ordinary
differential equations (ODEs) and has been extended to a partial differential equation (PDE)
setting. For stochastic ODEs the Liapunov method for stability has been systematically developed
by Khas'minskii (1980), and by many others (see for example Kushner 1967, and
Pinsky 1974). The Liapunov method for stochastic PDEs was initiated by Chow (1982)
to treat a class of parabolic problems. Stochastic stability can be examined in terms of
one of the stochastic modes of convergence, such as mean square stability or almost sure
stability. Recently, mathematicians established a measure of exponential growth of
a system's response, known as the Liapunov exponent, for nonlinear dynamical systems
and for parametrically excited linear systems.

1.1 Single mode random excitation 

The random response of a clamped-clamped beam has been extensively examined 
within the framework of one mode excitation. Nonlinear systems described by one 
mode excitation are usually modelled by the nonlinear differential equation 

$$ \ddot{q} + 2\zeta\omega_n\dot{q} + \omega_n^2 q + \varepsilon\psi(q, \dot{q}, \ddot{q}) = \xi(t) \qquad (5) $$

where $\xi(t)$ is an external excitation, $\psi$ may include nonlinear stiffness, damping and
inertia terms, and $\varepsilon$ is a small parameter. The case of nonlinear stiffness has received an
extensive amount of investigation. The equation of motion of this case is known as
a Duffing oscillator. The early study of single mode response under random excitation
goes back to the work of Lyon et al (1961). They considered $\xi(t)$ as a narrow band
Gaussian process derived from a wide band excitation of a resonant filter centred at
frequency $\omega_1$.

t + ltjco^ + coU=W(t), (6) 

where W ( t ) is a white noise. The purpose of their study was to find out if the well-known 
jump phenomenon can occur under narrow band random excitation. The response was 
studied by using the quasi-linearization method and by experimental testing. The 
results revealed multivalued response characteristics which have the same general 
appearance as those for sinusoidal forcing except that the peaks are much sharper. The 
same feature has been observed by Lennox & Kuak (1976). They used a quasi-static
method with small but finite bandwidth. However, their method does not allow one to
investigate the influence of bandwidth. Fang & Dowell (1987) analysed the response of
a Duffing oscillator to a narrow band random excitation by numerical simulation.
Their results showed that multi-valued mean square responses can occur for mono-level
excitations having very narrow bandwidths. As the bandwidth increases, the
multi-valued responses give way to single valued responses. Similar results were
obtained using many different approaches, such as the averaging method (Roberts &
Spanos 1986; Davies & Rajan 1988; Davies & Liu 1990, 1992), stochastic linearization
(Dimentberg 1971; Iyengar 1989; Roberts & Spanos 1990; Roberts 1991), the probabilistic
linearization technique (Iyengar 1992), numerical simulation (Davies & Rajan 1988;
Iyengar 1989, 1992; Davies & Liu 1990, 1992), and experimental testing (Lyon et al
1961). An alternative approach has been proposed by Davies & Nandlall (1986). They
modelled the excitation as the response of a lightly-damped second order filter to white
noise. Thus a four-dimensional Fokker-Planck-Kolmogorov (FPK) equation can be
written for the response. An approximate, time-dependent solution based on a Gaussian
closure scheme was obtained from the four-dimensional FPK equation. The result
was smoothed time histories for the mean square displacement and velocity. When
plotted in a phase plane, the solution showed two stable attractors and a saddle point
for the case in which the excitation had very narrow bandwidth, in a certain frequency
range. This behaviour is related to the well-known jump phenomenon. As the bandwidth
of the excitation increased, the phase plane was reduced to a single stable
attractor (sink), indicating the elimination of the jump phenomenon.
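
The multivalued response under sinusoidal forcing, to which the random results above are compared, can be illustrated with a short sketch. It is not the quasi-linearization or closure analysis of the cited authors; it assumes a cubic stiffness term $\varepsilon q^3$ in equation (5), a harmonic force $F\cos\Omega t$, and the standard first-order harmonic balance relation $a^2[(\omega_n^2 - \Omega^2 + \tfrac{3}{4}\varepsilon a^2)^2 + (2\zeta\omega_n\Omega)^2] = F^2$. The function name and the parameter values in the usage note are hypothetical.

```python
def duffing_amplitudes(omega_n, zeta, eps, force, omega, n_grid=20000):
    """Steady-state amplitudes a > 0 predicted by first-order harmonic balance for
    q'' + 2*zeta*omega_n*q' + omega_n**2*q + eps*q**3 = force*cos(omega*t).
    Assumes zeta, omega_n, omega and force are all positive."""
    delta = omega_n ** 2 - omega ** 2
    c = 2.0 * zeta * omega_n * omega

    def g(A):
        # Residual of the amplitude equation written in terms of A = a**2.
        return A * ((delta + 0.75 * eps * A) ** 2 + c ** 2) - force ** 2

    # Any root satisfies A * c**2 <= force**2, so this interval contains all roots.
    a_max = 1.01 * (force / c) ** 2
    roots = []
    prev_A, prev_g = 0.0, g(0.0)
    for k in range(1, n_grid + 1):
        A = a_max * k / n_grid
        cur_g = g(A)
        if prev_g * cur_g < 0.0:
            # Sign change found: refine the root by bisection.
            lo, hi, g_lo = prev_A, A, prev_g
            for _ in range(60):
                mid = 0.5 * (lo + hi)
                g_mid = g(mid)
                if g_lo * g_mid <= 0.0:
                    hi = mid
                else:
                    lo, g_lo = mid, g_mid
            roots.append((0.5 * (lo + hi)) ** 0.5)
        prev_A, prev_g = A, cur_g
    return roots
```

For a hardening oscillator with, say, `omega_n=1.0, zeta=0.01, eps=1.0, force=0.1`, the call at `omega=1.3` returns three amplitudes (the two outer ones correspond to the competing stable states between which jumps occur), while at `omega=0.5` only one amplitude exists.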

The FPK equation for a Duffing oscillator subjected to filtered excitation has been solved
numerically by Kapitaniak (1985) using the path-integral method. He presented the
stationary probability density function of the stochastic process as bimodal. Davies
& Liu (1990) investigated the same system via the averaging method and numerical
simulation. They showed that the probability density function obtained for narrow
bandwidth excitation has multiple local maxima (multifurcation) with random jumps.
For wider bandwidth excitation the probability density function shows only
a single local maximum. A similar result was obtained by Iyengar (1992) using numerical
simulation. Koliopulos & Bishop (1993) studied a method for the prediction of
response statistics of nonlinear oscillators subjected to narrow band random noise
based on a quasi-harmonic assumption. They showed that the method can easily
incorporate the possible jumps of the system via the input-response amplitude curve.
This curve defines critical input values which bound the region of possible jumps. In this
case the probability density of the response amplitude exhibits two peaks.

Iyengar (1986, 1988) has presented a stochastic stability analysis of a nonlinear 
hardening-type oscillator subjected to narrow band random excitation. He showed 
that only one statistical moment is stable and, hence, the multivalued predictions of the 
equivalent linearization analysis do not correspond to the real behaviour of the system. 
Koliopulos & Langley (1993) improved the stability analysis for the same system. They 
showed that when jumps between competing response states occur, the choice of an 
appropriate value for the kurtosis of the response is crucial for a reliable estimation of 
the local statistical moments and for a more accurate stochastic stability analysis than 
the equivalent linearization method. Furthermore, a modification is proposed which 
takes into account the influence of certain nonlinear terms in the variational equation 
for the case of strong nonlinearities. 


1.2 Multi-mode random excitation 

The case of a multi-degree-of-freedom system involves new problems in addition to those
encountered for single mode excitation. The new problems mainly arise due to the
nonlinear modal interaction, which is only significant if the normal mode frequencies
are commensurable, i.e. when the normal mode frequencies are governed by the
internal resonance condition $\sum_i k_i\omega_i = 0$, where $k_i$ are integers and $\omega_i$ are the system's
natural frequencies. Under this condition, the mode which is directly excited interacts
with other modes in the form of energy exchange. This type of modal interaction is
referred to as autoparametric (Minorsky 1962). In the absence of internal resonances,
the response is dominated only by the directly excited modes. The energy sharing in
random vibration of nonlinearly coupled modes is believed to have been first addressed by
Newland (1965). The random response of two degree-of-freedom systems with
autoparametric coupling was examined by Ibrahim & Roberts (1976, 1977), Schmidt
(1977a, b), Ibrahim & Heo (1986), Soundararajan & Ibrahim (1988), Li & Ibrahim
(1989), Ibrahim et al (1990), and Nayfeh & Serhan (1991). These authors considered
different nonlinear coupled oscillators such as coupled beams, liquid sloshing interaction
with elastic support, shallow arches, strings and hinged-clamped beams. For
systems governed by quadratic nonlinearity it was found that the Gaussian closure
scheme leads to nonstationary response statistics while the non-Gaussian closure gives
a stationary response to a white noise excitation. Systems with cubic nonlinearity
exhibit complex response characteristics in the neighbourhood of the condition of
internal resonance. It was reported that unbounded response statistics take place at
certain regions above and below the exact internal resonance. For regions well remote 
from the exact internal resonance condition the system experienced linear response 
behaviour. These results were qualitatively verified experimentally by Roberts (1980), 
Ibrahim & Sullivan (1990), and Ibrahim et al (1990). In some cases the experimental 
results revealed an uncertain region of excitation level over which the coupled mode 
may or may not interact with the directly excited mode. 
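
As a small illustration of the internal resonance condition $\sum_i k_i\omega_i = 0$ discussed above, the following sketch searches for integer sets $k_i$ that satisfy it for a given list of natural frequencies. The function name, the search bound `max_order` and the tolerance are assumptions introduced here for illustration only.

```python
from itertools import product


def internal_resonances(freqs, max_order, tol=1e-6):
    """Integer sets (k_1, ..., k_n) with |k_i| <= max_order and sum(k_i * w_i) = 0
    to within tol. The trivial all-zero set is skipped, and each relation is
    reported once (first non-zero integer positive)."""
    found = []
    for ks in product(range(-max_order, max_order + 1), repeat=len(freqs)):
        nonzero = [k for k in ks if k != 0]
        if not nonzero or nonzero[0] < 0:
            continue  # trivial set, or the sign-flipped copy of another set
        if abs(sum(k * w for k, w in zip(ks, freqs))) < tol:
            found.append(ks)
    return found
```

With two equal frequencies, `internal_resonances([1.0, 1.0], 1)` reports the single relation `(1, -1)`, i.e. the 1:1 internal resonance. Frequencies that are not commensurable within the chosen order return an empty list.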

Ibrahim & Li (1988) and Li & Ibrahim (1990) considered the nonlinear interaction of
a three degree-of-freedom structural system subjected to a wide-band random excitation.
The nonlinearity of this system resulted in different critical regions of internal
resonance. The response statistics were predicted by using Gaussian and non-Gaussian
closure schemes, and Monte Carlo simulation. While the non-Gaussian closure
predicted multiple solutions in the neighbourhood of exact internal resonance, the
Monte Carlo simulation yielded only one branch of the stationary solution, corresponding
to a zero set of initial conditions. This implies the existence of domains of attraction
corresponding to the stationary attractors of the system.

Nonlinear continuous systems, not governed by internal resonance conditions, such
as beams and plates subjected to random excitation, can exhibit new phenomena not
found in discrete nonlinear systems. Mei & Wentz (1982), Moyer (1988) and Reinhall
& Miles (1989) showed a new response behaviour which takes the form of a definite
widening of the response spectra, as well as a shift of the resonance peak, as the excitation
level is increased. These features can only be predicted using Monte Carlo simulation
or experimental testing. All well-known analytical methods failed to predict these
phenomena. This motivated Ibrahim et al (1993) to study the stochastic bifurcation of
a clamped-clamped beam under wide band random excitation in the neighbourhood of
the internal resonance condition aj 2 = co { + 2 co 2 , which is mainly governed by the 
value of the pre-applied axial static load. They used Gaussian and non-Gaussian closures,
Monte Carlo simulation and experimental testing. The nonlinear response is represented
by the first three modes. In the neighbourhood of the critical bifurcation parameter
(axial static load/Euler buckling load) the Monte Carlo simulation yields a strongly
nonstationary mean square response for the asymmetric mode which is not directly
excited. Experimental results and Monte Carlo simulation exhibit nonlinear features similar
to those observed by Mei & Wentz (1982) and others as described previously. The
estimated probability density functions, from numerical simulation, displayed significant
deviation from normality. Furthermore, the response revealed a non-zero mean
under zero mean excitation. This non-zero mean was attributed to the fact that the
straight beam is under initial axial load. It was also reported that large initial
conditions resulted in numerical instability and only small initial conditions were used
to generate bounded solutions.

This paper presents an example which is an extension of the work of Ibrahim et al
(1993) to examine the stochastic behaviour of a buckled clamped-clamped beam in the
neighbourhood of 1:1 internal resonance. In this case the treatment will be confined to
the nonlinear interaction of the first two modes. The excitation is considered to be narrow
band and is generated from a linear second order shaping filter subjected to a white
noise. The filter central frequency is selected to be close to the natural frequency of the
excited mode. Three different approaches will be employed: closure
schemes, Monte Carlo simulation and experimental testing.


2. Example 

2.1 Analytical model and analysis 


Figure 1 shows a clamped-clamped beam which is buckled due to an axial load $P > P_{cr}$,
where $P_{cr}$ is the Euler buckling load. Under this load the beam will have a static
deflection $U_0(x)$. The base excitation is assumed to be a filtered white noise process. The
motion of the beam is governed by the nonlinear partial differential equation, which is
derived by using Hamilton's principle:

$$m\frac{\partial^2 U(x,t)}{\partial t^2} + EI\frac{\partial^4 \left(U(x,t)+U_0(x)\right)}{\partial x^4} + P\frac{\partial^2 \left(U(x,t)+U_0(x)\right)}{\partial x^2} - \frac{EA}{2L}\frac{\partial^2 \left(U(x,t)+U_0(x)\right)}{\partial x^2}\int_0^L \left\{\frac{\partial \left(U(x,t)+U_0(x)\right)}{\partial x}\right\}^2 dx = -m\frac{d^2Y(t)}{dt^2}, \qquad (7)$$


where $U(x,t)$ is the beam deflection at position $x$ and time $t$ measured from $U_0(x)$; $E$ is
Young's modulus of the beam; $I$ is the area moment of inertia in bending; and $A$ is the
cross-sectional area of the beam. The static axial load $P$ exceeds the Euler buckling load
$P_{cr}$, i.e.:

$$P = P_{cr} + \frac{EA}{2L}\int_0^L \left(\frac{dU_0(x)}{dx}\right)^2 dx. \qquad (8)$$


The deflection of the beam is expanded into $n$ modes,

$$U_0(x) + U(x,t) = C V_0 \Phi_1(x) + \sum_{i=1}^{n} q_i(t)\Phi_i(x), \qquad (9)$$

where $V_0$ is the initial deflection of the beam at $x = L/2$; $C = 1/\Phi_1(L/2)$; $\bar{q}_i = q_i/h$; $q_i(t)$
are the generalized coordinates; and $\Phi_i(x)$ are the corresponding shape functions which


Figure 1. Schematic diagram of initially buckled beam.
The departure from the exact internal resonance is expressed by the internal detuning
parameter $\Delta$, defined by the relationship:

$$r = 1 + \varepsilon^2\Delta. \qquad (12)$$
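
A short numerical check of relation (12) may help when reading the parameter values quoted later in the paper, where an internal detuning of −4 is said to correspond to $r = 0.99$. The sketch below simply evaluates (12) and its inverse. The function names are hypothetical, and the resulting value of the small parameter (0.05) is an inference from those two quoted numbers, not a value stated next to equation (12).

```python
import math


def frequency_ratio(eps, delta):
    """Frequency ratio r from equation (12): r = 1 + eps**2 * delta."""
    return 1.0 + eps ** 2 * delta


def small_parameter(r, delta):
    """Small parameter implied by a given frequency ratio and detuning,
    obtained by inverting equation (12). Requires (r - 1) / delta > 0."""
    return math.sqrt((r - 1.0) / delta)
```

For example, `small_parameter(0.99, -4.0)` gives a value of about 0.05, and `frequency_ratio(0.05, -4.0)` recovers 0.99.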


The external excitation is assumed to be a Gaussian white noise process whose correlation
time is much smaller than any characteristic period of oscillation of the system. In this
case the response can be approximated by a Markov state vector which is established
through the coordinate transformation:

$$\{Q_1, Q_1', Q_2, Q_2', z, z'\} = \{X_1, X_2, X_3, X_4, X_5, X_6\} = \mathbf{X}.$$

If the random process $W(\tau)$ is replaced by the formal derivative of the Brownian process
$B(\tau)$, i.e., $W(\tau) = \sigma\,dB(\tau)/d\tau$, where $\sigma^2 = 2D$, equations (11) can be transformed into the
Ito type equations


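$$\begin{aligned}
dX_1 &= X_2\,d\tau, \qquad dX_3 = X_4\,d\tau, \qquad dX_5 = X_6\,d\tau, \\
dX_2 &= \{-2\zeta_1 r X_2 - r^2 X_1 - \varepsilon r^2(1.5X_1^2 + 1.804X_3^2) - \varepsilon^2 r^2(1.804X_1X_3^2 + 0.5X_1^3) - f_a X_5\}\,d\tau, \\
dX_4 &= \{-2\zeta_2 X_4 - X_3 - \varepsilon r^2(3.74X_1X_3) - \varepsilon^2 r^2(1.8687X_1^2X_3 + 6.74X_3^3)\}\,d\tau, \\
dX_6 &= (-2\zeta_f r_f X_6 - r_f^2 X_5)\,d\tau + \varepsilon^2\sigma\,dB(\tau).
\end{aligned} \qquad (13)$$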

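The joint probability density $p(\mathbf{X}, \tau)$ of the response coordinates is described by the
Fokker-Planck equation

$$\frac{\partial p(\mathbf{X},\tau)}{\partial \tau} = -\sum_{i=1}^{6}\frac{\partial}{\partial X_i}\left[a_i(\mathbf{X},\tau)\,p(\mathbf{X},\tau)\right] + \frac{1}{2}\sum_{i=1}^{6}\sum_{j=1}^{6}\frac{\partial^2}{\partial X_i\,\partial X_j}\left[b_{ij}(\mathbf{X},\tau)\,p(\mathbf{X},\tau)\right], \qquad (14)$$

where the coefficients $a_i$ and $b_{ij}$ are the drift and diffusion coefficients, defined by the
relationships

$$a_i(\mathbf{X},\tau) = \lim_{\Delta\tau\to 0}\frac{1}{\Delta\tau}E\left[X_i(\tau+\Delta\tau) - X_i(\tau)\right],$$

$$b_{ij}(\mathbf{X},\tau) = \lim_{\Delta\tau\to 0}\frac{1}{\Delta\tau}E\left[\{X_i(\tau+\Delta\tau) - X_i(\tau)\}\{X_j(\tau+\Delta\tau) - X_j(\tau)\}\right]. \qquad (15)$$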

In view of the system nonlinearity, it is not possible to solve the system Fokker-Planck
equation for the response probability density. Instead, one may derive a general
first-order differential equation for the response dynamic moments. This equation is
obtained by multiplying both sides of (14) by the scalar function
$\Phi(\mathbf{X}) = X_1^{k_1}X_2^{k_2}X_3^{k_3}X_4^{k_4}X_5^{k_5}X_6^{k_6}$ and integrating by parts over the entire state space
$-\infty < \mathbf{X} < \infty$. This gives the general first-order differential moment equation


$$\begin{aligned}
m'_{k_1,k_2,k_3,k_4,k_5,k_6} ={}& k_1\, m_{k_1-1,k_2+1,k_3,k_4,k_5,k_6} \\
&+ k_2\{-2\zeta_1 r\, m_{k_1,k_2,k_3,k_4,k_5,k_6} - r^2 m_{k_1+1,k_2-1,k_3,k_4,k_5,k_6} \\
&\qquad - \varepsilon r^2(1.5\, m_{k_1+2,k_2-1,k_3,k_4,k_5,k_6} + 1.804\, m_{k_1,k_2-1,k_3+2,k_4,k_5,k_6}) \\
&\qquad - \varepsilon^2 r^2(1.804\, m_{k_1+1,k_2-1,k_3+2,k_4,k_5,k_6} + 0.5\, m_{k_1+3,k_2-1,k_3,k_4,k_5,k_6}) \\
&\qquad - f_a\, m_{k_1,k_2-1,k_3,k_4,k_5+1,k_6}\} \\
&+ k_3\, m_{k_1,k_2,k_3-1,k_4+1,k_5,k_6} \\
&+ k_4\{-2\zeta_2\, m_{k_1,k_2,k_3,k_4,k_5,k_6} - m_{k_1,k_2,k_3+1,k_4-1,k_5,k_6} \\
&\qquad - \varepsilon r^2(3.74\, m_{k_1+1,k_2,k_3+1,k_4-1,k_5,k_6}) \\
&\qquad - \varepsilon^2 r^2(1.8687\, m_{k_1+2,k_2,k_3+1,k_4-1,k_5,k_6} + 6.74\, m_{k_1,k_2,k_3+3,k_4-1,k_5,k_6})\} \\
&+ k_5\, m_{k_1,k_2,k_3,k_4,k_5-1,k_6+1} \\
&+ k_6(-2\zeta_f r_f\, m_{k_1,k_2,k_3,k_4,k_5,k_6} - r_f^2\, m_{k_1,k_2,k_3,k_4,k_5+1,k_6-1}) \\
&+ \tfrac{1}{2}k_6(k_6-1)\,\varepsilon^4\sigma^2\, m_{k_1,k_2,k_3,k_4,k_5,k_6-2},
\end{aligned} \qquad (16)$$

where

$$m_{k_1,k_2,k_3,k_4,k_5,k_6} = \int_{-\infty}^{\infty}\cdots\int_{-\infty}^{\infty} X_1^{k_1}X_2^{k_2}X_3^{k_3}X_4^{k_4}X_5^{k_5}X_6^{k_6}\; p(\mathbf{X},\tau)\; dX_1\cdots dX_6. \qquad (17)$$


Equation (16) constitutes an infinite set of coupled moment equations. In order to solve
for the response statistics, these equations must be truncated by an appropriate
closure scheme. Gaussian and first-order non-Gaussian closure schemes are
considered. The Gaussian closure is carried out by setting third- and fourth-order
cumulants to zero and expressing third- and fourth-order moments in terms of first- and
second-order moments. In this case, the Gaussian closure requires generating 27
coupled first-order differential moment equations. The first-order non-Gaussian closure
is carried out by setting fifth- and sixth-order cumulants to zero. For the present
problem one needs to generate 6 equations for the first-order moments, 21 equations for
the second-order, 56 equations for the third-order, and 126 equations for the fourth-order
moments, a total of 209 first-order differential equations. The differential equations
are solved numerically by using the IMSL DIVPAG subroutine (Adams-Moulton or
Gear numerical integration method). The numerical integration is performed in
double precision using time increment $\Delta\tau = 0.2$ and integration accuracy 0.1D-03.
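
The equation counts quoted above follow from counting the distinct moments of each order in six state variables, and the Gaussian closure step amounts to replacing third- and fourth-order moments by combinations of means and covariances. The sketch below illustrates both points. It is not the authors' program: the function names are hypothetical, and the closure formulas are the standard expressions that result from setting third- and fourth-order cumulants to zero.

```python
from math import comb


def n_moments(order, n_states=6):
    """Number of distinct joint moments of a given order in n_states variables
    (combinations with repetition)."""
    return comb(order + n_states - 1, n_states - 1)


def gaussian_third_moment(mu, cov, i, j, k):
    """E[Xi Xj Xk] when the third-order cumulant is set to zero."""
    return (mu[i] * mu[j] * mu[k]
            + mu[i] * cov[j][k] + mu[j] * cov[i][k] + mu[k] * cov[i][j])


def gaussian_fourth_moment(mu, cov, i, j, k, l):
    """E[Xi Xj Xk Xl] when third- and fourth-order cumulants are set to zero."""
    return (cov[i][j] * cov[k][l] + cov[i][k] * cov[j][l] + cov[i][l] * cov[j][k]
            + mu[i] * mu[j] * cov[k][l] + mu[i] * mu[k] * cov[j][l]
            + mu[i] * mu[l] * cov[j][k] + mu[j] * mu[k] * cov[i][l]
            + mu[j] * mu[l] * cov[i][k] + mu[k] * mu[l] * cov[i][j]
            + mu[i] * mu[j] * mu[k] * mu[l])
```

Here `n_moments(1) + n_moments(2)` reproduces the 27 equations of the Gaussian closure, and summing `n_moments(n)` for orders 1 to 4 reproduces the 209 equations of the first-order non-Gaussian closure.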

The first mode is excited by a filtered white noise whose centre frequency is very close to
the first mode natural frequency ($r_f \approx r$). The bandwidth of this random process is selected
such that it does not excite any other higher modes. The system response is then governed
mainly by the first mode. However, the second mode can be excited through nonlinear
modal interaction near the internal resonance condition. Figure 3 shows time history records
of the mean square responses as obtained by Gaussian and non-Gaussian closures for
internal detuning $\Delta = -4.0$, which corresponds to $r = 0.99$. These solutions
are obtained for excitation spectral density $(\varepsilon^2\sigma)^2 = 0.0125$, damping ratios $\zeta_1 = \zeta_2 = 0.005$,
and filter parameter $\zeta_f = 0.2$. The figure shows that the non-Gaussian solution is slightly
higher than the Gaussian solution. For the same initial conditions the two schemes yield
different transient periods. Different excitation levels, internal detuning parameters, and
filter parameters may result in unimodal response. Accordingly, the stochastic bifurcation
of the second mode of the two solutions depends to a great extent on the excitation level and
system parameters such as the system damping ratios and internal detuning parameter $\Delta$. The
bifurcation behaviour also depends on the damping ratio of the linear shaping filter
equation, which is related to the bandwidth of the filtered excitation. Figure 4 shows
that the second mode bifurcation takes place at different excitation levels for
different damping ratios of the linear shaping filter. If the damping ratio of the linear
shaping filter increases, a relatively higher excitation level is required for the second
mode bifurcation. For a very small filter damping ratio, such as $\zeta_f = 0.05$, the
non-Gaussian closure was found to encounter numerical instability.

Figure 4. Effect of excitation level on response mean squares for different filter
damping ratios: $\zeta_f = 0.05$ (△ Gaussian); $\zeta_f = 0.1$ (○ Gaussian, ● non-Gaussian);
$\zeta_f = 0.2$ (□ Gaussian, ■ non-Gaussian).

The effect of the internal detuning parameter $\Delta$ on the response mean square, as
determined by the two closures, is shown in figure 5 for damping ratios $\zeta_1 = \zeta_2 = 0.005$,
excitation level $(\varepsilon^2\sigma)^2 = 0.005$ and filter damping ratio $\zeta_f = 0.2$. This figure demonstrates
that in the neighbourhood of exact internal resonance the energy is transferred
from the externally excited mode to the second mode. It is found that mixed mode
interaction takes place over a wider range of the internal detuning parameter for the
non-Gaussian closure solution than for the Gaussian solution. It is seen that the mean
squares are not symmetric about the exact internal resonance condition due to the
presence of cubic nonlinearity.

The dependence of the mean square response on the excitation level is shown in
figure 6 for internal detuning parameter $\Delta = -4$ ($r = 0.99$) and three different system
damping ratios $\zeta_1 = \zeta_2 = 0.003$, 0.005, and 0.007. The damping ratio of the linear shaping
filter has a fixed value of $\zeta_f = 0.2$. It is seen that if the system damping ratios increase,
the second mode requires a higher excitation level to bifurcate. For the same system
damping ratios, the Gaussian closure solution yields second mode bifurcation at
a higher excitation level than the non-Gaussian closure. At relatively
low excitation level the response is unimodal and the first mode mean square is
governed by a linear relationship with the excitation level.

Figure 5. Dependence of mean square response on the internal detuning for the
Gaussian closure (○) and the non-Gaussian closure.

Figure 6. Mean square response vs excitation level for different system damping
ratios: $\zeta_1 = \zeta_2 = 0.003$ (× Gaussian, △ non-Gaussian); $\zeta_1 = \zeta_2 = 0.005$
(○ Gaussian, ● non-Gaussian); $\zeta_1 = \zeta_2 = 0.007$ (□ Gaussian, ■ non-Gaussian).

The bifurcation boundaries for three different system damping ratios on the $\sigma^2$-$\Delta$
plane, showing the separation between single and mixed mode responses, are plotted in
figures 7 and 8. Figures 7 and 8 are obtained for $\varepsilon = 0.005$, $\zeta_f = 0.2$, and for three
different system damping ratios, $\zeta_1 = \zeta_2 = 0.003$, 0.005, and 0.007, as solved by Gaussian
and non-Gaussian closures, respectively. As the system damping ratio
increases, the region of two mode interaction shrinks. Furthermore, the two mode
interaction occurs in the non-Gaussian solutions at a lower excitation power spectral
density level than in the Gaussian solution. For $\Delta > -2$, both solutions
predict only unimodal response of the directly excited mode.

2.2 Monte Carlo simulation 

Numerical simulation of nonlinear vibration problems requires a large number of
numerical integrations of the actual equations of motion in order to establish an
adequate ensemble record of the response characteristics. The number of excitation
records is dictated by the system complexity and the convergence of response statistics
to a stationary state. The method first requires the system equations of motion to be
written in the state vector form

$$\begin{aligned}
X_1' &= X_2, \qquad X_3' = X_4, \qquad X_5' = X_6, \\
X_2' &= -2\zeta_1 r X_2 - r^2 X_1 - \varepsilon r^2(1.5X_1^2 + 1.804X_3^2) - \varepsilon^2 r^2(1.804X_1X_3^2 + 0.5X_1^3) - f_a X_5, \\
X_4' &= -2\zeta_2 X_4 - X_3 - \varepsilon r^2(3.74X_1X_3) - \varepsilon^2 r^2(1.8687X_1^2X_3 + 6.74X_3^3), \\
X_6' &= -2\zeta_f r_f X_6 - r_f^2 X_5 + W(\tau).
\end{aligned} \qquad (18)$$

Figure 7. Bifurcation diagram according to Gaussian closure: $\zeta_1 = \zeta_2 = 0.003$ (△),
$\zeta_1 = \zeta_2 = 0.005$ (○), $\zeta_1 = \zeta_2 = 0.007$ (□).

Figure 8. Bifurcation diagram according to non-Gaussian closure: $\zeta_1 = \zeta_2 = 0.003$ (△),
$\zeta_1 = \zeta_2 = 0.005$ (○), $\zeta_1 = \zeta_2 = 0.007$ (□).
The first mode is excited by a filtered white noise random excitation $X_5(\tau)$ whose
central frequency is very close to the first mode natural frequency. The bandwidth of the
random process $X_5(\tau)$ depends on the damping ratio of the shaping filter, $\zeta_f$. To estimate
the mean square response, five hundred records of the white noise $W(\tau)$ are simulated
and a sequence of 25,000 random numbers is sampled for each record by using the
IMSL subroutine DRNNOR [random number generation from a normal distribution
using an inverse Cumulative Distribution Function (CDF) technique]. The six coupled
differential equations (18) are then numerically solved by the IMSL DIVPAG subroutine.
Response statistics such as the autocorrelation function, probability density function, and
power spectral density function are estimated using the ILS (Interactive Laboratory
System) software and other subroutines.

Figure 9. Time history records of mean square responses according to Monte
Carlo simulation and non-Gaussian closure (dashed curves).
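
The ensemble procedure described in this section can be sketched in a few lines. This is a minimal illustration, not the IMSL-based program used in the paper: it replaces DIVPAG by a semi-implicit Euler-Maruyama step, uses Python's `random` module instead of DRNNOR, and takes the coupling coefficient `f_a` and the noise intensity `noise` (standing for $\varepsilon^2\sigma$) as free inputs, since their defining equations are not reproduced above. All function and dictionary key names are hypothetical.

```python
import math
import random


def accelerations(x, p):
    """Right-hand sides of the X2, X4 and X6 equations of (18), without the noise term."""
    x1, x2, x3, x4, x5, x6 = x
    r, eps = p["r"], p["eps"]
    a1 = (-2.0 * p["zeta1"] * r * x2 - r * r * x1
          - eps * r * r * (1.5 * x1 ** 2 + 1.804 * x3 ** 2)
          - eps ** 2 * r * r * (1.804 * x1 * x3 ** 2 + 0.5 * x1 ** 3)
          - p["f_a"] * x5)
    a2 = (-2.0 * p["zeta2"] * x4 - x3
          - eps * r * r * 3.74 * x1 * x3
          - eps ** 2 * r * r * (1.8687 * x1 ** 2 * x3 + 6.74 * x3 ** 3))
    af = -2.0 * p["zeta_f"] * p["r_f"] * x6 - p["r_f"] ** 2 * x5
    return a1, a2, af


def mean_squares(p, n_records, n_steps, dt, seed=0, x0=(0.0,) * 6):
    """Ensemble mean squares of X1, X3 and X5 at the final time step."""
    rng = random.Random(seed)
    noise_scale = p["noise"] * math.sqrt(dt)  # standard deviation of one noise increment
    sums = [0.0, 0.0, 0.0]
    for _ in range(n_records):
        x = list(x0)
        for _ in range(n_steps):
            a1, a2, af = accelerations(x, p)
            # Update the velocities first, then the displacements with the new velocities.
            x[1] += a1 * dt
            x[3] += a2 * dt
            x[5] += af * dt + noise_scale * rng.gauss(0.0, 1.0)
            x[0] += x[1] * dt
            x[2] += x[3] * dt
            x[4] += x[5] * dt
        sums[0] += x[0] ** 2
        sums[1] += x[2] ** 2
        sums[2] += x[4] ** 2
    return [s / n_records for s in sums]
```

Note that every term in the second mode equation contains $X_3$ or $X_4$, so a record started from exactly zero initial conditions keeps the second mode at rest. A small non-zero `x0` entry for the second mode is needed in this sketch before any mixed mode response can develop.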

The mean square response and filtered excitation time history records are shown in
figure 9 for $\zeta_f = 0.1$, $\zeta_1 = \zeta_2 = 0.005$, $(\varepsilon^2\sigma)^2 = 0.00625$, and $\Delta = -4$ ($r = 0.99$). This figure
also includes the mean square responses as determined by the non-Gaussian closure
scheme (shown by dashed curves) for comparison. It is seen that the numerical simulation
and the non-Gaussian closure solution are in good agreement. The fluctuation exhibited by the
Monte Carlo simulation records may be attributed to the statistical error of the ensemble
averaging. As shown in the closure results, second mode bifurcation is mainly governed by
system parameters and excitation level. However, Monte Carlo simulation predicted
mixed mode interaction at a relatively higher excitation level than the level predicted by
the closure schemes. Due to space limitations these results are not shown in this paper. The
time history records obtained from Monte Carlo simulation are processed to estimate
other statistical parameters including the autocorrelation function, power spectra, and
probability density function. The power spectra are determined from the Fourier transform
of the autocorrelation functions using the ILS software with a Hanning window algorithm.
Because the system is tuned at exact 1:1 internal resonance and the central frequency of the
filtered excitation is close to the first mode natural frequency, the response spectra are
localized at the normal mode natural frequencies as shown in figure 10. It is also observed
that there is a sharp spike at zero frequency, which implies a non-zero mean for the response
processes under zero mean random excitation.

Figure 10. Estimated power spectra (same parameters as in figure 9).

Figure 11. Estimated probability densities according to Monte Carlo simulation.
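
The spectral estimate described above (Fourier transform of the autocorrelation function with a Hanning window) can be sketched as follows. How the ILS software applies the window is not stated in the text, so this sketch assumes the window is used as a lag window on the autocorrelation estimate (the Blackman-Tukey approach). The function names and arguments are hypothetical.

```python
import math


def autocorrelation(x, max_lag, remove_mean=True):
    """Biased autocorrelation estimate for lags 0..max_lag."""
    n = len(x)
    mean = sum(x) / n if remove_mean else 0.0
    d = [v - mean for v in x]
    return [sum(d[i] * d[i + k] for i in range(n - k)) / n
            for k in range(max_lag + 1)]


def power_spectrum(x, dt, max_lag, n_freq, remove_mean=True):
    """One-sided power spectral density from the Hann-windowed autocorrelation.
    Returns frequencies from 0 to the Nyquist frequency and the PSD estimates."""
    R = autocorrelation(x, max_lag, remove_mean)
    # Hann lag window: 1 at zero lag, tapering to 0 at max_lag.
    w = [0.5 * (1.0 + math.cos(math.pi * k / max_lag)) for k in range(max_lag + 1)]
    f_nyquist = 0.5 / dt
    freqs = [f_nyquist * j / n_freq for j in range(n_freq + 1)]
    psd = []
    for f in freqs:
        s = R[0] * w[0] + 2.0 * sum(
            R[k] * w[k] * math.cos(2.0 * math.pi * f * k * dt)
            for k in range(1, max_lag + 1))
        psd.append(2.0 * dt * s)
    return freqs, psd
```

Calling `power_spectrum` with `remove_mean=False` keeps the contribution of a non-zero mean, which then appears as a peak at zero frequency, in line with the spike noted in the text.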

The probability density functions of the filtered excitation and response time history
records are estimated and shown in figure 11 for $\zeta_f = 0.05$, $\zeta_1 = \zeta_2 = 0.005$,
$(\varepsilon^2\sigma)^2 = 0.00625$, and $\Delta = -4$. The figure includes the corresponding Gaussian density
function, shown by the dashed curve, as a reference for measuring the deviation of the
response from normality. It is seen that the filtered excitation is Gaussian since it is
generated from a linear constant coefficient filter, while the response coordinates
demonstrate significant deviation from Gaussianity characterized by multiple maxima.
Depending on the value of the filter damping ratio, different response probability
density functions are obtained. For small damping ratios, $\zeta_f = 0.05$ and 0.1, the
probability density functions of the first and the second mode responses exhibit multiple
maxima. The transition from multiple maxima to a single maximum in the density of the second
mode response is found to take place for relatively higher filter damping ratios, $\zeta_f > 0.15$.
This implies that the deviation from Gaussianity and the degree of nonlinear interaction
between the two modes become more significant as the filtered excitation
bandwidth becomes narrower.
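
The comparison between an estimated probability density and the corresponding Gaussian density can be sketched with a simple histogram estimator. This is an illustration only, not the FORTRAN or ILS processing used by the authors. The function names are hypothetical, and the excess kurtosis is added here as one convenient scalar measure of deviation from normality (it is zero for a Gaussian process and negative for a strongly bimodal one).

```python
import math


def histogram_density(samples, n_bins):
    """Histogram estimate of the probability density.
    Returns bin centres and density values that integrate to one."""
    lo, hi = min(samples), max(samples)
    width = (hi - lo) / n_bins
    counts = [0] * n_bins
    for s in samples:
        idx = min(int((s - lo) / width), n_bins - 1)  # put the maximum in the last bin
        counts[idx] += 1
    centres = [lo + (i + 0.5) * width for i in range(n_bins)]
    density = [c / (len(samples) * width) for c in counts]
    return centres, density


def gaussian_density(x, mean, var):
    """Gaussian reference density with the given mean and variance."""
    return math.exp(-(x - mean) ** 2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def excess_kurtosis(samples):
    """Fourth central moment over squared variance, minus 3."""
    n = len(samples)
    mean = sum(samples) / n
    m2 = sum((s - mean) ** 2 for s in samples) / n
    m4 = sum((s - mean) ** 4 for s in samples) / n
    return m4 / m2 ** 2 - 3.0
```

In use, the sample mean and variance of a response record would be passed to `gaussian_density` and evaluated at the bin centres returned by `histogram_density`, giving the reference curve against which the estimated density is plotted.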


3. Experimental investigation 

3.1 Experimental model and equipment 

Experimental random vibration tests of nonlinear structures are scarce and valuable. 
They are valuable in exploring new phenomena not predicted from theory. They are 
particularly important where different theoretical approaches lead to different results 
for the same problem. They also provide physical insight into a number of complex 
dynamic phenomena and in identifying systems' modal parameters. In the experimental
tests we consider a beam of spring steel with 0.254 m length and 0.0127 × 0.00005 m
cross-section. Figure 12 shows a schematic diagram of the instrumentation used in the
lab. The beam is clamped at both ends and mounted on a platform directly bolted to the
armature of a 2669 N (600 lb) thrust VTS electromagnetic shaker. The excitation signal
of the shaker is detected using a piezoelectric Bruel & Kjaer Model 4383 accelerometer
which is mounted on the shaker. The beam response is measured using two 350 Ohm
strain gauges (CEA-06-250UW 350 type) in a two arm bridge. The signals from both
transducers are amplified and digitized using an analog-digital converter system. The
input signal to the shaker is generated by a random noise generator (HP-3565S system)
and is amplified before it goes to the shaker.

The r.m.s. acceleration and frequency of the shaker output are adjusted according to
pre-selected values. This is done by incorporating a combination of frequency band
shaping filters and a buffer amplifier with step input attenuator between the random
noise generator and the shaker. The random excitation is generated by using an HP
VISTA module of the HP-3565S system and amplified through a Techron Model 7550
power amplifier. The response signal, on the other hand, is amplified by a
Micro-Measurements Model 2310 signal conditioning amplifier. The excitation and response
signals are read by an HP-3565 Spectrum Analyzer and the data are transferred to a Sun
3/260 computer. Statistical parameters of the response and excitation are estimated by
using the ILS (Interactive Laboratory System) software. In addition to the subroutines
available in the ILS software, a FORTRAN program was written by the authors to
estimate the probability density functions.

Figure 12. Experimental equipment and set-up.

3.2 Experimental results 

Several attempts were made to buckle the beam with initial static deflection by exerting
an initial axial load higher than the Euler buckling load at one end of the beam. These
attempts were carried out to reach an initial static deflection $V_0$ close to the theoretical
critical value $V_0/h = 1.66$, which corresponds to the 1:1 internal resonance condition. The
closest value achieved during the tests is in the vicinity of $V_0/h = 1.86$. Accordingly, it
was not possible to satisfy the condition of exact internal resonance. The damping ratio
$\zeta_1$ was measured from the free vibration tests and was found to be 0.003. The first two
natural frequencies of the beam were measured from the response spectra and were
found to be $\omega_1 = 80$ Hz and $\omega_2 = 110$ Hz. The excitation of the actual tests was then
specified to have a centre frequency close to 75 Hz and a bandwidth of 100 Hz. It was
observed that there was a critical excitation level below which the beam oscillates about
one potential well (one of the buckled positions). Above this critical level, the beam was
observed to oscillate between the two wells where the snap-through took place.

Figure 13. Experimental mean square time histories.

Figure 14. Measured probability densities of the shaker and beam response (dotted
curves show Gaussian densities).

Figure 15. Power spectra.


R A Ibrahim and B H Lee 



Figure 16. Time history records showing the snap-through.

However, for a long test duration, it was difficult to identify the excitation level due to
the inevitable uncertainty at the beam boundary conditions.

Figure 13 shows time history records of the mean squares of the shaker acceleration
and the beam deflection. The excitation level of this case is below the critical level. It is
seen that the beam mean square reaches a stationary value and the oscillations take place
about one buckled position. The probability density function of the response exhibits
slight deviation from the Gaussian distribution as shown in figure 14. The response
signal is decomposed into two components corresponding to the first two modes
through an ideal band-pass filter with cut-off frequencies 30-95 Hz and 95-220 Hz,
respectively. Each filtered signal was processed to estimate the corresponding power
spectral density function. Figure 15 shows the power spectra of the excitation and the
first two modes. It is seen that each mode spectrum is centred around its natural frequency
and the spectrum level of the second mode is higher than that of the first mode.

Figure 17. Time history records of mean squares (corresponding to snap-through).

Figure 18. Measured power spectra corresponding to snap-through case.

Figure 19. Measured probability densities (dotted curves are the corresponding Gaussian).
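
The modal decomposition by an ideal band-pass filter, as described above, can be sketched with a direct discrete Fourier transform: all frequency bins outside the pass band are set to zero and the signal is transformed back. This is an illustrative sketch, not the processing chain used in the tests. The function names are hypothetical, and the plain O(n²) transform is chosen only to keep the example self-contained.

```python
import cmath
import math


def dft(x):
    """Direct discrete Fourier transform (adequate for short records)."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def ideal_band_pass(x, fs, f_lo, f_hi):
    """Keep only the components with f_lo <= |f| < f_hi (fs = sampling rate in Hz)."""
    n = len(x)
    X = dft(x)
    for k in range(n):
        f = min(k, n - k) * fs / n  # frequency magnitude represented by bin k
        if not (f_lo <= f < f_hi):
            X[k] = 0.0
    # Inverse transform; the result is real because the bins are zeroed symmetrically.
    return [sum(X[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)).real / n
            for t in range(n)]
```

With the cut-off frequencies quoted in the text, `ideal_band_pass(signal, fs, 30.0, 95.0)` and `ideal_band_pass(signal, fs, 95.0, 220.0)` would give the first and second mode components, provided the sampling rate `fs` exceeds 440 Hz.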

Under a relatively higher excitation level the snap-through phenomenon was observed,
as shown in the time history records of figure 16. The response signals are found to
fluctuate between two potential wells in an irregular manner. The time history records 
of the mean squares of the excitation and the beam response are shown in figure 17 and 
it is clear that the beam response does not achieve stationarity during the test period. 
The power spectra of the shaker and beam response are estimated and displayed in 
figure 18. The response spectrum is not concentrated at a single frequency but rather is 
spread over a wide frequency band. The same feature has been predicted numerically 
and the beam response is mainly characterized by chaotic motion. The corresponding 
probability density is shown in figure 19 which exhibits two peaks corresponding to the 
two wells of the buckled beam. The straight position is unstable in all tests as predicted 
numerically. These experimental results are reported for qualitative comparison and 
no quantitative comparison with the analytical or numerical results is made. The 
reason is that, during each test, the initially given conditions such as the axial load and 
the initial static deflection do not remain constant due to the inevitable slipping effect at 
the beam ends. 


4. Conclusions 

Nonlinear random vibration is important in the design stage of mechanical systems. 
Several complex characteristics owe their origin to nonlinear effects. The theory of 
nonlinear random vibration is limited in predicting these characteristics. Numerical 
simulation as well as experimental tests are important in exploring other phenomena 
such as snap-through and stochastic chaos. Under a filtered white noise excitation, the 
nonlinear response and stochastic bifurcation of a clamped-clamped beam have been 
studied in the neighbourhood of 1:1 internal resonance. The mean square responses 
show that the bifurcation of the second mode response depends on the excitation level,
system damping ratios, internal detuning parameter, and the damping ratio of the filter
equation. The damping ratio of the filter equation is related to the bandwidth of the filtered
excitation. The response bifurcation is examined at different excitation levels, detuning
ratios, and damping ratios of the linear filter equation. The range of two-mode interaction
predicted by non-Gaussian closure is wider than that by Gaussian closure. Monte Carlo 
simulation is found to be in good agreement with the analytical results in predicting the 
second mode bifurcation boundary. The effect of damping ratio of the linear filter on 
the response statistics is found to reduce the interaction between the two modes. 


This research is supported by a grant from the National Science Foundation under 

Nonlinear deterministic and random response of shallow shells

Abstract. Deterministic and random vibrations are considered for the
case of shear deformable shallow shells composed of multiple perfectly bonded 
layers. The nontrivial generalization of the flat plate vibrations is expressed by 
the fact of “small amplitude” vibrations existing about the curved equilibrium 
position together with the snap-through and snap-buckling type large 
amplitude vibrations about the flat position. The geometrically nonlinear 
vibrations are treated by applying Berger’s approximation to the generalized 
von Karman-type plate equations considering hard hinged supports of the 
straight boundary segments of skew or even more generally shaped polygonal 
shells. Shear deformation is considered by means of Mindlin’s kinematic 
hypothesis and a distributed lateral force loading is applied. Application of a 
multi-mode expansion in the Galerkin procedure to the governing differential 
equation, where the eigenfunctions of the corresponding linear plate problem 
are used as space variables, renders a coupled set of ordinary time differential 
equations for the generalized coordinates with cubic and quadratic
nonlinearities. For reasons of convergence, a light viscous modal damping is
added. The nonlinear steady-state response of shallow shells subjected to a 
time-harmonic lateral excitation is investigated and the phenomenon of 
primary resonance is studied by means of the “perturbation method of multiple 
scales”. The use of a nondimensional formulation and introduction of 
the eigen-time of the basic mode of the associated linearized problem provides 
a unifying result with respect to the planform of the shell. Within the scope 
of random vibrations, it is assumed that the effective forces can be modelled 
by uncorrelated, zero-mean wide-band noise processes. Considering the set 
of modal equations to be finite, the Fokker-Planck-Kolmogorov (FPK) 
equation for the transition probability density of the generalized coordinates
and velocities is derived. Its stationary solution gives the probability
of eventual snapping after a long time has elapsed. However, the probability
of first occurrence follows from the (approximate) integration of the
nonstationary FPK equation. The probability of first dynamic snap-through
is derived for a single mode approximation with the influence of
higher modes taken into account. Using the two-mode expansion, the
probability distribution of the asymmetric snap-buckling is also evaluated.

Keywords. Nonlinear vibrations; shallow shells; primary resonance; snap-through;
snap buckling; Fokker-Planck equation.


1. Introduction 

Von Karman’s nonlinear plate equations represent the starting point of several 
modified theories for plate and shell structures. Recent reviews of nonlinear plate 
problems are given by Sathyamoorthy (1987) and Chia (1988). Within the scope of 
static applications, von Karman’s nonlinear theory, originally derived for flat plates, 
was generalized by Marguerre (1950) to include the effect of initial imperfections, and in 
von Karman & Tsien (1941) it was extended to cylindrical shell problems. Extensive 
literature surveys on the topic of nonlinear free vibrations of shallow shells are 
elaborated by Leissa & Kadi (1971). Evensen (1972) gives a review on nonlinear natural 
and forced oscillations of thin-walled circular cylindrical shells, where existing
disagreements as to whether the shell behaves as a hard or a soft spring, and whether the
type of behaviour depends upon the boundary conditions and/or the shell being open 
or closed, are discussed. 

Following a procedure developed by the authors, Heuer et al (1990, 1992), for flat 
and for buckled isotropic plates (of polygonal planform), Heuer (1994) derived a similar 
unifying representation of the influence of large amplitudes on forced vibrations of 
shallow shells with a generally shaped polygonal planform. Shells composed of 
multiple transversely isotropic elastic layers are considered. Equations of motion 
according to the dynamic version of the von Karman-Tsien theory, compare von 
Karman & Tsien (1941), modified by the kinematic hypothesis of Mindlin (1951) are the 
starting point. The edges are assumed to be simply supported and the in-plane 
displacements are constrained such that the approximation of Berger (1955) holds. 
Based on the concept of Jones & Klein (1968) a correspondence of the laminated shell 
to a moderately thick homogeneous shell is found. A multi-mode approach and the 
Galerkin procedure are subsequently applied to approximately solve the boundary 
value problem. The result of the projection is a set of nonlinearly coupled ordinary
differential equations. A unifying result with respect to polygonal planform is achieved 
by the time transformation to the linear eigentime of the associated linearized plate 
problem. 

As an illustration of deterministic oscillations, the nonlinear steady-state response of 
shallow shells and panels subjected to a time-harmonic lateral excitation is investigated.
With respect to the phenomenon of primary resonance, a single-term approximation
is used, and the solution of the corresponding problem is obtained by means of
the “perturbation method of multiple scales” according to Nayfeh & Mook (1979). 

Furthermore, shallow shells excited by random forces are considered. In the case
where the generalized forces can be modelled by uncorrelated, zero-mean white noise
processes, and a finite set of modal equations suffices, the Fokker-Planck-Kolmogorov
(FPK) equation for the stationary transition probability density of the generalized
coordinates and velocities is derived. Up to now, the FPK equation has been used to
determine the transition probability density function of the response of rather simple
structures like strings (Ariaratnam 1962), beams (Herbert 1964), rectangular plates
(Herbert 1965), and shallow arches (Ariaratnam & Sankar 1968; Sankar & Ariaratnam
1971a, b; Sun & Hsu 1989). The authors recently extended that classical approach to
more complex plane and shallow structures with polygonal boundaries (Heuer et al
1992, 1993a, b). Their strategy is applied in this paper to shallow shells when forced by
random pressure loads. Nonlinear vibrations of shallow arches and buckled beams in 
the elastic range have been considered by Seide (1986), Penketgorn et al (1989), Locke 
& Mei (1990), Abou-Rayan & Nayfeh (1993) by means of either simulation techniques 
or with the closure scheme and also by discretization with FEM. 


2. Multimodal approach for nonlinear vibrations of shallow shells 


Geometrical nonlinearity is considered by means of the kinematic assumptions for the 
midsurface strains of the shallow shell according to von Karman & Tsien (1941). 
Taking into account the effect of shear, the distribution of strain through the thickness 
of the shell is assumed according to the Reissner-Mindlin first order shear deformation 
theory, see Reissner (1985) and Mindlin (1951). Using these kinematic assumptions 
when setting up the strain energy, the contributions due to the in-plane, bending, and 
shear deformation become, respectively: 


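$$U_m = \frac{1}{2} D \int_A I_e^2 \, dA, \tag{1}$$

$$U_b = \frac{1}{2} K \int_A \left\{ (\psi_{x,x} + \psi_{y,y})^2 - (1 - \nu)\left[ 2\psi_{x,x}\psi_{y,y} - \tfrac{1}{2}(\psi_{x,y} + \psi_{y,x})^2 \right] \right\} dA, \tag{2}$$

$$U_s = \frac{1}{2s} \int_A \left[ (w_{,x} + \psi_x)^2 + (w_{,y} + \psi_y)^2 \right] dA, \tag{3}$$

where $w$ denotes the deflection and $\psi_x$, $\psi_y$ are the cross-sectional rotations. $I_e$ stands
for the first invariant of the midsurface strain tensor. The contribution of the second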
invariant is neglected in (1) following Berger (1955). Berger’s assumption is a reasonably
well-behaved approximation for structures with immovable in-plane boundary
conditions. It renders exact results in the case of a shallow shell strip and has been
confirmed for circular plates by Schmidt (1974) using a perturbation technique and the
von Karman equations. Immovable in-plane boundary conditions are considered
throughout the paper. In (1)-(3), $A$ denotes the shell area projected onto the $(x, y)$-plane, and
$D$, $K$, $1/s$ denote the effective stiffnesses in the case of symmetrically laminated shells
composed of transversely isotropic layers,


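$$(D, K) = \sum_{k=1}^{n} \int_{z_{k-1}}^{z_k} \frac{E_k}{1 - \nu_k^2}\,(1, z^2)\,dz, \qquad \nu = \frac{1}{K} \sum_{k=1}^{n} K_k \nu_k, \tag{4}$$

$$\frac{1}{s} = \kappa^2 \sum_{k=1}^{n} \int_{z_{k-1}}^{z_k} G_{ck}\,dz, \tag{5}$$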


and $\nu$ is an effective Poisson’s ratio. The shear factor $\kappa^2$ is commonly set to 1.2.

Considering the lateral pressure loading p perpendicular to the shallow shell surface 
and neglecting rotatory as well as in-plane inertia, the field equations of this shear- 
deformable Berger-type shallow shell theory have been derived by Heuer (1994) using 
Hamilton’s principle. For thermally stressed and buckled plates see Heuer et al (1990,
1992, 1993a). As a result, the deflection turns out to be governed by the single
fourth-order differential equation ($\rho$ is the averaged mass per unit of area and a dot
denotes differentiation with respect to time $t$):

$$K(1 + sn)\Delta\Delta w - n[\Delta w - 2(H - Ks\Delta H)] - Ks\rho\,\Delta\ddot{w} + \rho\,\ddot{w} = p - Ks\Delta p, \tag{6}$$

where $n$ characterizes a time variant isotropic in-plane force that is constant throughout
the plate domain and is related to the deflection by the averaging integral

$$n = -\frac{D}{2A} \int_A w(\Delta w - 4H)\,dA. \tag{7}$$

$H$ is the initial Gaussian curvature of the shallow shell.

Shallow shells with straight edges and with boundary conditions of the hard hinged 
type are considered. 


$$\Gamma: \quad w = 0, \quad \Delta w = 0. \tag{8}$$


In a multimodal approach, the deflection and the lateral force loading of the shell are
expanded into the orthogonal set of eigenfunctions $w_j^*$ of the corresponding linearized
flat plate problem where $n = 0$:

$$w(\mathbf{x}, t) = \sum_{j=1}^{N} c_j\, q_j^*(t)\, w_j^*(\mathbf{x}), \qquad p(\mathbf{x}, t) = \sum_{j=1}^{N} c_{pj}\, f_{pj}^*(t)\, w_j^*(\mathbf{x}). \tag{9}$$

The coefficients c carry the appropriate dimensions and may be chosen freely. The 
superscript (*) stands for nondimensional quantities. In Irschik (1985) it has been 
shown, that the eigenfunctions of those shear deformable simply supported plates with 
polygonal planform are governed by a set of second-order Helmholtz-differential 
equations with homogeneous Dirichlet’s boundary conditions, 


$$\Delta w_j^* + \alpha_j w_j^* = 0, \quad j = 1, 2, \ldots, N, \tag{10}$$

$$\Gamma: \quad w_j^* = 0. \tag{11}$$

For homogeneous shear deformable plates, the eigenvalues are directly proportional to 
the linear natural frequencies by 


$$\omega_{jL0} = \left[ \frac{K\alpha_j^2}{\rho(1 + Ks\alpha_j)} \right]^{1/2}. \tag{12}$$
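
As a small numerical illustration of the relation between Helmholtz eigenvalue and linear natural frequency, $\omega_{jL0} = [K\alpha_j^2/\rho(1 + Ks\alpha_j)]^{1/2}$, the sketch below uses the textbook eigenvalues of a simply supported rectangle. The function names and all material values are assumptions chosen only for the example.

```python
import math

def helmholtz_eigenvalue_rect(m, n, a, b):
    """Dirichlet eigenvalue of the Helmholtz problem on an a-by-b rectangle."""
    return math.pi ** 2 * ((m / a) ** 2 + (n / b) ** 2)

def natural_frequency(alpha, K, s, rho):
    """Linear natural frequency of the shear deformable plate for eigenvalue alpha.

    K: bending stiffness, 1/s: shear rigidity, rho: mass per unit area.
    """
    return math.sqrt(K * alpha ** 2 / (rho * (1.0 + K * s * alpha)))

# Hypothetical plate data (SI units), for illustration only.
K, rho, a, b = 2.0e4, 20.0, 1.0, 1.5
alpha_1 = helmholtz_eigenvalue_rect(1, 1, a, b)

omega_thin = natural_frequency(alpha_1, K, 0.0, rho)      # s = 0: no shear deformation
omega_thick = natural_frequency(alpha_1, K, 1.0e-7, rho)  # finite shear rigidity
```

With $s = 0$ the expression reduces to the classical thin-plate result $\alpha_j (K/\rho)^{1/2}$, and any finite shear flexibility lowers the frequency.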


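Using (9) as a Ritz-approximation for the solution of (6) and running through the
Galerkin procedure gives the set of coupled nonlinear ODEs; a light linear viscous
modal damping $\lambda_j^*$ is added at this stage of discretization:

$$q_j^{*\prime\prime} + 2\lambda_j^* q_j^{*\prime} + \sum_{k=1}^{N} a_{jk}^* q_k^* + \sum_{k=1}^{N} b_{jk}^* q_k^{*2} + q_j^* \sum_{k=1}^{N} d_{jk}^* q_k^* + q_j^* \sum_{k=1}^{N} e_{jk}^* q_k^{*2} = p_j^* f_{pj}^*, \quad j = 1, \ldots, N, \tag{13}$$

where

$$a_{jk}^* = \frac{1}{\rho(1 + Ks\alpha_j)\,\omega_{1L0}^2} \left[ K\alpha_j^2 \delta_{jk} + \frac{4D\theta_k^* c_k}{R_0^2\, \beta_j^* c_j} \left( \theta_j^* - Ks\,\delta_j^*/R_0^2 \right) \right], \tag{14}$$

$$b_{jk}^* = \frac{D\alpha_k \beta_k^* c_k^2}{R_0\, \beta_j^* c_j}\, \frac{\theta_j^* - Ks\,\delta_j^*/R_0^2}{\rho(1 + Ks\alpha_j)\,\omega_{1L0}^2}, \qquad d_{jk}^* = \frac{2D\alpha_j \theta_k^* c_k}{\rho R_0\, \omega_{1L0}^2}, \tag{15}$$

$$e_{jk}^* = \frac{D\alpha_j \alpha_k \beta_k^* c_k^2}{2\rho\, \omega_{1L0}^2}, \qquad p_j^* = \frac{c_{pj}}{\rho c_j\, \omega_{1L0}^2}, \tag{16}$$

$$\beta_j^* = \frac{1}{A} \int_A w_j^{*2}\, dA, \qquad \theta_j^* = \frac{R_0}{A} \int_A H w_j^*\, dA, \qquad \delta_j^* = \frac{R_0^3}{A} \int_A \Delta H\, w_j^*\, dA. \tag{17}$$

$R_0$ stands for a characteristic length and $\delta_{jk}$ represents the Kronecker delta.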
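
The origin of the quadratic and cubic terms can be made explicit by inserting the modal expansion (9) into the averaging integral (7). The following worked step is a sketch under stated assumptions: the eigenfunctions are taken as orthogonal with $\int_A w_j^* w_k^*\,dA = A\beta_j^*\delta_{jk}$, and $\theta_k^* = (R_0/A)\int_A H w_k^*\,dA$ is used as the curvature integral.

```latex
\begin{aligned}
n &= -\frac{D}{2A}\int_A w\,(\Delta w - 4H)\,dA, \qquad
w = \sum_{k=1}^{N} c_k q_k^* w_k^*, \qquad \Delta w_k^* = -\alpha_k w_k^*, \\
\int_A w\,\Delta w\,dA &= -A\sum_{k=1}^{N} \alpha_k \beta_k^* c_k^2 q_k^{*2}, \qquad
\int_A w\,H\,dA = \frac{A}{R_0}\sum_{k=1}^{N} \theta_k^* c_k q_k^*, \\
n &= \frac{D}{2}\sum_{k=1}^{N}\Bigl(\alpha_k \beta_k^* c_k^2 q_k^{*2}
     + \frac{4\theta_k^* c_k}{R_0}\,q_k^*\Bigr).
\end{aligned}
```

The in-plane force is thus the sum of a part quadratic in the generalized coordinates and a curvature-induced part linear in them; multiplied by the deflection in (6), these yield the cubic and quadratic couplings respectively.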

The non-dimensional form of (13) has been derived by scaling the time with the
fundamental natural frequency of the linearized plate problem,

$$t^* = \omega_{1L0}\, t, \tag{18}$$

where

$$(\ )' = \frac{d}{dt^*}.$$


Structures having the same similarity numbers exhibit an identical nondimensional
response, whether deterministic or random. Any possibly non-regular polygonal planform of the
shell enters only via the parameters $\alpha_j$ and $\beta_j^*$ of the simple linear boundary value
problems as determined by (10) and (11). Thus, only these linear eigenvalue problems
have to be solved.
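
To illustrate how a planform enters only through its Helmholtz eigenvalues, the sketch below lists the eigenvalue ratios $\alpha_j/\alpha_1$ for a right-angled isosceles triangle with Dirichlet boundary conditions. It relies on the classical closed-form spectrum $\alpha = \pi^2(m^2 + n^2)/a^2$, $m > n \ge 1$ (leg length $a$), which is a textbook result and not taken from this paper; the function name is hypothetical.

```python
from fractions import Fraction

def triangle_eigenvalue_ratios(count):
    """First `count` distinct ratios alpha_j/alpha_1 for the right-angled
    isosceles triangle (Dirichlet problem), using alpha ~ m^2 + n^2, m > n >= 1."""
    # m up to count + 1 is enough to capture the `count` smallest values.
    values = sorted({m * m + n * n
                     for m in range(2, count + 2)
                     for n in range(1, m)})
    lowest = values[0]  # (m, n) = (2, 1) gives 5
    return [Fraction(v, lowest) for v in values[:count]]

ratios = triangle_eigenvalue_ratios(4)
```

The ratio 13/5 quoted later for the triangular shell appears as the third entry of this list.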


3. Steady-state motion at primary resonances 

In this section, the behaviour of shallow shells subjected to a time-harmonic lateral 
excitation is examined. In order to investigate the phenomenon of primary resonance 
the forcing frequency is selected in the neighbourhood of an eigenfrequency of the linear 
structure. For a qualitative study, a single-term approximation is used as starting point, 
and the governing equation (13) becomes in that case 

$$q^{*\prime\prime} + 2\lambda^* q^{*\prime} + a^{*2} q^* + b^* c^* q^{*2} + e^* c^{*2} q^{*3} = (p^*/c^*) \cos(\Omega^* t^*). \tag{19}$$

The non-dimensional parameters,

$$a^* = (a_{11}^*)^{1/2}, \quad b^* = (b_{11}^* + d_{11}^*)/c^*, \quad e^* = e_{11}^*/c^{*2}, \quad c^* = c_1/R_0, \tag{20}$$

are determined by means of the expressions according to (14)-(17), and they are
independent of the dimensionless amplitude $c^*$.

In the case of primary resonance the forcing frequency is set to

$$\Omega^* = a^* + \varepsilon^2 \sigma^*, \tag{21}$$

where the detuning parameter $\sigma^*$ quantitatively describes the nearness of $\Omega^*$ to the
linear eigenfrequency. For steady-state response, the “perturbation method of multiple
scales” according to Nayfeh & Mook (1979) is applied to obtain in a straightforward
manner the following nonlinear frequency response equation:



where

$$g^* = e^* - \frac{10}{9} \left( \frac{b^*}{a^*} \right)^2 \tag{23}$$

represents the influence of large deflection on the frequency response function (FRF).
According to the sign of $g^*$ the shell behaves as a hard spring ($g^* > 0$) or as a soft spring
($g^* < 0$) and, as $g^*$ approaches zero, the problem becomes linear. Figure 1 shows this
nonlinear influence. Here, all the FRFs are displayed for shells with given damping
ratio $\lambda^* = 0.15$ and load factor $p^* = 0.5$, for the parameter values
$g^* = -0.8, 0, 4.0$. The bending of the resonance curves leads to multivalued
amplitudes and the entire solution splits into stable and unstable branches. At
those points where the tangent is vertical the well-known jump phenomenon, which
indicates dynamic snap-through of the shell, occurs.

Figure 1. Nonlinear frequency response function according to (22) for three nonlinearity
parameters $g^* = -0.8, 0, 4.0$; $p^* = 0.5$, $\lambda^* = 0.15$, $a^* = 1$; stable (solid) and
unstable (dashed) portions, and backbone curves.
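
The multivalued amplitudes can be illustrated with the standard frequency-response relation of the cubic Duffing oscillator $q'' + 2\lambda q' + a^2 q + \gamma q^3 = P\cos\Omega t$, $\Omega = a + \sigma$, as obtained by the method of multiple scales: $[\lambda^2 + (\sigma - 3\gamma A^2/8a)^2]A^2 = P^2/4a^2$. This is a generic textbook sketch, not the paper's equation (22); the parameter values and function names are assumptions.

```python
def residual(x, sigma, lam, a, gamma, force):
    """Frequency-response residual for x = A**2 (A = steady-state amplitude)."""
    shift = 3.0 * gamma * x / (8.0 * a)  # amplitude-dependent frequency shift
    return x * (lam ** 2 + (sigma - shift) ** 2) - force ** 2 / (4.0 * a ** 2)

def count_steady_amplitudes(sigma, lam, a, gamma, force, x_max=50.0, steps=50000):
    """Count positive roots x = A**2 by sign changes on a uniform grid."""
    count = 0
    prev = residual(0.0, sigma, lam, a, gamma, force)
    for i in range(1, steps + 1):
        cur = residual(x_max * i / steps, sigma, lam, a, gamma, force)
        if prev * cur < 0.0:
            count += 1
        if cur != 0.0:
            prev = cur
    return count

# Hard spring (gamma > 0): three amplitudes above resonance, one below.
n_above = count_steady_amplitudes(3.0, 0.05, 1.0, 1.0, 0.5)
n_below = count_steady_amplitudes(-3.0, 0.05, 1.0, 1.0, 0.5)
n_linear = count_steady_amplitudes(3.0, 0.05, 1.0, 0.0, 0.5)
```

Three coexisting amplitudes correspond to two stable branches separated by an unstable one; the jump occurs where two of them merge.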


4. Nonlinear random vibrations 

In the present analysis it is assumed that the initial curvature of the shallow shell is 
proportional to the basic eigenmode of the linearized plate problem, 

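$$H = \tfrac{1}{2} C \alpha_1 w_1^*, \tag{24}$$

where $C\alpha_1$ measures the intensity of initial curvature. In that case (13) turns out to be
given by

$$q_j^{*\prime\prime} + 2\lambda_j^* q_j^{*\prime} + a_j^* q_j^* + \delta_{1j} \sum_{k=1}^{N} b_{1k}^* q_k^{*2} + 2b_{1j}^* q_1^* q_j^* + q_j^* \sum_{k=1}^{N} e_{jk}^* q_k^{*2} = f_j^*, \tag{25}$$

where

$$a_j^* = \alpha_j^{*2} [(1 + s^*)/(1 + s^* \alpha_j^*)] + C^{*2} D^* \beta_1^* (1 + s^*)\, \delta_{1j}, \tag{26}$$

$$b_{1j}^* = \tfrac{1}{2} C^* D^* (\beta_1^*)^{1/2} \alpha_j^* (1 + s^*), \qquad e_{jk}^* = \tfrac{1}{2} D^* \alpha_j^* \alpha_k^* (1 + s^*), \tag{27}$$

with the following similarity numbers:

$$\alpha_j^* = \alpha_j/\alpha_1, \quad s^* = Ks\alpha_1, \quad D^* = DR_0^2/K, \quad C^* = C/R_0, \quad \beta_j^* = \frac{1}{A} \int_A w_j^{*2}\, dA. \tag{28}$$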


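To finally derive the simple form of (25), the following choice must be made, but without
loss of generality:

$$c_j^2 = R_0^2/\beta_j^*, \qquad c_{pj} = c_j K \alpha_1^2/(1 + s^*). \tag{29}$$

The lateral force load is split into the static and the dynamic part,

$$f_j^* = f_{js}^* + f_{jd}^*(t), \tag{30}$$

where $f_{js}^*$ is assumed to be given and deterministic, while $f_{jd}^*(t)$ is characterized by
a stochastic process. By definition, a generalized potential function

$$V = \sum_{j=1}^{N} \left[ \tfrac{1}{2} a_j^* q_j^{*2} + b_{1j}^* q_1^* q_j^{*2} + \tfrac{1}{4} q_j^{*2} \sum_{k=1}^{N} e_{jk}^* q_k^{*2} - f_{js}^* q_j^* \right] \tag{31}$$

transforms the coupled modal equations (25) to the dynamically driven set of ODEs

$$q_j^{*\prime\prime} + 2\lambda_j^* q_j^{*\prime} + \partial V/\partial q_j^* = f_{jd}^*. \tag{32}$$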

It is this standard form of (32) that makes the application of the classical FPK
formulation of the theory of nonlinear random vibrations possible. This approach,
however, was until recently restricted in applications to simple structures, like one-span
beams or arches and rectangular plates or cylindrical panels. The nondimensional
form of (25), when properly interpreted, has no restrictions of validity with respect to
geometry of the planform or the “order of elasticity” of rather complex structures,
like shallow composite shells erected above a non-regular polygonal planform.

The stochastic modal dynamic loads $f_{jd}^*(t)$ are approximated by uncorrelated zero-mean
Gaussian white noise processes with the autocorrelation (we omit subsequently
the superscript (*))

$$\langle f_{jd}(t)\, f_{kd}(t + \tau) \rangle = 2D_j\, \delta(\tau)\, \delta_{jk}, \tag{33}$$

while a given deterministic static force load $f_{1s}$ is considered to be correlated to the first
mode with static amplitude $q_{1s}$.

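In this classical case the transition probability density $p(q_1 \cdots q_N, q_1' \cdots q_N', t)$ of the
system with potential restoring force and light linear modal damping is a solution of the
FPK-equation with the initial conditions

$$\frac{\partial p}{\partial t} = \sum_{j=1}^{N} \left\{ -q_j' \frac{\partial p}{\partial q_j} + \frac{\partial}{\partial q_j'} \left[ \left( 2\lambda_j q_j' + \frac{\partial V}{\partial q_j} \right) p \right] + D_j \frac{\partial^2 p}{\partial q_j'^2} \right\}, \tag{34}$$

$$\lim_{t \to 0^+} p(q_1 \cdots q_N, q_1' \cdots q_N', t) = \delta(q_1 - q_{1s})\, \delta(q_2) \cdots \delta(q_N)\, \delta(q_1') \cdots \delta(q_N').$$

The stationary solution of (34) in a mechanical setting has been derived by Caughey
(1963),

$$p(q_1 \cdots q_N, q_1' \cdots q_N') = B \exp\left\{ -\frac{1}{K} \left( \frac{1}{2} \sum_{j=1}^{N} q_j'^2 + V \right) \right\}, \tag{35}$$

with the additional assumption of constant ratios of intensities and light damping,

$$D_j/(2\lambda_j) = K = \text{const. for all } j. \tag{36}$$

Multiple integration over the velocity space gives the $N$-dimensional probability
density function of the generalized coordinates

$$p(q_1 \cdots q_N) = B \exp\{-V/K\}, \tag{37}$$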
where $B$ is determined through normalization ($N$-fold integral),

$$\int_{-\infty}^{+\infty} \cdots \int_{-\infty}^{+\infty} p(q_1 \cdots q_N) \prod_{j=1}^{N} dq_j = 1 \;\Rightarrow\; B. \tag{38}$$
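
That an exponential of the total energy solves the stationary FPK equation only for a fixed ratio of excitation intensity to damping can be checked directly. The following single-mode verification is an illustrative reduction (not taken from the source) for $q'' + 2\lambda q' + dV/dq = f$ with $\langle f(t) f(t+\tau)\rangle = 2D\,\delta(\tau)$, writing $V_{,q} = dV/dq$:

```latex
\begin{aligned}
p &= B\exp\{-(\tfrac12 q'^2 + V)/K\}, \qquad
\frac{\partial p}{\partial q} = -\frac{V_{,q}}{K}\,p, \qquad
\frac{\partial p}{\partial q'} = -\frac{q'}{K}\,p, \\
0 &= -q'\frac{\partial p}{\partial q}
   + \frac{\partial}{\partial q'}\bigl[(2\lambda q' + V_{,q})\,p\bigr]
   + D\frac{\partial^2 p}{\partial q'^2} \\
  &= \Bigl[\frac{q' V_{,q}}{K} + 2\lambda - \frac{(2\lambda q' + V_{,q})\,q'}{K}
   - \frac{D}{K} + \frac{D q'^2}{K^2}\Bigr] p
   = \Bigl(2\lambda - \frac{D}{K}\Bigr)\Bigl(1 - \frac{q'^2}{K}\Bigr) p .
\end{aligned}
```

The right-hand side vanishes for all $q'$ only if $D/(2\lambda) = K$, which is the constant-ratio condition imposed on every mode.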

The mean-square response of the $j$th modal deflection equals the second moment that
results from the cross moments by putting $j = k$,

$$\langle q_j q_k \rangle = \int_{-\infty}^{+\infty} \cdots \int_{-\infty}^{+\infty} q_j q_k\, p(q_1 \cdots q_N) \prod_{i=1}^{N} dq_i. \tag{39}$$

Considering the nonlinear vibrations of flat plates, the influence of the higher modes
upon $\langle q_1^2 \rangle$ has been studied in detail in Heuer et al (1992).
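
For a single mode the mean-square response follows from one-dimensional quadrature of the density $\exp\{-V/K\}$. The sketch below assumes the single-mode potential $V = \tfrac12 a q^2 + b q^3 + \tfrac14 e q^4 - f_s q$ and uses the trapezoidal rule; the function names, integration range, and parameter values are assumptions for illustration.

```python
import math

def potential(q, a, b, e, f_s=0.0):
    """Single-mode potential: quadratic, cubic and quartic terms plus static load."""
    return 0.5 * a * q * q + b * q ** 3 + 0.25 * e * q ** 4 - f_s * q

def mean_square(a, b, e, K, f_s=0.0, q_max=10.0, steps=20000):
    """<q^2> for the stationary density proportional to exp(-V/K).

    The normalization constant cancels in the ratio of the two integrals.
    Use e > 0 whenever b != 0 so that the density stays integrable.
    """
    dq = 2.0 * q_max / steps
    num = den = 0.0
    for i in range(steps + 1):
        q = -q_max + i * dq
        weight = 0.5 if i in (0, steps) else 1.0  # trapezoidal end weights
        p = math.exp(-potential(q, a, b, e, f_s) / K)
        num += weight * q * q * p
        den += weight * p
    return num / den

ms_linear = mean_square(a=1.0, b=0.0, e=0.0, K=0.1)     # linear oscillator
ms_hardening = mean_square(a=1.0, b=0.0, e=2.0, K=0.1)  # added cubic stiffness
```

In the linear case the result reproduces $\langle q^2 \rangle = K/a$, and a hardening cubic stiffness reduces the mean square.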

As an extension of these results, figure 2 illustrates the influence of the initial
curvature and shape, measured by $C$, (19), of shallow shells with $\alpha_j/\alpha_1 = j^2$, $j = 1, 2, \ldots$,
upon $\langle q_1^2 \rangle$. The influence of five modes is taken into account. A convergence study of
the nonlinear mean-square response in the parameter space of figure 2 has been
performed. It was found that all the higher modes $N > 5$ alter these results by less than
one half of a per mille (0.05%). That means their contribution to the mean-square value is
negligible. These results refer to a shell with polygonal boundary, without any
deterministic lateral load, when vibrating about the undeformed configuration.

It is well known, however, that the vibrations about the static equilibrium position of
shallow shells may show snap-through behavior. The probability of first occurrence of
snap-through in a time interval $T$ has been approximately determined by Ariaratnam
& Sankar (1968) for SDOF-systems of the form

$$q_1'' + 2\lambda_1 q_1' + dV/dq_1 = f_{1d}. \tag{40}$$

Using (31) with $N = 1$ and (32), the potential $V$ can be identified as

$$V(q_1) = \tfrac{1}{2} a_1 q_1^2 + b_{11} q_1^3 + \tfrac{1}{4} e_{11} q_1^4 - f_{1s} q_1. \tag{41}$$

Assuming the intensities in (36) to be sufficiently small,

$$K \ll h, \quad h = V(q_{1B}) - V(q_{1A}), \tag{42}$$

where $B$ refers to the intermediate barrier, the unstable equilibrium position where the
potential takes on its maximum, and $A$ indicates the initial (stable) equilibrium
configuration $q_{1s}$. The following approximation has been derived by Ariaratnam &
Sankar (1968):

$$P_T = \frac{T}{2\pi} \left( -\frac{V_A''}{V_B''} \right)^{1/2} \left[ (\lambda_1^2 - V_B'')^{1/2} - \lambda_1 \right] \exp(-h/K). \tag{43}$$

In the present case

$$V_{A,B}'' = \left. \frac{d^2 V}{dq_1^2} \right|_{q_1 = q_{1A},\, q_{1B}} = \left. (3e_{11} q_1^2 + 6b_{11} q_1 + a_1) \right|_{q_1 = q_{1A},\, q_{1B}}, \tag{44}$$

and $dV/dq_1 = 0$ renders the equilibrium positions $q_{1A} < q_{1B} < q_{1C}$ that exist under the
action of $f_{1s}$. Snapping occurs with probability $P_T$ from $q_{1A}$ to $q_{1C}$. Equation (43) is
evaluated and the result is plotted in figure 3 for shear deformable shallow shells. The
probability $P_T$ of snap-through is presented as a function of the static force load ratio
$f_s = f_{1s}/\max f_{1s}$, where $\max f_{1s}$ is the static snap-through load. Note that the
SDOF-approximation of (40)-(44) gives a unifying formulation for simply supported
shallow shells of arbitrary polygonal planform.

Figure 2. Linear (dashed) and nonlinear (solid) nondimensional mean-square response
$\langle q_1^2 \rangle$ as a function of $K$ according to (37) and (39). The graph is valid for shallow
shells with $\alpha_j/\alpha_1 = j^2$, $j = 1, 2, \ldots, N$, regardless of their special polygonal planform.
$C^* = C/R_0$.

Figure 3. Probability of first occurrence of snap-through $P_T$ as a function of
$f_s = f_{1s}/\max f_{1s}$ for various values of $C^*$. The graph is independent of the special skew
planform of the shallow shell. $C^* = C/R_0$.
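
The single-mode evaluation can be sketched numerically as follows. The code locates the equilibria of $V = \tfrac12 a q^2 + b q^3 + \tfrac14 e q^4 - f_s q$ by bisection and then applies a Kramers-type estimate $P_T \approx (T/2\pi)(-V_A''/V_B'')^{1/2}[(\lambda^2 - V_B'')^{1/2} - \lambda]\exp(-h/K)$; this form of the estimate is an assumed reading of (43), and all coefficient values and function names are hypothetical.

```python
import math

def V(q, a, b, e, f_s):
    return 0.5 * a * q * q + b * q ** 3 + 0.25 * e * q ** 4 - f_s * q

def dV(q, a, b, e, f_s):
    return e * q ** 3 + 3.0 * b * q * q + a * q - f_s

def d2V(q, a, b, e):
    return 3.0 * e * q * q + 6.0 * b * q + a

def equilibria(a, b, e, f_s, lo=-10.0, hi=10.0, steps=4000):
    """Roots of dV/dq = 0 in [lo, hi], found by a grid scan plus bisection."""
    roots = []
    dq = (hi - lo) / steps
    x0, f0 = lo, dV(lo, a, b, e, f_s)
    for i in range(1, steps + 1):
        x1 = lo + i * dq
        f1 = dV(x1, a, b, e, f_s)
        if f0 == 0.0:
            roots.append(x0)
        elif f0 * f1 < 0.0:
            left, right, f_left = x0, x1, f0
            for _ in range(80):
                mid = 0.5 * (left + right)
                f_mid = dV(mid, a, b, e, f_s)
                if f_left * f_mid <= 0.0:
                    right = mid
                else:
                    left, f_left = mid, f_mid
            roots.append(0.5 * (left + right))
        x0, f0 = x1, f1
    return roots

def snap_through_probability(a, b, e, f_s, lam, K, T):
    """Kramers-type estimate; returns None if no potential barrier exists."""
    roots = equilibria(a, b, e, f_s)
    if len(roots) != 3:
        return None
    q_a, q_b, _q_c = roots                 # stable, unstable (barrier), stable
    h = V(q_b, a, b, e, f_s) - V(q_a, a, b, e, f_s)
    v_a, v_b = d2V(q_a, a, b, e), d2V(q_b, a, b, e)
    rate = math.sqrt(-v_a / v_b) * (math.sqrt(lam ** 2 - v_b) - lam) / (2.0 * math.pi)
    return T * rate * math.exp(-h / K)

p_strong = snap_through_probability(1.0, -1.0, 0.8, 0.01, lam=0.05, K=0.005, T=1.0)
p_weak = snap_through_probability(1.0, -1.0, 0.8, 0.01, lam=0.05, K=0.002, T=1.0)
p_none = snap_through_probability(1.0, -1.0, 0.8, 5.0, lam=0.05, K=0.005, T=1.0)
```

The estimate is meaningful only while it remains small, in line with the assumption $K \ll h$; beyond the static snap-through load the barrier disappears and no estimate is returned.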

As was pointed out by Ashwell (1962), see also Irschik (1986) and Ziegler & Rammerstorfer
(1989), the loss of stability of the static equilibrium position may be accompanied
by a bifurcation to a higher mode. For nonlinear random vibrations, this
so-called snap-buckling has been studied in detail by Sankar & Ariaratnam (1971a, b)
starting with a two-mode approximation of the form

$$q_1'' + 2\lambda_1 q_1' + \partial V/\partial q_1 = f_{1d}, \qquad q_2'' + 2\lambda_2 q_2' + \partial V/\partial q_2 = 0. \tag{45}$$

This set of equations can be matched to the current more general formulation of (31)
with $N = 2$, by putting

$$V(q_1, q_2) = \tfrac{1}{4}(e_{11} q_1^4 + e_{22} q_2^4) + b_{11} q_1^3 + \tfrac{1}{2}(a_1 q_1^2 + a_2 q_2^2) + q_1 q_2^2 \left[ b_{12} + (e_{12}/2)\, q_1 \right] - f_{1s} q_1 \tag{46}$$

to render the class of systems of shallow shells. Again sufficiently small levels of
excitation are assumed,

$$K \ll h_1, \quad h_1 = V(q_{1B_i}, q_{2B_i}) - V(q_{1A}, 0). \tag{47}$$

The value of the generalized coordinate at the equilibrium point $A$ is consequently
taken from the single-mode analysis. The coordinates of the saddle-point $B_i$ are the
roots of the static equations resulting from equation (45), see figure 4,


$$\partial V/\partial q_1 = 0, \quad \partial V/\partial q_2 = 0, \tag{48}$$

and are given by the following expressions, where $q_{2B_i}^2 \ge 0$:

$$q_{1B_i} = e_{22}(f_{1s} + a_2 b_{12}/e_{22})/(a_1 e_{22} - 2b_{12}^2 - a_2 e_{12}), \tag{49}$$

$$q_{2B_i} = \pm \left[ -(a_2 + 2b_{12} q_{1B_i} + e_{12} q_{1B_i}^2)/e_{22} \right]^{1/2}. \tag{50}$$

Figure 4. Static force deflection curve, snap-buckling problem; $D^* = 300$, $p^* = 0.5$,
$s^* = 0.2$.
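
The saddle-point expressions can be checked numerically against the static equations. The sketch below assumes the two-mode potential $V = \tfrac14(e_{11}q_1^4 + e_{22}q_2^4) + b_{11}q_1^3 + \tfrac12(a_1q_1^2 + a_2q_2^2) + q_1q_2^2(b_{12} + \tfrac12 e_{12}q_1) - f_{1s}q_1$ and coefficients satisfying $e_{11}e_{22} = e_{12}^2$ and $b_{11}e_{22} = b_{12}e_{12}$, which is what makes the equation for $q_1$ linear. The numerical values are arbitrary test data, not shell parameters from the paper.

```python
import math

def saddle_point(c, f1s):
    """Closed-form saddle coordinates (q1, q2 >= 0); None if q2^2 would be negative."""
    q1 = (c["e22"] * (f1s + c["a2"] * c["b12"] / c["e22"])
          / (c["a1"] * c["e22"] - 2.0 * c["b12"] ** 2 - c["a2"] * c["e12"]))
    q2_sq = -(c["a2"] + 2.0 * c["b12"] * q1 + c["e12"] * q1 ** 2) / c["e22"]
    if q2_sq < 0.0:
        return None
    return q1, math.sqrt(q2_sq)

def grad_V(q1, q2, c, f1s):
    """Partial derivatives of the assumed two-mode potential."""
    dV1 = (c["e11"] * q1 ** 3 + 3.0 * c["b11"] * q1 ** 2 + c["a1"] * q1
           + c["b12"] * q2 ** 2 + c["e12"] * q1 * q2 ** 2 - f1s)
    dV2 = q2 * (c["e22"] * q2 ** 2 + c["a2"] + 2.0 * c["b12"] * q1 + c["e12"] * q1 ** 2)
    return dV1, dV2

# Test coefficients with e_jk and b_1j proportional to the eigenvalue ratios (1, 2).
coeff = {"a1": 3.0, "a2": 2.0, "b11": -2.0, "b12": -4.0,
         "e11": 1.0, "e12": 2.0, "e22": 4.0}

point = saddle_point(coeff, f1s=0.2)
gradient = grad_V(point[0], point[1], coeff, f1s=0.2)
beyond_limit = saddle_point(coeff, f1s=0.5)
```

A vanishing gradient confirms that the closed-form point is an equilibrium of the two-mode system; for loads above the limit the bifurcated equilibrium no longer exists.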


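Sankar & Ariaratnam (1971a), in a detailed analysis, derived the following approximation
for the probability of first snap-buckling within the time interval $T$:

[equation (51) not recoverable from the source]

where

$$(V_{11})_A = \left. \frac{\partial^2 V}{\partial q_1^2} \right|_A, \qquad (V_{s_j s_j})_{B_i} = \left. \frac{\partial^2 V}{\partial s_j^2} \right|_{B_i}, \tag{52}$$

$$(V_{s_1 s_1})_{B_i},\ (V_{s_2 s_2})_{B_i} = \tfrac{1}{2} \left[ (V_{11})_{B_i} + (V_{22})_{B_i} \right] \mp \tfrac{1}{2} \left\{ \left[ (V_{11})_{B_i} - (V_{22})_{B_i} \right]^2 + 4 (V_{12})_{B_i}^2 \right\}^{1/2}, \tag{53}$$

$$(V_{ij})_{B_i} = \left. \frac{\partial^2 V}{\partial q_i\, \partial q_j} \right|_{q_1 = q_{1B_i},\ q_2 = q_{2B_i}}, \quad i, j = 1, 2, \tag{54}$$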

and a transformation to the principal coordinates $s_1$, $s_2$ has been performed by means
of Mohr’s circle:


(55)

with the angle of rotation $\phi$

$$\tan 2\phi = 2 (V_{12})_{B_i} / \left[ (V_{11})_{B_i} - (V_{22})_{B_i} \right]. \tag{56}$$

Figure 5 shows the probability of first occurrence of snap-buckling of a shallow shell
with $\alpha_2^* = 13/5$ (that holds e.g. for a shell above a right-angled isosceles triangular base
with hard hinged supports) as a function of the load ratio $f_s = f_{1s}/\max f_{1s}$. The
maximum load according to the static equations (49) and (50), $\max f_{1s}$, results from the
condition $q_{2B_i}^2 = 0$.

Figure 5. Probability of first occurrence of snap-buckling $P_T$, (51), as a function of
$f_s = f_{1s}/\max f_{1s}$, for various values of $C^*$. The graph is valid for shear deformable
shallow shells with $\alpha_2^* = 13/5$, e.g. right-angled isosceles triangular shell.
$C^* = C/R_0$.


5. Conclusion 

It has been shown that even complex structures like non-regularly shaped shallow 
shells designed with composite materials, forced by lateral loads can be projected onto 
the space of the linear modes of the associated flat plates. The resulting set of coupled 
nonlinear ODEs is transformed and finally brought in a nondimensional form that is of 
the classical type of dynamic equations with potential restoring force. 

Approximating a random environment by physical white noise processes makes the 
classical analysis of the resulting Markov process by means of the FPK equation 
possible. Approximate values of the probabilities of snap-through and snap-buckling 
vibrations are calculated in the parameter space. In the case of single mode approximations
the results are general. Taking into account the influence of higher modes
restricts the results to a class of structures having a definite sequence of the linear
natural frequencies divided by the basic frequency.


List of symbols 


$a_{jk}^*, b_{jk}^*, d_{jk}^*, e_{jk}^*$    coefficients of differential equations (13);
$D$    effective stretching stiffness;
$E_k$    Young’s modulus of the kth layer;
$G_{ck}$    transverse shear modulus of the kth layer;
$H$    mean curvature;
$I_e, II_e$    invariants of the mid-surface strain tensor;
$K$    effective bending stiffness;
$n$    Berger’s in-plane force;
$P_T$    probability of first occurrence of snap-through in a time interval $T$;
$P_T$    probability of first occurrence of snap-buckling in a time interval $T$;
$p$    lateral force loading;
$q_j$    generalized coordinates of shell deflection;
$p(q_1 \cdots q_N, q_1' \cdots q_N')$    transition probability density;
$\langle q_j^2 \rangle$    mean-square response of the jth generalized coordinate;
$1/s$    shear rigidity;



R Heuer, H Irschik and F Ziegler 


$U_b$    bending energy;
$U_m$    membrane energy;
$U_s$    strain energy due to shear deformation;
$V$    potential function;
$w$    mid-surface deflection;
$w_j^*, \alpha_j$    Helmholtz-eigenfunctions and -eigenvalues;
$\kappa^2$    shear factor;
$\rho$    mass density per unit shell area;
$\nu_k$    Poisson’s ratio of the kth layer;
$\psi_x, \psi_y$    cross-sectional rotations.

Homology design of vibration mode shape under uncertain
boundary conditions 

SHIGERU NAKAGIRI and NOBUHIRO YOSHIKAWA 

Institute of Industrial Science, University of Tokyo, 7-22-1, Roppongi, 
Minato-ku, Tokyo 106, Japan 

Abstract. A methodology is proposed for homology design to realize such 
a vibration mode shape that satisfies a certain geometrical relation before 
and during vibration. The formulation is based on the finite element 
eigenvalue and sensitivity analyses so that a governing equation for the 
design variables is derived under the condition that the homology constraint
holds while the eigenvalue problem is satisfied.

The nonlinear effect of uncertain boundary conditions on the homologous
mode shape is examined through a convex model of the uncertainties.
The worst case of the disturbed mode shape due to the uncertainties is 
estimated employing the Lagrange multiplier method. 

The numerical example of out-of-plane vibration of a planar lattice frame 
displays the validity of the proposed method for homology design. The 
worst case of the disturbed mode shape is discussed when rotational 
stiffness of the boundary fluctuates. 

Keywords. Vibration mode shape; homology design; finite element sensitivity
analysis; Moore-Penrose generalized inverse; uncertain boundary
condition; convex model. 


1. Introduction 

Structures have been regarded basically as carriers of external loads in the field of 
structural mechanics. Static structures such as bridges and pressure vessels are not
expected to perform any function or motion actively, other than to serve as passive
bases to carry loads. Integrity and safety have long been pursued as the primary aim of
structural design even in mobile structures such as aircraft, ships and automobiles.

In modern times, dimensional accuracy and excellent performance are demanded of
precision instruments and smart machinery in order to fulfil the mission assigned to the
structures on the premise that their stiffness and strength are ensured. The quality of
mission and performance nowadays is a major concern of adaptive structures, deployable
structures and large flexible spacecraft. For instance, high accuracy in shape
control is required for the antenna mirror of a huge radio telescope at any tilting angle
to achieve high resolution for radio wave collection, while the parabolic shape of the
antenna is likely to be deformed by its self-weight. This is an example where the
deformation should be kept under control as well as stresses.

A list of symbols is given at the end of the paper

Hoerner proposed the concept of homologous deformation in the design of huge 
radio telescopes and stated that the deformation of a structure shall be called 
homologous, if a given geometrical relation holds, for a given number of structural 
points, before, during and after the deformation (Hoerner 1967). His idea was adopted 
in practice to construct a huge radio telescope (Morimoto et al 1982). A finite element 
formulation was proposed for the homology design of the frame structure under static 
loading based on the condition that the solution of the equilibrium equation with 
a rectangular stiffness matrix exists (Hangai & Guan 1989). A different formulation of 
homology design, with less computational effort than the formulation by Hangai, by 
means of avoiding the tiresome calculation of the sensitivities of the Moore-Penrose 
generalized inverse, was made (Yoshikawa & Nakagiri 1993). 

This paper deals with a methodology of the homology design to control vibration 
mode shape at a single eigenmode as the first step for dynamic homology design of flexible
structures. It discusses the nonlinear effect of the uncertainties, involved in the structural 
system, on the homologous mode shape. The methodology is based on the finite 
element sensitivity analysis of vibration eigenvalue problem. Use is made of the convex 
model to estimate the worst case of the disturbed homologous mode shape due to the 
uncertainties (Ben-Haim & Elishakoff 1990). The validity of the proposed methodology 
is examined by the numerical examples concerned with a planar lattice frame. 


2. Statement of the problems 

Suppose we have a baseline design of an elastic structure, and that the linear undamped 
eigenvalue problem of the vibration is given by (1) after the geometrical boundary 
conditions are duly processed, 

$$([\bar{K}] - \bar{\lambda}[\bar{M}])\{\bar{\phi}\} = \{0\}, \tag{1}$$

where $[\bar{K}]$ and $[\bar{M}]$ denote the $N \times N$ symmetric stiffness matrix and consistent mass
matrix, $\bar{\lambda}$ the eigenvalue, and $\{\bar{\phi}\}$ the corresponding eigenvector of the eigenmode
concerned, respectively. The upper bar indicates the current baseline design.
The constraint for the prescribed homologous mode shape can be expressed in the 
following form by using a constraint matrix [C], 

{<M = [C/{0,}. (2, 

where the subscripts h and i stand for J homologous components and /(= N — J) 
independent components of the eigenvector, respectively. We call the mode shape 
homologous if a certain geometrical relation holds in a part or whole of the mode shape of 
a structure before and during harmonic vibration, similar to the definition of homologous 
deformation stated by Hoerner. If the eigenvector obtained from (1) satisfies the 
constraint relation of (2), the baseline itself realizes homology design. Usually, (2) is 
not satisfied, and the baseline design should be changed so that (2) holds for the 
eigenvector of the structure after design change. The first problem in this study is to 
find a way to change the baseline design so that the homologous mode shape is realized. 
The eigenvalue and eigenvector depend on the material constants, dimensions 
and boundary conditions of the structure. Then the homologous mode shape, if 
feasible, is likely to be disturbed by the fluctuation of the aforementioned influential 
factors. The second problem is how to estimate the worst case of the homologous mode 
shape disturbed by uncertain fluctuations in the boundary conditions. 


3. Finite element formulation of homology design 

3.1 Derivation of governing equation of design variables 

The constraint matrix [C] should be adequately constituted, since the way to express 
the homologous mode constraint is not unique even for an identical homologous mode 
shape. In this paper, the ingredients of the matrix [C] are supposed to be constant under 
the assumption that the constraint is linear. In order to find the homology design which 
satisfies (2), we choose M design variables $\alpha_m$ corresponding to design parameters $p_m$ in 
the following form. 

$$p_m = \bar{p}_m(1 + \alpha_m). \qquad (3)$$

It is assumed that the change in the stiffness matrix arising from the design change can 
be expressed by the first-order approximation of the Taylor series expansion in the 
vicinity of a current baseline design, as given below. The subscript m indicates the m-th 
design variable, and the superscript I the first order. The first-order sensitivity of the 
stiffness matrix $[K_m^I]$ can be easily evaluated by taking the derivative of the stiffness 
matrix with respect to $\alpha_m$. 

$$[K] = [\bar{K}] + \sum_{m=1}^{M} [K_m^I]\,\alpha_m. \qquad (4)$$

The change in the mass matrix is approximated in the same form as above. When we 
express the change in the eigenvector in the following form, 

$$\{\phi\} = \{\bar{\phi}\} + \sum_{m=1}^{M} \{\phi_m^I\}\,\alpha_m, \qquad (5)$$

and that of the eigenvalue in the same form, we can evaluate the sensitivities of the 
eigenpair $\lambda_m^I$ and $\{\phi_m^I\}$ by the conventional method of sensitivity analysis (Fox 
& Kapoor 1968). 

In the framework of the first-order approximation, the sensitivities of the eigenpair 
thus obtained ensure that the eigenvalue problem in the form of (1) is satisfied even after 
the design change. Substituting the relevant parts of the eigenvector of (5) into (2), we 
have (6), and its summarized form is expressed as (7), 


$$\sum_{m=1}^{M} \left(\{\phi_{hm}^I\} - [C]\{\phi_{im}^I\}\right)\alpha_m = [C]\{\bar{\phi}_i\} - \{\bar{\phi}_h\}, \qquad (6)$$

$$[A]\{\alpha\} = \{b\}, \qquad (7)$$

where [A] is a J x M rectangular matrix and $\{\alpha\}$ the unknown vector consisting of 
M design variables. This is the governing equation of the design variables of the 
homology design for mode shape. 

3.2 Determination of design variables 

The unknown vector $\{\alpha\}$ can be solved through (7) by use of the Moore-Penrose 
generalized inverse of the rectangular matrix [A] (Rao & Mitra 1971). The solution $\{\alpha\}$ 
exists when the following equation is satisfied, 

$$([I] - [A][A]^{-})\{b\} = \{0\}, \qquad (8)$$

where [I] denotes a J x J identity matrix. The superscript (-) 
indicates the Moore-Penrose generalized inverse. The general solution $\{\alpha\}$ is given by 

$$\{\alpha\} = [A]^{-}\{b\} + ([I] - [A]^{-}[A])\{h\}, \qquad (9)$$

where the first term on the right-hand side is called the particular solution, and the 
second one the complementary solution. In (9), [I] indicates an M x M 
identity matrix, and {h} stands for an arbitrary vector. The particular solution results in 
the minimal norm of $\{\alpha\}$, owing to the property of the Moore-Penrose generalized 
inverse. 

We determine the design variables only by the particular solution of (9), that is, we set 
the arbitrary vector {h} equal to a null vector, so as to minimize the design change from 
the current baseline design in the sense that the norm of the design variables is minimal. 
Because the formulation rests on a first-order approximation, the homology design is 
completed by renewing the baseline design and repeating the finite element eigenvalue 
analysis and sensitivity analysis for the renewed baseline design until (2) holds exactly. 
Equation (6) indicates that no further design change takes place once (2) is satisfied 
exactly, because then $\{\alpha\} = \{0\}$. 
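
The determination of the design variables described above can be sketched numerically as follows. This is only an illustrative sketch, not the authors' implementation: the matrix `A`, the vector `b` and the helper name `homology_step` are hypothetical, and NumPy's `pinv` is assumed as the way to form the Moore-Penrose generalized inverse. It checks the existence condition (8) and shows that the particular solution of (9) has the smallest norm among all solutions.

```python
import numpy as np

def homology_step(A, b, tol=1e-10):
    """Minimum-norm design change from [A]{alpha} = {b}, cf. (7)-(9)."""
    A_pinv = np.linalg.pinv(A)              # Moore-Penrose generalized inverse [A]^-
    residual = b - A @ (A_pinv @ b)         # existence condition (8): should vanish
    solvable = np.linalg.norm(residual) <= tol * max(1.0, np.linalg.norm(b))
    alpha = A_pinv @ b                      # particular solution, {h} = {0}
    return alpha, solvable

# Hypothetical data: J = 2 homologous constraints, M = 4 design variables
A = np.array([[1.0, 2.0, 0.0, 1.0],
              [0.0, 1.0, 1.0, 3.0]])
b = np.array([0.3, -0.1])
alpha, solvable = homology_step(A, b)

# Any other solution of (9), built with an arbitrary {h}, also satisfies (7)
# but cannot have a smaller norm than the particular solution.
h = np.array([1.0, -2.0, 0.5, 0.7])
other = alpha + (np.eye(4) - np.linalg.pinv(A) @ A) @ h
print(np.linalg.norm(alpha), np.linalg.norm(other))
```

In an actual design loop this step would be repeated after each renewal of the baseline design, with `A` and `b` rebuilt from the new eigenvector sensitivities.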


4. Convex approach to estimate disturbed mode shape due to uncertain boundary conditions 

4.1 Sensitivity analysis of homologous mode shape 

Suppose that (10) represents the eigenvalue problem which affords the homologous 
mode shape $\{\phi_a\}$ as aimed at, corresponding to the eigenvalue $\lambda_a$, 

$$([K_a] - \lambda_a[M_a])\{\phi_a\} = \{0\}, \qquad (10)$$

where $[K_a]$ and $[M_a]$ denote the N x N stiffness matrix and mass matrix obtained by 
the homology design, respectively. In this section we examine the nonlinear effect of the 
uncertain degree of boundary constraint on the homologous mode shape. The following 
formulation deals with the fluctuation of $[K_a]$ due to L uncertain factors $f_l$, which 
stand for the degree of boundary constraint and are supposed to be independent of each 
other. The fluctuation of a factor $f_l$ is expressed in the form of (11) with an uncertain variable $\varepsilon_l$, 
whose mean value is assumed to be equal to zero, 

$$f_l = f_l^{*}(1 + \varepsilon_l), \qquad (11)$$

where the asterisk indicates the mean value hereafter. It is assumed that the fluctuation 
of $[K_a]$ in the vicinity of the mean values of the uncertain factors can be approximated 
in the first-order form of (12). This causes a nonlinear effect on the uncertain fluctuation 
of the eigenvector of the homologous mode shape $\{\phi_a\}$. The first-order approximation 
of the eigenvector change is given by (13) below, its sensitivity being evaluated in 
a manner similar to that for $\{\phi_m^I\}$, 

$$[K_a] = [K_a^{*}] + \sum_{l=1}^{L} [K_{al}^I]\,\varepsilon_l, \qquad (12)$$

$$\{\phi_a\} = \{\phi_a^{*}\} + \sum_{l=1}^{L} \{\phi_{al}^I\}\,\varepsilon_l. \qquad (13)$$

Equation (13) enables us to express the first-order approximation of the uncertain 
deviation vector $\{\kappa\}$ of the mode shape from the homologous mode shape as given 
below, 

$$\{\kappa\} = [C]\{\phi_{ai}\} - \{\phi_{ah}\} = [S]\{\varepsilon\}, \qquad (14)$$

where the sensitivity matrix [S] consists of the sensitivities of the eigenvector with 
respect to the uncertain variables, and $\{\varepsilon\}$ denotes the uncertain variable vector. 

4.2 Worst case estimated by convex model 

Now that the uncertain deviation vector is approximated in the first-order form, an 
index to indicate the magnitude of the disturbance caused by the uncertainties can be 
devised. The weighted sum of the absolute values of the components of the aforementioned 
deviation vector is a candidate for a linear definition. Instead, we define an index $\Delta$ in 
a manner similar to the definition of standard deviation, namely, the square root of 
the squared sum of the differences between the vector components and the mean value of 
the components. The square of the index is summarized in the following form, 

$$\Delta^2 = \{\varepsilon\}^T[D]\{\varepsilon\}, \qquad (15)$$

where the deviation matrix [D] is derived mainly by taking the product of the sensitivity 
matrix [S] in a proper way, and the superscript T indicates the transpose of a matrix. 

The magnitude of the deviation index $\Delta$ depends on the uncertain variable 
vector. The aim of this study is not to discuss how widely the deviation index spreads, 
but to formulate a convex approach for the estimation of its maximal value, namely, its 
worst case for a given set of the uncertain variables. In the convex models of uncertainty 
(Ben-Haim & Elishakoff 1990), we do not define any probabilistic measure for the 
uncertain variables, but define a domain in which conceivable uncertain variables are 
confined. In other words, we specify a domain where the uncertain variables are 
expected to exist, no matter how scarce our knowledge of the uncertain variables may be. In 
case this domain is a convex hull, the maximal value of any convex function of the 
uncertain variables lies on the domain boundary. This property enables us to search for the 
worst case of the deviation index on the convex hull. We deal with the case where the 
uncertain variables are confined to a convex hull expressed as a hyperellipse in 
L-dimensional space as follows, 

$$e^2\{\varepsilon\}^T[\Omega]\{\varepsilon\} \le 1. \qquad (16)$$

The convexity of the domain is ensured when the matrix $[\Omega]$ in (16) is positive-definite. 
The matrix $[\Omega]$ is employed to prescribe the characteristics of the uncertain variables. 
We call it the characteristic matrix. The constant e in the above equation is introduced to 
set the expanse of the domain, which is to be determined by our knowledge about the 
uncertainties. 

The worst case of the deviation index can be searched for easily by the Lagrange 
multiplier method applied to the following functional, in which $\nu$ denotes the Lagrange 
multiplier. 

$$\Pi = \{\varepsilon\}^T[D]\{\varepsilon\} + \nu\left(e^2\{\varepsilon\}^T[\Omega]\{\varepsilon\} - 1\right). \qquad (17)$$

The stationary conditions of the functional with respect to the uncertain variable vector 
and the Lagrange multiplier give rise to the following. 

$$\frac{\partial\Pi}{\partial\{\varepsilon\}} = 2\left([D] + \nu e^2[\Omega]\right)\{\varepsilon\} = \{0\}, \qquad (18)$$

$$\frac{\partial\Pi}{\partial\nu} = e^2\{\varepsilon\}^T[\Omega]\{\varepsilon\} - 1 = 0. \qquad (19)$$

Equation (18) constitutes an eigenvalue problem defined by the matrices [D] and $[\Omega]$ 
for the eigenvalue $\mu = -\nu e^2$, and (19) is the normalizing condition of the eigenvector. Once 
the eigenvalue problem of (18) is solved, we obtain the eigenvalue $\mu$ and the corresponding 
eigenvector $\{\psi\}$, which indicate the worst case of the deviation index as given below, 

$$\Delta^2 = \{\psi\}^T[D]\{\psi\} = -\nu e^2\{\psi\}^T[\Omega]\{\psi\} = -\nu = \mu/e^2, \qquad (20)$$

and the set of the uncertain variables from which the worst case arises. It is needless to 
say that the maximal eigenvalue is to be computed in the Lth mode in the eigenvalue 
analysis. Equation (20) means that the eigenvalue $\mu$ represents the essential sensitivity 
of the deviation index with respect to the magnitude of the uncertain variables. 
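
The worst-case search can be illustrated with a short numerical sketch. It is a minimal example under stated assumptions, not the authors' code: the sensitivity matrix `S` is random stand-in data, the deviation matrix is simply taken as `D = S.T @ S` (the paper forms [D] from [S] "in a proper way" that is not spelled out here), and the function name `worst_case` is hypothetical. The generalized eigenvalue problem between the deviation matrix and the characteristic matrix is reduced to a standard symmetric one through a Cholesky factor of the characteristic matrix.

```python
import numpy as np

def worst_case(D, Omega, e=1.0):
    """Maximise eps^T D eps subject to e^2 eps^T Omega eps <= 1.

    Returns the worst-case squared deviation index mu/e^2 and the
    uncertain-variable vector psi that attains it.
    """
    # Omega = Lc Lc^T (positive-definite) turns D psi = mu Omega psi
    # into a standard symmetric eigenvalue problem for y = Lc^T psi.
    Lc = np.linalg.cholesky(Omega)
    Linv = np.linalg.inv(Lc)
    mu_all, Y = np.linalg.eigh(Linv @ D @ Linv.T)  # eigenvalues in ascending order
    mu = mu_all[-1]                                # maximal eigenvalue
    psi = Linv.T @ Y[:, -1] / e                    # scaled so that e^2 psi^T Omega psi = 1
    return mu / e**2, psi

# Hypothetical data: 3 deviation components, L = 4 uncertain variables
rng = np.random.default_rng(0)
S = rng.normal(size=(3, 4))            # stand-in sensitivity matrix [S]
D = S.T @ S                            # assumed form of the deviation matrix [D]
Omega = np.diag([1.0, 2.0, 0.5, 1.5])  # stand-in characteristic matrix
delta2, psi = worst_case(D, Omega, e=2.0)
print(delta2, psi)
```

Doubling `e` halves the admissible domain in every direction, so the returned squared index scales with 1/e^2, in line with the relation between the deviation index and the eigenvalue stated above.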


5. Numerical example 

5.1 Homology design for mode shape 

The formulations of the homology design and worst case search are examined through 
a numerical example concerned with out-of-plane vibration of a planar lattice frame 
simply-supported at four points as shown in figure 1. The lattice is discretized into finite 
elements. Each section is represented by a bending finite element with torsion. The 
cross-section of all elements is circular and 50 mm in diameter in the initial baseline 
design. The Young's modulus, modulus of rigidity, Poisson's ratio and mass density of 
the material are set equal to 70.0 GPa, 26.9 GPa, 0.3 and $3.0 \times 10^{3}$ kg/m$^3$, respectively. 
The geometrical boundary conditions at the supports are represented by virtual 
springs attached to the nodes, whose spring constant is expressed as $sEI/l^3$ for the 
deflection constraint and as $cEI/l$ for the rotation constraint. 
In these expressions, EI and l denote the flexural rigidity and the section length of 1,000 mm 
of the beam element in the initial design, respectively. The case of s = c = 0 indicates 
free support, while $s = c = \infty$ indicates fixed support. The simply-supported condition in 
the finite element analysis can be expressed by s > 50.0 and c = 0. The natural 
frequency of the first mode is 4.571 Hz for the initial design for nearly simply-supported 
conditions with the inputs of s = 100.0 and c = 0.1, and the saddle-shape mode is shown 
in figure 2. 

Figure 1. Planar lattice frame simply-supported at four points. 

Figure 2. First mode shape of lattice of initial design. 

Suppose that an optical system is to be mounted on the central rib encircled by a 
broken line in figure 1, and the deflection arising from the saddle-shape mode should be 
eliminated to keep accurate collimation of the optical axis. This elimination can be 
achieved by homology design to set a homologous mode shape of the three ribs in the 
y-direction straight and parallel to the undeformed state. 

Figure 3 illustrates the mode shape obtained by the homology design, in which 22 
design variables are assigned to the diameters of all the section members. The baseline 
design has to be renewed three times until the three ribs are made sufficiently straight. 
The eigenvector is normalized as $\{\phi\}^T\{\phi\} = 1$. In carrying out the design change, the 
Moore-Penrose generalized inverse $[A]^{-}$ is calculated on the basis of the eigenvalue 
analysis related to $[A][A]^T$ (Rao & Mitra 1971). It is shown in the figure that the 
saddle-shape mode is eliminated, and the lattice is deformed not like a plate but like 
a beam, when viewed in the y-direction, as is aimed at. Figure 4 depicts the section 
diameters determined after the homology design is achieved. The natural frequency is 
decreased to 1.557 Hz. A simple way to keep the ribs straight is to make the flexural 
rigidity of the ribs enormously large, and this inevitably causes a considerable increase in 
the lattice weight. On the other hand, the homology design results in a weight increase of 
only 76.2% of the initial weight. This implies that homology design is capable of 
realizing the objective mode shape with a slight increase in structure weight. 

Figure 3. First mode shape obtained by homology design. 

Figure 4. Changes in diameter required for homology design. 


5.2 Convex analysis of disturbed mode shape 

The nonlinear effect of the boundary constraint coefficient for rotation c on the first 
natural frequency is depicted in figure 5, for which the value s is kept equal to 100.0. The 
dotted line in the figure indicates the natural frequency calculated by s = 100.0 and 
c = 0.0. The uncertain fluctuation of the boundary conditions of nearly simple support 
is simulated by taking the coefficient c as an uncertain factor, and the worst case of the 
disturbed homologous mode shape can be estimated by the formulation given in § 4.2. 

Table 1 summarizes the result of the worst case estimation by means of the convex 
approach. The mean values of the fluctuating coefficients c for the four supports are set 
equal to 0.1, the deterministic value s being kept constant at 100.0. These inputs enable 
us to evaluate the deviation matrix [D] duly. The fluctuation of c is assumed to arise 
from a common cause at all four supports and to be uncorrelated between supports. Then 
we assume that the characteristic matrix $[\Omega]$ is given in the form of an identity matrix. 
Solving the eigenvalue problem of (18), we obtain the value of $\mu/e^2$ as listed in the 
second column of table 1 for e = 1.0, together with the corresponding eigenvector 
components, which are normalized so that the norm is equal to unity, in the fourth 
through seventh columns. The worst case estimated by the convex approach is given by 
the maximal eigenvalue of $0.1633 \times 10^{-2}$ (deviation index equal to 0.04041) with regard 
to e = 1.0. On the other hand, the value of $\Delta^2$ calculated from the disturbed mode shape 
for $\varepsilon_1 = 0.4877$, $\varepsilon_2 = 0.5036$ and so on for e = 1.0 is $0.1711 \times 10^{-2}$, as listed in the third 
column. The agreement between the two values indicates that the estimation by the 
convex approach based on the first-order approximation is satisfactory even for the rather 
large fluctuation of e = 1.0. When we know that the expanse of the ellipse is smaller, for example 
e = 2.0, we can estimate the deviation index as equal to 0.02021 = 0.04041/2.0. 

Figure 5. Nonlinear increase of natural frequency with rotation constraint. 

Table 1. Worst case estimated by convex model. 

Mode   mu/e^2 (x 10^-2)   Delta^2 (x 10^-2)   eps_1     eps_2     eps_3     eps_4 
1      0.1633             0.1711               0.4877    0.5036   -0.5097   -0.4987 
2      0.0673             0.0828              -0.4783    0.4720    0.5223   -0.5249 
3      0.0008             0.0054              -0.5165   -0.4920   -0.5107   -0.4800 
4      0.0000             0.0048               0.5163   -0.5305    0.4545   -0.4954 
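
The arithmetic behind these figures can be retraced with a few lines. This is only a check of the numbers quoted from table 1, assuming the relation that the deviation index is the square root of the maximal eigenvalue divided by e; the variable names are illustrative.

```python
import math

mu_over_e2 = 0.1633e-2                     # maximal eigenvalue from table 1 (e = 1.0)
psi = [0.4877, 0.5036, -0.5097, -0.4987]   # worst-case eigenvector components, mode 1

delta_e1 = math.sqrt(mu_over_e2)           # deviation index for e = 1.0
delta_e2 = delta_e1 / 2.0                  # deviation index for e = 2.0
psi_norm = math.sqrt(sum(c * c for c in psi))  # should be close to unity

print(round(delta_e1, 5), round(delta_e2, 5), round(psi_norm, 4))
```

The index scales with 1/e because a larger e shrinks the hyperellipse in which the uncertain variables are confined.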

6. Concluding remarks 

Success of homology design is decided by adequate setting of the initial design, 
judicious choice of the design parameters and proper formulation. The validity of the 
proposed methodology for homology design and estimation of the worst case of the 
disturbed mode shape is demonstrated by the numerical example concerned with 
a single mode shape of the out-of-plane vibration of a planar frame for which the 
homology design is possible. The possibility of the homology design which controls the 
multiple mode shapes should be examined, and a pertinent formulation would be 
demanded in the future when homology design in the field of forced vibration and 
random vibration is desired. The convex model of uncertainties seems to be useful as 
a device to formulate robust homology design in the future because the model allows 
us to estimate the worst case in a straightforward way. It is desirable to establish the 
means to form the characteristic matrix methodically and realistically based on the 
knowledge about uncertainties such as variance and correlation. 

List of symbols 

[A] coefficient matrix; 
{b} calculated vector; 
[C] matrix representing homologous mode constraint; 
c coefficient for rotation constraint; 
[D] deviation matrix; 
EI flexural rigidity of beam element; 
e coefficient indicating expanse of uncertainty domain; 
f uncertainty factor; 
{h} arbitrary vector; 
[I] identity matrix; 
[K] stiffness matrix; 
l length of beam element; 
[M] mass matrix; 
$p_m$ design parameter; 
[S] sensitivity matrix; 
s coefficient for deflection constraint; 
$\{\alpha\}$ design variables vector; 
$\alpha_m$ design variable; 
$\varepsilon_l$ uncertainty variable; 
$\lambda$ eigenvalue; 
$\mu$ eigenvalue (worst case); 
$\nu$ Lagrange multiplier; 
$\{\phi\}$ eigenvector; 
$\{\psi\}$ eigenvector (worst case); 
$[\Omega]$ characteristic matrix. 

Subscripts 

a achieved homology design; 
h homologous; 
i independent; 
m mth variable; 

Superscripts 

I first-order; 
(-) Moore-Penrose generalized inverse; 
(overbar) current baseline design; 
(*) indicating mean value. 

Effects of distributed delays on the stability of structures 
under seismic excitation and multiplicative noise 

Abstract. The effects of seismic excitation and multiplicative noise (arising 
from environmental fluctuations) on the stability of a single degree of 
freedom system with distributed delays are investigated. The system is 
modelled in the form of a stochastic integro-differential equation interpreted 
in the Stratonovich sense. Both deterministic stability and stochastic 
moment stability are examined for the system in the absence of seismic 
excitation. The model is also extended to incorporate effects of symmetric 
nonlinearity. The simulation of stochastic linear and nonlinear systems is 
carried out by resorting to numerical techniques for the solution of 
stochastic differential equations. 

Keywords. Stochastic stability; weak and strong time delay; random 
oscillations; seismic excitation; stochastic differential equations; numerical 
methods; implicit Euler, Mil’shtein and balanced methods. 


1. Introduction 

The technique of vibration suppression by active control of civil engineering structures 
has drawn considerable attention in recent years (Abdel-Rohman 1987; Chung et al 
1988; Pu & Kelly 1990; Yang et al 1990; Agrawal et al 1993; Zhang et al 1993). The 
purpose of applying the control is to reduce the damage caused by unexpected 
disturbances like earthquakes, wind and dynamic excitations. The main emphasis of 
the work done on the control design concerns instability of structures due to unavoidable 
time delay, unless it could be compensated properly, e.g. see Agrawal et al (1993). 
Time delay arises on account of a number of factors such as the time taken in data 
acquisition from sensors located at different parts of the structure or unsynchronized 
application of control forces. Even if the time delay is negligible, the structure could be 
destabilized due to excitations by the control force. 

A number of attempts have been made to treat the system as a linear optimal 
feedback control problem. These studies have been restricted to SDOF (single-degree 
of freedom) and n-DOF controlled structures with constant time delay described by 
linear differential equations. An important aspect which has generally been ignored in 
the modelling of such systems is the presence of stochastic effects. It is well known that 
parametric noise can destabilize a linear system which is otherwise stable, see Kozin 
(1969), Arnold (1974), Karmeshu & Bansal (1975), Karmeshu (1976) or Ariaratnam & 
Srikantiah (1978). Moreover, the delay magnitude varies from one sensor to another 
and from one actuator to another, as already mentioned in Abdel-Rohman (1987). It 
may therefore be useful to examine the effect of continuous and nonconstant time delay instead of 
fixed time lags. 

The object of this paper is to investigate the effects of seismic excitations and 
multiplicative noise on the stability of an SDOF system with distributed delays. The 
paper consists of six sections. Section 2 contains the formulation of the model. The 
resulting stochastic differential equation (SDE) with multiplicative noise is interpreted 
in the Stratonovich sense. Section 3 is concerned with the stability analysis of the 
deterministic linear system. Sections 4 and 5 are devoted to the stochastic analysis of 
the linear system with weak and strong delay. Mean square stability of the system 
without seismic excitation is also investigated. Section 6 deals with the numerical 
simulation for transient solution of the linear stochastic model. Effects of nonlinearity 
in the stochastic SDOF system are contained in § 7. Finally, the last section summarizes 
the observed effects and ends with concluding remarks. In the appendices numerical 
methods for the solution of SDEs and results on mean square stability used for robust 
and reliable simulations have been outlined. 


2. Formulation of the models with distributed delays 


The model considered here is based on distributed delays instead of fixed discrete time 
delay. The displacement x(t) of an SDOF structure follows the integro-differential 
equation 

$$\ddot{x} + \omega^2 x + 2\zeta\omega\dot{x} + g_1\int_0^t K_1(t-s)\,x(s)\,ds + g_2\int_0^t K_2(t-s)\,\dot{x}(s)\,ds = 0, \qquad (1)$$

where $\omega$, $\zeta$, $g_1$, $g_2$ are nonnegative real parameters. The parameters $g_1$ and $g_2$ represent 
feedback gains of the displacement and velocity of the oscillations, whereas $\omega$ and 
$\zeta$ are the natural frequency and the damping ratio, respectively. $K_1(t)$ and $K_2(t)$ are 
absolutely integrable weight functions specifying the distributed delays. For simplicity, 
it is also supposed that these functions are normalized, i.e. 

$$\int_0^\infty |K_1(u)|\,du = 1 \quad\text{and}\quad \int_0^\infty |K_2(u)|\,du = 1. \qquad (2)$$

Equation (1) can be interpreted as a distributed delay version of the SDOF system with 
discrete delay considered by Agrawal et al (1993). 

When the system is subjected to seismic excitations and environmental fluctuations, 
(1) modifies to the form 

$$\ddot{x} + \omega^2 x + 2\zeta\omega\dot{x} + g_1\int_0^t K_1(t-s)\,x(s)\,ds + g_2\int_0^t K_2(t-s)\,\dot{x}(s)\,ds = F(x, \dot{x}, t), \qquad (3)$$

where $F(x, \dot{x}, t)$ reflects stochastic perturbation forces which can be decomposed as 

$$F(x, \dot{x}, t) = \sigma_1 x(t)\,\xi_1(t) + \sigma_2\dot{x}(t)\,\xi_2(t) + \sigma_3 z(t). \qquad (4)$$

The first two terms on the right side of (4) correspond to stochastic environmental 
perturbations, and the last term represents a ground level acceleration corresponding 
to seismic excitations. The random environmental perturbations per unit displacement 
and per unit velocity are modelled by the independent white noise processes $\xi_1(t)$ 
and $\xi_2(t)$. The constants $\sigma_i$ (i = 1, 2) give the magnitude of the fluctuations incorporated in (4). 
The first two terms express state-dependent noise. Such terms are called multiplicative 
noise. 

Several attempts to model seismic excitations have been made, e.g. Shinozuka (1967, 
1972) or Kozin (1977). Following Bolotin (1960), seismic excitation can be modelled as 
the nonstationary process 

$$z(t) = I(t)\,\xi_3(t), \qquad (5)$$

where $\xi_3(t)$ is a white noise process independent of $\xi_1(t)$ and $\xi_2(t)$. I(t) is supposed 
to be of the form 

$$I(t) = \exp(-\alpha t) - \exp(-\beta t), \quad 0 < \alpha < \beta, \qquad (6)$$

with its parameters $\alpha$ and $\beta$. 

The resulting integro-differential equation (3) can be interpreted in many different 
ways due to the variety of stochastic integration calculus. Two major interpretations 
have crystallized, namely Ito and Stratonovich calculus. These two are also related to 
each other in the sense that the results of one of them can be obtained from the other via 
a transformation formula, see e.g. Arnold (1974). Here we have adopted the 
Stratonovich interpretation as it is preferable for modelling physical phenomena, cf. 
Wong & Zakai (1965). Thus, from now on we make use of the terms ‘Stratonovich 
interpretation’ and ‘Ito prescription’. 

Additionally, when effects of symmetric nonlinearities are to be incorporated, see 
Moon (1987), the equation of motion takes the form 

$$\ddot{x} + \omega^2 x + 2\zeta\omega\dot{x} + g_1\int_0^t K_1(t-s)\,x(s)\,ds + g_2\int_0^t K_2(t-s)\,\dot{x}(s)\,ds + \gamma x^3 = F(x, \dot{x}, t). \qquad (7)$$

Note that in the absence of external excitations and of displacement and velocity feedback 
gains, (7) has the form of the well-known Duffing equation, cf. Iyengar (1986), Moon 
(1987). To analyse the model we need specific forms of the weight functions $K_1(t)$ and 
$K_2(t)$. For the sake of simplicity we shall examine only the case 
$K_1(t) = K_2(t) = K(t)$ in this paper. The main attention is drawn to the two forms 

$$K(t) = \nu\exp(-\nu t) \quad\text{and} \qquad (8)$$

$$K(t) = \nu^2 t\exp(-\nu t). \qquad (9)$$

In consonance with similar approaches in population ecology, e.g. MacDonald (1978), 
the form (8) is termed 'weak delay' and (9) 'strong delay'. At first we consider the 
case incorporating weak delay. 
Weak delay. Writing $x_1(t)$ for the displacement x(t) and $x_2(t)$ for the velocity $\dot{x}(t)$ 
of the vibration, system (7) is equivalently described by the set 

$$\begin{aligned}
\dot{x}_1 &= x_2, \\
\dot{x}_2 &= -(\omega^2 x_1 + 2\zeta\omega x_2 + g_1 x_3 + g_2 x_4 + \gamma x_1^3) + \sigma_1 x_1\xi_1(t) + \sigma_2 x_2\xi_2(t) + \sigma_3 I(t)\xi_3(t), \\
\dot{x}_3 &= -\nu x_3 + \nu x_1, \\
\dot{x}_4 &= -\nu x_4 + \nu x_2.
\end{aligned} \qquad (10)$$

Thus, the stochastic integro-differential equation (7) has been replaced by a set of four 
coupled differential equations. 

Strong delay: In a similar way we now introduce the system with strong delay. Using 
the symbols $x_1(t) = x(t)$ and $x_2(t) = \dot{x}(t)$, the stochastic integro-differential equation 
can be rewritten as 

$$\begin{aligned}
\dot{x}_1 &= x_2, \\
\dot{x}_2 &= -(\omega^2 x_1 + 2\zeta\omega x_2 + g_1 x_3 + g_2 x_5 + \gamma x_1^3) + \sigma_1 x_1\xi_1(t) + \sigma_2 x_2\xi_2(t) + \sigma_3 I(t)\xi_3(t), \\
\dot{x}_3 &= -\nu x_3 + \nu x_4, \\
\dot{x}_4 &= -\nu x_4 + \nu x_1, \\
\dot{x}_5 &= -\nu x_5 + \nu x_6, \\
\dot{x}_6 &= -\nu x_6 + \nu x_2.
\end{aligned} \qquad (11)$$

Thus, the original systems with both weak and strong delay have been expanded 
into specific sets of coupled stochastic differential equations without time delay. This 
reduction is possible owing to the special kernel structure of the distributed lag. 
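
The replacement of the delay integral by an auxiliary differential equation can be checked numerically. The sketch below is illustrative only and rests on assumptions that are not taken from the paper: the integral is started at s = 0 with a zero initial value of the auxiliary variable, the displacement history is chosen as x(s) = sin(s), and the delay parameter is set to 2. It compares a trapezoidal evaluation of the weak-delay integral with a Runge-Kutta solution of the auxiliary equation for the delayed displacement.

```python
import numpy as np

nu = 2.0          # hypothetical delay parameter
x = np.sin        # hypothetical displacement history x(s) = sin(s)

def weak_delay_integral(t, n=20001):
    """Trapezoidal value of int_0^t nu*exp(-nu*(t-s))*x(s) ds."""
    s = np.linspace(0.0, t, n)
    f = nu * np.exp(-nu * (t - s)) * x(s)
    return float(np.sum(0.5 * (f[1:] + f[:-1]) * np.diff(s)))

def weak_delay_ode(t_end, n=4000):
    """Classical RK4 solution of dx3/dt = -nu*x3 + nu*x(t) with x3(0) = 0."""
    h = t_end / n
    rhs = lambda t, y: -nu * y + nu * x(t)
    t, y = 0.0, 0.0
    for _ in range(n):
        k1 = rhs(t, y)
        k2 = rhs(t + h / 2, y + h * k1 / 2)
        k3 = rhs(t + h / 2, y + h * k2 / 2)
        k4 = rhs(t + h, y + h * k3)
        y += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
        t += h
    return float(y)

t_end = 5.0
x3_integral = weak_delay_integral(t_end)
x3_ode = weak_delay_ode(t_end)
print(x3_integral, x3_ode)   # the two values should agree closely
```

The strong-delay kernel is the convolution of two weak-delay kernels, which is why two chained auxiliary equations per delayed quantity appear in the six-dimensional system.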


3. Stability analysis of the deterministic linear model 

When stochastic perturbation forces as well as cubic nonlinearity are absent the 
equation of motion simplifies to (1). Its Laplace transformation gives the characteristic 
equation 

$$H(\lambda) = \lambda^2 + \omega^2 + 2\zeta\omega\lambda + g_1\hat{K}_1(\lambda) + g_2\lambda\hat{K}_2(\lambda) = 0, \qquad (12)$$

where $\hat{K}_i(\lambda)$ is the Laplace transform of $K_i(t)$ (i = 1, 2). The stability of the linear system 
described by (1) requires that all roots of (12) have negative real parts. Assume that 
$K_1(t) = K_2(t) = K(t)$. 

Now we examine the two cases corresponding to weak and strong delay. 

Weak delay: In the case $K(t) = \nu\exp(-\nu t)$, (12) becomes 

$$(\lambda + \nu)(\lambda^2 + 2\zeta\omega\lambda + \omega^2) + \nu(g_2\lambda + g_1) = 0. \qquad (13)$$

Applying the Routh-Hurwitz criterion, e.g. see May (1974), (13) possesses roots with 
negative real parts iff $2\zeta\omega + \nu > 0$, $\nu(\omega^2 + g_1) > 0$ and 

$$(2\zeta\omega + \nu)(2\zeta\omega + g_2) + 2(\zeta\omega^3/\nu) > g_1. \qquad (14)$$

Strong delay: Suppose $K(t) = \nu^2 t\exp(-\nu t)$. Then (12) reduces to 

$$(\lambda^2 + 2\nu\lambda + \nu^2)(\lambda^2 + 2\zeta\omega\lambda + \omega^2) + \nu^2(g_2\lambda + g_1) = 0. \qquad (15)$$

Proceeding as above, the stability conditions are $\zeta\omega + \nu > 0$, 

$$\nu^2(\omega^2 + g_1) > 0 \quad\text{and}\quad \frac{\omega^2}{\nu} + 4\zeta\omega + \nu > \frac{\omega^2 + g_1}{\eta} + \eta, \qquad (16)$$

where 

$$\eta = \frac{2\omega^2 + 2\nu\zeta\omega + g_2\nu}{2(\zeta\omega + \nu)}.$$
A cursory comparison of inequalities (14) and (16) shows that stability conditions in 
case of weak delay are more likely to be satisfied than those for strong delay. 
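
A quick numerical cross-check of the two sets of stability conditions against the roots of the corresponding characteristic polynomials might look as follows. It is a sketch under assumptions: the parameter values are hypothetical, the function names are illustrative, and the strong-delay inequality is used in the form obtained from the Routh-Hurwitz conditions of the quartic.

```python
import numpy as np

def weak_delay_stable(omega, zeta, g1, g2, nu):
    """Routh-Hurwitz conditions for the cubic of the weak-delay case."""
    return bool(2 * zeta * omega + nu > 0 and nu * (omega**2 + g1) > 0 and
                (2 * zeta * omega + nu) * (2 * zeta * omega + g2)
                + 2 * zeta * omega**3 / nu > g1)

def weak_delay_roots(omega, zeta, g1, g2, nu):
    """Roots of (l + nu)(l^2 + 2 zeta omega l + omega^2) + nu (g2 l + g1)."""
    poly = np.polyadd(np.polymul([1.0, nu], [1.0, 2 * zeta * omega, omega**2]),
                      [nu * g2, nu * g1])
    return np.roots(poly)

def strong_delay_stable(omega, zeta, g1, g2, nu):
    """Routh-Hurwitz conditions for the quartic of the strong-delay case."""
    eta = (2 * omega**2 + 2 * nu * zeta * omega + g2 * nu) / (2 * (zeta * omega + nu))
    return bool(zeta * omega + nu > 0 and nu**2 * (omega**2 + g1) > 0 and
                omega**2 / nu + 4 * zeta * omega + nu > (omega**2 + g1) / eta + eta)

def strong_delay_roots(omega, zeta, g1, g2, nu):
    """Roots of (l + nu)^2 (l^2 + 2 zeta omega l + omega^2) + nu^2 (g2 l + g1)."""
    poly = np.polyadd(np.polymul([1.0, 2 * nu, nu**2],
                                 [1.0, 2 * zeta * omega, omega**2]),
                      [nu**2 * g2, nu**2 * g1])
    return np.roots(poly)

# Hypothetical parameter set
params = dict(omega=1.0, zeta=0.05, g1=0.5, g2=0.2, nu=1.0)
print(weak_delay_stable(**params), weak_delay_roots(**params).real.max())
print(strong_delay_stable(**params), strong_delay_roots(**params).real.max())
```

For any parameter set, the Boolean returned by each criterion should agree with the sign of the largest real part of the corresponding roots.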


4. Stochastic analysis of the linear system with weak delay 


While neglecting effects of nonlinearity ($\gamma = 0$), the system under seismic excitation and 
environmental fluctuations is governed by the set (10) of linear SDEs. Let the triplet 
$(W_1(t), W_2(t), W_3(t))$ represent three mutually independent Wiener processes driving 
the considered dynamics. Then the system governing the stochastic process 
$X(t) = (x_1(t), \ldots, x_d(t))^T$ with d = 4 can be written in the form 

$$dX(t) = \tilde{A}X(t)\,dt + B_1X(t)\circ dW_1(t) + B_2X(t)\circ dW_2(t) + C\,dW_3(t), \qquad (17)$$

where $\tilde{A}$, $B_1$, $B_2$ are real d x d matrices, and C is the d-dimensional real vector 
corresponding to system (10). Here $(\cdot)^T$ denotes the transpose of the inscribed vector or 
matrix. Equation (17) is interpreted in the Stratonovich sense. 

For the purpose of more convenient mathematical handling, we transform system 
(17) to its equivalent Ito form (cf. Arnold 1974; Kloeden & Platen 1992), 

$$dX(t) = AX(t)\,dt + B_1X(t)\,dW_1(t) + B_2X(t)\,dW_2(t) + C\,dW_3(t). \qquad (18)$$

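It turns out that the only difference between the Stratonovich drift matrix $\tilde{A}$ of (17) and the Ito drift matrix A of (18) rests with the element 
$a_{2,2} = \tilde{a}_{2,2} + \frac{1}{2}\sigma_2^2$. The system matrices of the drift and diffusion parts have the 
following form 

$$A = (a_{i,j})_{i,j=1,\ldots,4} = \begin{pmatrix} 0 & 1 & 0 & 0 \\ -\omega^2 & -2\zeta\omega + \frac{1}{2}\sigma_2^2 & -g_1 & -g_2 \\ \nu & 0 & -\nu & 0 \\ 0 & \nu & 0 & -\nu \end{pmatrix},$$

$$B_1 = (b^{1}_{i,j})_{i,j=1,\ldots,4} \quad\text{with}\quad b^{1}_{i,j} = \begin{cases} \sigma_1 & \text{if } i = 2,\ j = 1 \\ 0 & \text{else,} \end{cases}$$

$$B_2 = (b^{2}_{i,j})_{i,j=1,\ldots,4} \quad\text{with}\quad b^{2}_{i,j} = \begin{cases} \sigma_2 & \text{if } i = j = 2 \\ 0 & \text{else,} \end{cases}$$

and 

$$C = (0,\ \sigma_3 I(t),\ 0,\ 0)^T. \qquad (19)$$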
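
The Stratonovich-to-Ito drift correction for a linear system with multiplicative noise can be reproduced numerically. The sketch below is illustrative: it assumes the standard correction of adding half the sum of the squared diffusion matrices to the Stratonovich drift matrix, uses hypothetical parameter values, and the function name `weak_delay_matrices` is not from the paper.

```python
import numpy as np

def weak_delay_matrices(omega, zeta, g1, g2, nu, sigma1, sigma2):
    """Drift and diffusion matrices of the linear weak-delay system (d = 4)."""
    A_strat = np.array([[0.0, 1.0, 0.0, 0.0],
                        [-omega**2, -2 * zeta * omega, -g1, -g2],
                        [nu, 0.0, -nu, 0.0],
                        [0.0, nu, 0.0, -nu]])
    B1 = np.zeros((4, 4)); B1[1, 0] = sigma1   # displacement-dependent noise
    B2 = np.zeros((4, 4)); B2[1, 1] = sigma2   # velocity-dependent noise
    # Ito drift = Stratonovich drift + (1/2) * sum_k B_k^2 for linear diffusion terms
    A_ito = A_strat + 0.5 * (B1 @ B1 + B2 @ B2)
    return A_strat, A_ito, B1, B2

# Hypothetical parameter values
A_strat, A_ito, B1, B2 = weak_delay_matrices(1.0, 0.05, 0.5, 0.2, 1.0, 0.3, 0.4)
print(A_ito - A_strat)   # only the (2,2) element differs, by sigma2**2 / 2
```

The displacement-dependent noise produces no correction because the square of its diffusion matrix vanishes, which matches the statement that only one element of the drift matrix changes.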
The solution process X(t) is a diffusion process, and accordingly a detailed description 
of the probabilistic behaviour of the model can be obtained in terms of the probability 
density $p = p(x_1, x_2, x_3, x_4; t)$ given by the Fokker-Planck equation (FPE). The FPE 
corresponding to (18) has the form 

$$\begin{aligned}
\frac{\partial p}{\partial t} = {} & -\frac{\partial}{\partial x_1}[x_2 p] - \frac{\partial}{\partial x_2}\left[\left\{-\omega^2 x_1 + \left(\tfrac{1}{2}\sigma_2^2 - 2\zeta\omega\right)x_2 - g_1 x_3 - g_2 x_4\right\}p\right] \\
& -\frac{\partial}{\partial x_3}[\nu(x_1 - x_3)p] - \frac{\partial}{\partial x_4}[\nu(x_2 - x_4)p] \\
& +\frac{1}{2}\frac{\partial^2}{\partial x_2^2}\left[\left(\sigma_1^2 x_1^2 + \sigma_2^2 x_2^2 + \sigma_3^2 I^2(t)\right)p\right].
\end{aligned} \qquad (20)$$

The complete solution of this FPE subject to appropriate boundary conditions 
seems to be impossible. Accordingly, one has to be content with the evolution of 
moments of the process X(t). It is pertinent to emphasize that in the presence of seismic 
excitation the transient solution of the process X(t) is crucial. However, before 
embarking on a detailed numerical analysis of the stochastic system, we first study the 
stability behaviour of the system in the moment sense in the absence of seismic 
excitation. 


Moment stability of the system without seismic excitation. When examining the
asymptotic behaviour, the seismic excitation term can be neglected. Modelled with
the approach of Bolotin (1960), this term is damped out exponentially fast, and hence it
plays no role in the long-term behaviour of the system.

When seismic excitation is absent, SDE (18) reduces to

$$dX(t) = \tilde{A}X(t)\,dt + B^1 X(t)\,dW^1(t) + B^2 X(t)\,dW^2(t). \tag{21}$$

Taking expectation, the system of first moments $Y(t) = EX(t)$ follows the equation

$$dY(t) = dEX(t) = \tilde{A}\,EX(t)\,dt = \tilde{A}\,Y(t)\,dt. \tag{22}$$

Obviously, system (21) is asymptotically stable in the mean sense iff the real parts of all
eigenvalues of the matrix $\tilde{A}$ are negative. The characteristic polynomial $c_{\tilde{A}}$ of $\tilde{A}$ expanded
in $z$ is

$$c_{\tilde{A}}(z) = \det(\tilde{A} - zU_4)
= (z + \nu)\Big(z^3 + \big(2\zeta\omega - \tfrac{1}{2}\sigma_2^2 + \nu\big)z^2
+ \big(\big(g_2 + 2\zeta\omega - \tfrac{1}{2}\sigma_2^2\big)\nu + \omega^2\big)z
+ \nu(\omega^2 + g_1)\Big), \tag{23}$$

where $U_4$ represents the unit matrix in $\mathbb{R}^{4\times4}$.

Using the Routh-Hurwitz criterion, we find that the system has stable first moments if

$$2\zeta\omega + \nu > \tfrac{1}{2}\sigma_2^2$$

and

$$\big(2\zeta\omega + \nu - \tfrac{1}{2}\sigma_2^2\big)\big(\omega^2 + 2\zeta\omega\nu + g_2\nu - \tfrac{1}{2}\sigma_2^2\nu\big) > (\omega^2 + g_1)\nu. \tag{24}$$

It may be noted that multiplicative fluctuations characterizing the displacement have
no influence on the stability of first moments. On the other hand, multiplicative
fluctuations affecting the random velocity make the conditions for stability more stringent
than those for the deterministic case, i.e. when $\sigma_2 = 0$ [cf. (14)].
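Conditions (24) are easy to evaluate numerically. The following is a minimal Python sketch, not taken from the paper: the function name `first_moments_stable` and its argument names are illustrative choices, and the values in the usage part are those of parameter set (35).

```python
def first_moments_stable(omega, zeta, g1, g2, nu, sigma2):
    """Evaluate the two Routh-Hurwitz conditions (24) for the cubic factor of (23)."""
    half_var = 0.5 * sigma2 ** 2
    c2 = 2.0 * zeta * omega + nu - half_var                      # coefficient of z^2
    c1 = omega ** 2 + (2.0 * zeta * omega + g2 - half_var) * nu  # coefficient of z
    c0 = (omega ** 2 + g1) * nu                                  # constant term
    # First condition: c2 > 0; second condition: c2 * c1 > c0.
    return c2 > 0.0 and c2 * c1 > c0


if __name__ == "__main__":
    # Parameter set (35) of the weak-delay model (sigma2 varied for comparison).
    params = dict(omega=5.0, zeta=0.1514, g1=11.85, g2=4.87, nu=0.1)
    for sigma2 in (0.0, 1.4, 2.0):
        print(sigma2, first_moments_stable(sigma2=sigma2, **params))
```

For $\sigma_2 = 2$ the first condition already fails, since $2\zeta\omega + \nu = 1.614$ is smaller than $\tfrac{1}{2}\sigma_2^2 = 2$.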
The analysis of the evolution and asymptotic behaviour of the second moments is more
difficult and laborious than that of the first moments. For this purpose we follow
Arnold (1974). One obtains

$$\dot{Q}(t) = \tilde{A}Q + Q\tilde{A}^T + B^1 Q B^{1T} + B^2 Q B^{2T}, \tag{25}$$

for the second moment matrix $Q(t) = (E\,X_i(t)X_j(t))_{i,j=1\ldots4}$. Analysing the deterministic
system (25) one gains the desired assertions for the stability of SDE (21) in the mean square
sense. We introduce the notations

$$p_1 = q_{1,1},\ p_2 = q_{1,2},\ p_3 = q_{1,3},\ p_4 = q_{1,4},\ p_5 = q_{2,2},\
p_6 = q_{2,3},\ p_7 = q_{2,4},\ p_8 = q_{3,3},\ p_9 = q_{3,4},\ p_{10} = q_{4,4}.$$

With this notation one finds the vector differential equation equivalent to (25)

$$\dot{p} = Sp, \tag{26}$$

where $p(t)$ is the corresponding $d(d+1)/2 = 10$-dimensional vector of second moments.
The elements of the matrix $S$ are obtained from (25) using the symmetry of the second
moment matrix $Q(t)$. For the decision on mean square stability one encounters the
characteristic polynomial $c_S(z) = \det(S - zU_{10})$ of degree 10, where $U_{10}$ represents the
unit matrix in $\mathbb{R}^{10\times10}$. Symbolic computation packages like MAPLE
produce unwieldy expressions running into several pages. Thus there is
no hope of an exact analysis of the roots of the polynomial as functions of the system
parameters. Therefore one has to resort to numerical computations for specific
parameters, as discussed in § 6.
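The right-hand side of the second moment equation (25) can be evaluated numerically without forming the matrix $S$ explicitly. The sketch below is a plain-Python illustration under stated assumptions: the helper names (`system_matrices`, `mat_mul`, `transpose`, `second_moment_rhs`) are hypothetical, the default values are those of parameter set (35), and the matrix products follow the standard form $\tilde{A}Q + Q\tilde{A}^T + \sum_j B^j Q B^{jT}$.

```python
def system_matrices(omega=5.0, zeta=0.1514, g1=11.85, g2=4.87, nu=0.1,
                    sigma1=5.0, sigma2=1.4):
    """Ito drift matrix and the two multiplicative-noise matrices (weak delay)."""
    A = [
        [0.0, 1.0, 0.0, 0.0],
        [-omega ** 2, -2.0 * zeta * omega + 0.5 * sigma2 ** 2, -g1, -g2],
        [nu, 0.0, -nu, 0.0],
        [0.0, nu, 0.0, -nu],
    ]
    B1 = [[0.0] * 4 for _ in range(4)]
    B1[1][0] = sigma1  # noise acting through the displacement
    B2 = [[0.0] * 4 for _ in range(4)]
    B2[1][1] = sigma2  # noise acting through the velocity
    return A, B1, B2


def mat_mul(X, Y):
    """Product of two small dense matrices stored as nested lists."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


def transpose(X):
    return [list(col) for col in zip(*X)]


def second_moment_rhs(Q, A, noise_matrices):
    """Right-hand side of (25): A Q + Q A^T + sum_j B^j Q (B^j)^T."""
    AQ = mat_mul(A, Q)
    QAt = mat_mul(Q, transpose(A))
    rhs = [[AQ[i][j] + QAt[i][j] for j in range(4)] for i in range(4)]
    for B in noise_matrices:
        BQBt = mat_mul(mat_mul(B, Q), transpose(B))
        rhs = [[rhs[i][j] + BQBt[i][j] for j in range(4)] for i in range(4)]
    return rhs


if __name__ == "__main__":
    A, B1, B2 = system_matrices()
    x0 = [1.0, 0.0, 0.0, 0.0]
    Q0 = [[xi * xj for xj in x0] for xi in x0]  # Q(0) for deterministic X(0)
    for row in second_moment_rhs(Q0, A, [B1, B2]):
        print(row)
```

Feeding this right-hand side to any standard ODE integrator gives the evolution of all ten distinct second moments; the symmetry of $Q$ is preserved by construction.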





5. Stability analysis of the linear system with strong delay 

Proceeding on similar lines, one can carry out moment stability analysis of the system 
with strong delay. The system is governed by a set of linear SDEs (13) while assuming 
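$\gamma = 0 = \sigma_3$. After rewriting the system in its equivalent Ito form one obtains the drift
matrix

$$\tilde{A} = (\tilde{a}_{i,j})_{i,j=1\ldots6} =
\begin{pmatrix}
0 & 1 & 0 & 0 & 0 & 0 \\
-\omega^2 & -2\zeta\omega + \tfrac{1}{2}\sigma_2^2 & -g_1 & 0 & -g_2 & 0 \\
0 & 0 & -\nu & \nu & 0 & 0 \\
\nu & 0 & 0 & -\nu & 0 & 0 \\
0 & 0 & 0 & 0 & -\nu & \nu \\
0 & \nu & 0 & 0 & 0 & -\nu
\end{pmatrix}, \tag{27}$$

which establishes the qualitative mean behaviour of the linear system. Its characteristic
polynomial is

$$c_{\tilde{A}}(z) = \det(\tilde{A} - zU_6)
= (z + \nu)^2\Big((z + \nu)^2\big(z^2 - z\big(-2\zeta\omega + \tfrac{1}{2}\sigma_2^2\big) + \omega^2\big) + z\nu^2 g_2 + \nu^2 g_1\Big). \tag{28}$$

This can be rewritten as

$$c_{\tilde{A}}(z) = (z + \nu)^2\big(z^4 + 2b_1 z^3 + \nu b_2 z^2 + 2\nu b_3 z + b_4\big),$$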
where

$$b_1 = \zeta\omega + \nu - \sigma_2^2/4, \tag{29}$$

$$b_2 = (\omega^2/\nu) + 4\zeta\omega + \nu - \sigma_2^2, \tag{30}$$

$$b_3 = \omega^2 + \nu\zeta\omega + \tfrac{1}{2}g_2\nu - \tfrac{1}{4}\sigma_2^2\nu \tag{31}$$

and

$$b_4 = \nu^2(\omega^2 + g_1). \tag{32}$$

Necessary and sufficient conditions for negative real parts of the roots of polynomial
(28) are

$$b_1 > 0,\quad b_3 > 0,\quad b_4 > 0,\quad b_2 > (b_3/b_1) + (b_1/b_3)(\omega^2 + g_1), \tag{33}$$

see May (1974). We note that in the presence of velocity-dependent noise (i.e. $\sigma_2 > 0$)
the conditions involving (29) and (30) become more stringent.
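Conditions (33) can be checked in a few lines. The following Python sketch is illustrative only: the function name `strong_delay_mean_stable` is hypothetical, the coefficients follow from expanding (28), and evaluating them at parameter set (35) is an assumption made here for demonstration, since the paper reports no simulation for the strong-delay model.

```python
def strong_delay_mean_stable(omega, zeta, g1, g2, nu, sigma2):
    """Evaluate the coefficients (29)-(32) and test the conditions (33)."""
    b1 = zeta * omega + nu - sigma2 ** 2 / 4.0
    b2 = omega ** 2 / nu + 4.0 * zeta * omega + nu - sigma2 ** 2
    b3 = omega ** 2 + nu * zeta * omega + 0.5 * g2 * nu - 0.25 * sigma2 ** 2 * nu
    b4 = nu ** 2 * (omega ** 2 + g1)
    # Sign conditions are tested first, so the divisions below are safe.
    if not (b1 > 0.0 and b3 > 0.0 and b4 > 0.0):
        return False
    return b2 > b3 / b1 + (b1 / b3) * (omega ** 2 + g1)


if __name__ == "__main__":
    # Assumed for illustration: the weak-delay parameter values (35).
    params = dict(omega=5.0, zeta=0.1514, g1=11.85, g2=4.87, nu=0.1)
    for sigma2 in (0.0, 1.4, 2.0):
        print(sigma2, strong_delay_mean_stable(sigma2=sigma2, **params))
```

Increasing $\sigma_2$ lowers $b_1$, $b_2$ and $b_3$ simultaneously, which is how the velocity-dependent noise tightens the stability region.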

One can obtain the differential matrix equation for the second moment matrix by 
a procedure similar to the previous section. However, in order to derive conditions for 
mean square stability of the system, one has to resort to numerical computations, which 
we omit here. 


6. Numerical simulation for the linear model with weak delay 

In the presence of seismic excitation and environmental fluctuations the knowledge 
about the transient solution of the vector process $X(t)$ is of paramount importance.
For carrying out transient analysis we have to simulate sample paths of the system. 
A brief outline of the numerical procedure is given in appendices A and B. It would be of 
great interest to have an estimate of the frequency with which the sample paths cross the 
boundaries of the deterministic solution of the system. It would be equally interesting 
to study the duration time for which the sample paths remain outside the boundaries or 
above some critical values. In view of these goals we resort to the numerical solution of 
stochastic systems. 

Another important aspect relates to the numerical computation and visualization of 
the evolution of first and second moments of variables of interest. In linear systems we 
have two alternatives, namely the direct statistical estimation of the moments from the 
stochastic system or the solution of the corresponding ordinary differential equation 
system. Of course, in linear models the latter could be without additive noise. However, 
to incorporate seismic excitations in the nonlinear model we make use of numerical 
techniques and statistical estimation procedures. 

6.1 Numerical simulation of sample paths and first moments 

It is recommended to implement the implicit Euler method with drift implicitness 0.5 or
a balanced method with simple correction by a scalar factor depending locally on the
current Wiener process increments. This enables us to integrate stochastically stiff
systems fairly accurately and guarantees numerical stability of moments, particularly
mean square stability, cf. Schurz (1993). These procedures provide us with very robust
approximate solutions.

For gaining insight into the evolution of components of the system without seismic
excitation, we resort to the implicit Euler method with implicitness $p = 0.5$ and
equidistant time step size $\Delta = t_{n+1} - t_n = 0.001$. This method employs the scheme

$$Y_{n+1} = (U - p\Delta\tilde{A})^{-1}\big(U + (1 - p)\Delta\tilde{A} + B^1\Delta W_n^1 + B^2\Delta W_n^2\big)Y_n
\qquad (n = 0, 1, 2, \ldots) \tag{34}$$

starting with $Y_0 = X(0)$ at time $t_0$ for the linear system without additive noise.
$\Delta W_n = W(t_{n+1}) - W(t_n)$ denotes the current Wiener process increment, whereas $Y_{n+1}$
gives the value of the approximate solution at time $t_{n+1} = t_0 + (n+1)\Delta$. Once again
$U$ represents the unit matrix. The parameter values for the simulation of the model with
weak delay are

$$X(0) = (1, 0, 0, 0);\ \omega = 5;\ \zeta = 0.1514;\ g_1 = 11.85;\ g_2 = 4.87;\
\nu = 0.1;\ \sigma_1 = 5;\ \sigma_2 = 1.4\ \text{and}\ \sigma_3 = 0. \tag{35}$$
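Scheme (34) with the parameter values (35) can be sketched in plain Python as follows. This is an illustration under stated assumptions rather than the authors' implementation: the helper names (`drift_matrix`, `mat_vec`, `solve`, `implicit_euler_step`, `simulate`) are hypothetical, the linear system is solved by Gaussian elimination instead of an explicit matrix inverse, and the Gaussian increments come from Python's standard `random` module.

```python
import random


def drift_matrix(omega=5.0, zeta=0.1514, g1=11.85, g2=4.87, nu=0.1, sigma2=1.4):
    """Ito drift matrix of the weak-delay system for parameter set (35)."""
    return [
        [0.0, 1.0, 0.0, 0.0],
        [-omega ** 2, -2.0 * zeta * omega + 0.5 * sigma2 ** 2, -g1, -g2],
        [nu, 0.0, -nu, 0.0],
        [0.0, nu, 0.0, -nu],
    ]


def mat_vec(M, v):
    return [sum(m * x for m, x in zip(row, v)) for row in M]


def solve(M, rhs):
    """Solve M y = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [list(M[i]) + [rhs[i]] for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    y = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(aug[i][c] * y[c] for c in range(i + 1, n))
        y[i] = (aug[i][n] - tail) / aug[i][i]
    return y


def implicit_euler_step(y, A, dt, dw1, dw2, sigma1=5.0, sigma2=1.4, p=0.5):
    """One step of (34): (U - p dt A) Y_new = (U + (1-p) dt A + B1 dW1 + B2 dW2) Y."""
    Ay = mat_vec(A, y)
    rhs = [y[i] + (1.0 - p) * dt * Ay[i] for i in range(4)]
    # B^1 y and B^2 y only contribute to the velocity component.
    rhs[1] += sigma1 * y[0] * dw1 + sigma2 * y[1] * dw2
    lhs = [[(1.0 if i == j else 0.0) - p * dt * A[i][j] for j in range(4)]
           for i in range(4)]
    return solve(lhs, rhs)


def simulate(n_steps=1000, dt=0.001, seed=0):
    """Generate one sample path starting at X(0) = (1, 0, 0, 0)."""
    rng = random.Random(seed)
    A = drift_matrix()
    y = [1.0, 0.0, 0.0, 0.0]
    path = [y]
    for _ in range(n_steps):
        dw1 = rng.gauss(0.0, dt ** 0.5)
        dw2 = rng.gauss(0.0, dt ** 0.5)
        y = implicit_euler_step(y, A, dt, dw1, dw2)
        path.append(y)
    return path
```

Averaging the first component over many calls of `simulate` with different seeds gives a statistical estimate of the mean displacement, which can be compared with the solution of the first moment equation (22).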

Figure 1 shows a plot of the components of the random oscillator with weak
delay. Obviously, the components are more or less rapidly damped out. For the
corresponding first moments one observes the same pattern. The damping seems to be
slightly slower than that in the deterministic model, and from a critical value of $\sigma_2$
onwards the damping gives way to instability (not due to numerics!) with increasing
amplitudes (amplification). Furthermore, for this parameter choice (35) we know that
the linear homogeneous equation has a mean square stable, and hence a mean stable,
equilibrium solution (we checked it with MAPLE). This fact is demonstrated in
figure 2.



Figure 1. Pathwise evolution of components without seismic excitation. 
[Figure 2; axis label: $E[X_1(t)]$]



The plot of this figure compares the mean displacement of the stochastic system with 
the displacement of the deterministic one. In figure 2 it is shown that an amplification in 
the mean sense is caused by multiplicative noise forces. The value of the parameter $\sigma_2$
plays the decisive role in establishing this amplification process. However, the phase of 
the mean oscillations does not seem to change under these noise forces. For producing 
figure 2 we have applied the same Euler method (34) with implicitness and step size as in 
figure 1. 

Finally, we have carried out some simulations for the system with weak delay in the
presence of seismic excitation using the same setup (35). The new parameters are

$$\sigma_3 = 0.9;\ a = 1\ \text{and}\ \beta = 2.$$

For further simplification, we make use of balanced methods circumventing the
costly inversion of correction matrices in the implicit Euler methods. Balanced
methods introduced by Mil’shtein et al (1992) give an alternative means of controlling
the numerical moment behaviour (for details see appendix). We suggest the balanced
method

$$Y_{n+1} = c_n^{-1}\big((Uc_n + \Delta\tilde{A} + B^1\Delta W_n^1 + B^2\Delta W_n^2)Y_n + C\Delta W_n^3\big)
\qquad (n = 0, 1, 2, \ldots) \tag{36}$$

for the numerical solution of the linear system. Its correction factor $c_n$ is simply a real
number and chosen with

$$c_n = 1 + \|\tilde{A}\|\Delta + (1 + \|B^1\|^2)|\Delta W_n^1| + (1 + \|B^2\|^2)|\Delta W_n^2|.$$
Consequently, this numerical method simply represents a locally and appropriately
corrected Euler method. Furthermore, it gives more numerical stability (visible with
large step sizes and other parameter choices).
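One step of the balanced method (36) can be sketched as follows. The code is a hedged illustration: the names `drift_matrix`, `frobenius_norm` and `balanced_step` are hypothetical, the Frobenius norm is an assumption (the text does not say which matrix norm is meant), and the value of the excitation envelope at the current time is passed in as the plain number `envelope` because its functional form is not given in this section.

```python
def drift_matrix(omega=5.0, zeta=0.1514, g1=11.85, g2=4.87, nu=0.1, sigma2=1.4):
    """Ito drift matrix of the weak-delay system for parameter set (35)."""
    return [
        [0.0, 1.0, 0.0, 0.0],
        [-omega ** 2, -2.0 * zeta * omega + 0.5 * sigma2 ** 2, -g1, -g2],
        [nu, 0.0, -nu, 0.0],
        [0.0, nu, 0.0, -nu],
    ]


def frobenius_norm(matrix):
    """Assumed norm choice; any other matrix norm could be substituted."""
    return sum(x * x for row in matrix for x in row) ** 0.5


def balanced_step(y, A, dt, dw, sigma=(5.0, 1.4, 0.9), envelope=1.0):
    """One step of (36) with the scalar correction factor c_n.

    dw       -- the three Wiener increments (dW1, dW2, dW3) of this step
    sigma    -- the noise intensities (sigma1, sigma2, sigma3)
    envelope -- value of the seismic envelope at t_n (hypothetical input)
    """
    dw1, dw2, dw3 = dw
    s1, s2, s3 = sigma
    # B^1 and B^2 have one nonzero entry each, so their norms are |s1| and |s2|.
    c = (1.0 + frobenius_norm(A) * dt
         + (1.0 + s1 ** 2) * abs(dw1)
         + (1.0 + s2 ** 2) * abs(dw2))
    increment = [dt * sum(a * x for a, x in zip(row, y)) for row in A]
    increment[1] += s1 * y[0] * dw1 + s2 * y[1] * dw2 + s3 * envelope * dw3
    # (36) rearranged: Y_new = Y + (explicit Euler increment) / c_n.
    return [yi + di / c for yi, di in zip(y, increment)]
```

Written this way, the method is visibly an explicit Euler step whose increment is shrunk by $c_n \ge 1$; large Wiener increments are damped most strongly, and no linear system has to be solved.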

Results concerning the temporal evolution of the estimated first moment of the
displacement are shown in figure 3, along with the corresponding deterministic evolution. The
presence of seismic excitation leads to a shift in the phase of the stochastic displacement
compared with the deterministic one, as seen in this figure. The results depicted in the
figures above can be considered as reliable. For confirming the presented results, we
repeated the numerical integration and estimation with smaller step sizes several times.

Corresponding numerical studies could also be executed on computers for the model
with strong delay, but this is omitted as the computational effort increases
considerably. However, comparison studies between models with weak and strong delay would
be of great interest.

6.2 Numerical simulation of exit frequencies 

In order to answer the questions pertaining to the frequency with which the sample
paths cross the boundaries of the deterministic solution and the duration of remaining
outside the boundaries, we make use of pathwise simulation of the linear system
with weak delay and seismic excitation. For this purpose 1000 simulations are carried
out. Figure 4 shows the proportion of sample paths of the displacement remaining
outside the interval $[-0.5, +0.5]$ for increasing time $t$.
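The relative exit frequency described above is a simple counting statistic over an ensemble of simulated paths. A minimal Python sketch follows; the function name `relative_exit_frequency` and the data layout (one list of displacement values per sample path, all on the same time grid) are assumptions made for illustration, and the tiny ensemble in the usage part is invented toy data, not simulation output.

```python
def relative_exit_frequency(paths, lower=-0.5, upper=0.5):
    """Proportion of sample paths outside [lower, upper] at each time index.

    paths[k][n] is the displacement of sample path k at time index n.
    """
    n_paths = len(paths)
    n_times = len(paths[0])
    return [
        sum(1 for path in paths if not (lower <= path[n] <= upper)) / n_paths
        for n in range(n_times)
    ]


if __name__ == "__main__":
    # Toy ensemble of four paths observed at two time points (illustrative only).
    toy_paths = [[0.0, 0.6], [0.2, -0.7], [0.1, 0.3], [0.9, 0.0]]
    print(relative_exit_frequency(toy_paths))
```

With 1000 simulated displacement paths in place of the toy data, the returned list is the kind of curve plotted against time in figure 4.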


Figure 3. Evolution of deterministic and mean displacement with seismic
excitation.

Figure 4. Temporal evolution of the relative exit frequency.

7. Effects of feedback and stochasticity under cubic nonlinearity ($\gamma > 0$)

In this section we briefly discuss some effects coming in through the nonlinear term
$x_1^3(t)$ in the model system, i.e. $\gamma \neq 0$. Up to a certain level of $\gamma$ we can treat this case as
a perturbation of the corresponding linear system. Thus, for very small parameters $\gamma$, it
would not decisively affect the dynamical behaviour. This changes dramatically
with increasing $\gamma$. For practical simulations we recommend using only balanced
methods with appropriate weights. To some extent they guarantee the most
numerically stable behaviour which is possible. Now we draw attention to several special
cases in the presence of cubic nonlinearity.

7.1 A deterministic nonlinear oscillator with small feedback 

At first we numerically examine the deterministic dynamical behaviour in the phase
plane of the first two components. Under small feedback parameters ($g_1$, $g_2$) the system
behaves like a stable nonlinear oscillator in the vicinity of the equilibrium point zero.
Thus we can expect a spiralization process of the corresponding flow converging to its
steady state (equilibrium zero) in the phase plane. This effect is demonstrated in
figure 5. As already mentioned, the system without distributed delays undergoes the
movement of the well-known Duffing oscillator. This oscillator possesses one to three
steady states (equilibria) depending on the system parameters (the sign in front of $\omega^2$
establishes its number). In our model we observe the presence of only one steady state,
which is stable because of the positivity of $\omega^2$. The qualitative behaviour seems to be the
same under small feedback, as seen in the figure.



Effects of distributed delays on the stability of structures 





Figure 5. Phase diagram of the deterministic nonlinear system with small feedback. 


7.2 A deterministic nonlinear oscillator with large feedback 

A change of the feedback parameters $g_1$ and $g_2$ as well as the exponential rate $\nu$ causes
several effects. It turns out that the spiralization and attraction process of the obtained
flow are decisively influenced by them, e.g. the clustering process, the speed and
direction of attraction. This fact is seen in figure 6. Beyond critical values for the
feedback gains it even leads to a destabilization of the dynamical system, whereas an
increasing value of $\nu$ causes further stiffness of the system (simultaneous appearance of
slowly and rapidly varying components).

7.3 A stochastic nonlinear oscillator with distributed delays 

Now we incorporate stochasticity. Experiments show that the speed of attraction is
damped in the presence of stochastic terms. Noise as well as cubic nonlinearity effect an
amplification of the system when it is moving far from the equilibrium point.
Furthermore, the built-in stochasticity destabilizes the spiralization process (form, duration
and convergence) in the phase plane, depending on its intensity parameters. However,
under small noise an attraction of the flow can be studied (cf. figure 7). This
figure visualizes the dynamical behaviour in terms of a stochastic flow in the phase
plane. The paths for several initial values are plotted with respect to the same random
noise output.

In contrast to our specific model setup, the general noisy Duffing oscillator is
a well-studied example of a bistable random oscillator with three steady states (the zero
point is unstable!). This physical object leads to a chaotic movement between two
attraction regions and a symmetric change of clustered trajectories (tunnelling). In our
model the parameter $\sigma_1$ determines the nearness to this behaviour; however, processes
of attraction and separation alternate. With growing $\sigma_1$ one observes a significant
change between separation and attraction periods around different points; finally the
movement of the oscillator becomes more and more chaotic.

Figure 6. Phase diagram of the deterministic system with large feedback.

Figure 7. Phase diagram of a noisy nonlinear system with delay.

For the same parameter setup as in § 6 the system still has a stable behaviour under
cubic nonlinearity with parameter $\gamma = 1$. This picture is confirmed by figure 8. There we
estimated the mean square evolution of the nonlinear system. As seen, the amplitudes
decrease monotonically. However, this process is damped with growing influence of
seismic excitation and nonlinearity. Thus, one can expect serious instabilities, both in
estimation procedures for the mean evolution and in the displacement.

As we have already seen, the nonlinearity can have several effects. It is important to
take such terms into consideration. By varying the parameters $\sigma_1$ and $\sigma_2$ one could expect
a significant change in the long-term behaviour of solutions, but varying $\sigma_3$ affects the
transient behaviour. This process, which is often called bifurcation, takes the form of a
pitchfork bifurcation within the framework of the Duffing oscillator. The point where just
this effect occurs seems to be of great interest. Such bifurcation points indicate some
structural instability in general. Corresponding bifurcation diagrams should show that
bifurcation is related to the emergence of new solutions and a change in the stability
behaviour. Stable and unstable steady states (equilibria) may occur in our seismic
models. For a discussion of the stochastic bifurcation theory see the papers from the
school of Arnold (1974) or Horsthemke & Lefever (1984). However, this theory for
higher-dimensional systems is still in its infancy and reliable numerical methods to
detect steady states and to notice the qualitative aspects of stochastic dynamical
systems are urgently required.

Figure 8. Mean square evolution of the components $X_1$ and $X_2$ with delay and
seismic excitation ($\gamma = 1$).


8. Conclusions 

The paper has examined and visualized some effects of nonconstant weak and strong
time delay on seismic structures. Additionally we have introduced stochasticity
through several noise sources in seismic models. In this paper we examine the
stochastic dynamical systems governing the seismic structures. It would be useful to
understand their behaviour more accurately before one introduces active controls in
them. The form of stochasticity introduced here not only serves the purpose of
expressing uncertainty in the process but also allows an interpretation of random
perturbations in system parameters.

The paper demonstrates that Stratonovich noise destabilizes the temporal behaviour
of seismic structures, although this aspect is not new in the general theory of
stochastic dynamical systems. The same effect was observed in both linear and
nonlinear models. Incorporating stochasticity and cubic nonlinearity leads to a significant
amplification of the system components. Up to a certain level of stochasticity the
damping and attraction process is somewhat delayed, and beyond that level even
serious instabilities can occur. Seismic excitations following the specification (6) due to
Bolotin (1960) mainly effect a change in the transient system behaviour and a shift in the
phase between displacement and velocity of the oscillations.

For solving the model equations for seismic structures, numerical techniques must be 
carefully chosen. They are useful in indicating qualitative characteristics of these dynamical 
systems, e.g. stability of an equilibrium point, and provide the user with reliable results. 
Numerical simulations should employ implicit Euler methods, or more generally some 
appropriately chosen balanced methods. They are able to control fairly well the 
behaviour of the simulated dynamical system, at least up to the mean square level. 

Appendix A. Numerical methods for solving SDEs

The development of stochastic numerical techniques began in the early fifties with
attempts to generate the simplest numerical solutions. Even at that stage of infancy,
the Euler scheme and its convergence to the exact solution were already known.
Since then there have been many contributions on this subject. Mil’shtein (1974) has done
pioneering work in this field. Further contributions by him followed in the eighties,
e.g. see the book (Mil’shtein 1988). Furthermore, we want to point to the paper of
Wagner & Platen (1978), which laid the foundations for a general calculus for
numerical analysis of stochastic differential equations. They developed the stochastic
Taylor formula, a counterpart of the deterministic Taylor formula, which leads to
systematic numerical methods of a higher convergence order. Finally, it led to the
development of a ‘new field’ in the eighties. These investigations and more are given in
the monograph of Kloeden & Platen (1992). For a more experimental and
computer-oriented approach, Kloeden et al (1994) have recently published a book. The French
school, see e.g. Pardoux & Talay (1985) or Talay (1982, 1989, 1990), has made important
contributions to the subject. An alternative approach to the stochastic numerical methods
mentioned here was given by Kushner (1989). He constructs Markov chain
approximations in order to treat stochastic control problems efficiently, such as the heavy
traffic problem. This appendix follows the approach of Artemiev, Mil’shtein, Kloeden
and Talay. For further sophisticated reviews and interesting facts see the papers of
Artemiev, Clark and Cameron, Newton, Rümelin, Shkurko, Strittmatter and many
others. However, we are only able to present some basic facts on this subject, and do not
claim any completeness.

In contrast to the deterministic calculus, the stochastic integration depends on the 
choice of intermediate points. By varying this choice one gains different integral 
notions. Two of them turn out to be very effective. One is the Ito integral, which has 
more mathematical meaning due to convenient properties, whereas the other, the 
Stratonovich integral, is more appropriate for practical modelling (cf. Wong & Zakai 
1965). Both versions can be transformed to each other in a natural way. Now, for the 
sake of simplicity, we only consider the Ito type of stochastic differential equations. 
These equations are often driven by a stochastic process having independent identically 
Gaussian distributed increments. The Wiener process $W(t)$ with zero mean and
$E[W(t)]^2 = t$ represents such a process. Although very erratic, this stochastic process is
appropriate for modelling (at least in the mean square sense). Several applications rely on
this noise source. Now, let there be given an $m$-dimensional Wiener process
$(W^j(t))_{j=1,\ldots,m}$ which drives the Ito differential equation

$$dX(t) = a(t, X(t))\,dt + \sum_{j=1}^{m} b^j(t, X(t))\,dW^j(t), \tag{37}$$

starting at $X(0) = x_0 \in \mathbb{R}^d$ on the time interval $[0, T]$. Solutions $\{X(t): t \ge 0\}$ of (37) exist
and are unique under the assumptions of Lipschitz continuity and of ‘appropriate’
polynomial boundedness of the functions $a_i(\cdot)$ and $b_i^j(\cdot)$ ($i = 1 \ldots d$, $j = 1 \ldots m$). The
simplest method generating such solutions numerically is the Euler-Maruyama
method (for short Euler method). These solutions are constructed by

$$Y_{n+1} = Y_n + a(Y_n)\Delta_n + \sum_{j=1}^{m} b^j(Y_n)\Delta W_n^j \qquad (n = 0, 1, 2, \ldots). \tag{38}$$

Here $Y_{n+1}$ denotes the value of the approximate solution using integration step size
$\Delta_n = t_{n+1} - t_n$ at time point $t_{n+1}$. With $\Delta W_n^j = W^j(t_{n+1}) - W^j(t_n)$ we denote the
current $j$th increment of the Wiener process $W^j(t)$, which can be generated as a standard
Gaussian random variable multiplied by $\Delta_n^{1/2}$. At least for ‘small enough’ step sizes $\Delta_n$,
appropriate convergence theorems justify the application of method (38) to (37) to
obtain an approximate solution depending on the practical purpose. In case of
pathwise approximation (strong) one requires that a constant $K = K(T) > 0$ ($T$
terminal time) exists such that

$$\forall t_n:\ E\|X(t_n) - Y(t_n)\| \le K(T)\cdot\Delta^{\gamma}, \tag{39}$$

where $\Delta = \sup \Delta_n < +\infty$. In contrast to that, for momentwise approximation (weak) it
is sufficient to demand only the existence of positive constants $K = K(T, g)$ such that

$$\forall t_n:\ \|E(g(X(t_n)) - g(Y(t_n)))\| \le K(T, g)\cdot\Delta^{\beta}, \tag{40}$$

with respect to a class of ‘sufficiently smooth’ functions $g$ (often $g \in C^{\infty}$). The weak
convergence has more practical utility because one is mostly interested in the calculation
of moments only. In those cases one may even simplify the generation of the random
variables in (38). For equidistant approximations, it turns out that one may take any
independent identically distributed random variables $\xi_n^j$ instead of $\Delta W_n^j$ which satisfy
the moment relation

$$|E\xi| + |E\xi^3| + |E\xi^5| + |E\xi^2 - \Delta| + |E\xi^4 - 3\Delta^2| \le C\cdot\Delta^3,$$

for some constant $C > 0$. Thus we keep the weak convergence order $\beta$ of numerical
methods, at least up to the order $\beta = 2.0$. For example, this is true for three-point
distributed random variables $\xi$ with

$$P(\xi = \pm(3\Delta)^{1/2}) = 1/6 \quad\text{and}\quad P(\xi = 0) = 2/3.$$

Moreover, for method (38) one can take simpler random variables than the
above. Choosing noise increments $\xi_n^j$ satisfying

$$|E\xi| + |E\xi^3| + |E\xi^5| + |E\xi^2 - \Delta| \le C\cdot\Delta^2 \tag{41}$$

for some constant $C > 0$, e.g. two-point distributed random variables $\xi$ with

$$P(\xi = \pm\Delta^{1/2}) = 1/2,$$

the weak convergence order $\beta \le 1$ of numerical methods is achieved, cf. Kloeden
& Platen (1992). These simplifications in the generation of random variables save time
and computational effort, but the same procedure cannot be applied to the scheme (38)
approximating the pathwise solution of (37) under the requirement (39). The method (38)
possesses strong convergence order $\gamma = 0.5$ and weak convergence order $\beta = 1.0$. For
proofs, see e.g. Kloeden & Platen (1992).
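The Euler method (38) and the simplified noise generators discussed above can be sketched in a few lines of Python. This is an illustration for the scalar case only, not the authors' code: the names `gaussian`, `three_point`, `two_point`, `three_point_moments`, `euler_maruyama` and `weak_mean` are hypothetical, and the drift and diffusion are passed as ordinary Python functions of the state.

```python
import random


def gaussian(dt, rng):
    """Exact Wiener increment: N(0, dt)."""
    return rng.gauss(0.0, dt ** 0.5)


def three_point(dt, rng):
    """xi = +-sqrt(3 dt) with probability 1/6 each, xi = 0 with probability 2/3."""
    u = rng.random()
    if u < 1.0 / 6.0:
        return (3.0 * dt) ** 0.5
    if u < 1.0 / 3.0:
        return -(3.0 * dt) ** 0.5
    return 0.0


def two_point(dt, rng):
    """xi = +-sqrt(dt) with probability 1/2 each."""
    return dt ** 0.5 if rng.random() < 0.5 else -(dt ** 0.5)


def three_point_moments(dt):
    """Exact moments E xi^k, k = 1..5, of the three-point variable."""
    support = [((3.0 * dt) ** 0.5, 1.0 / 6.0),
               (-(3.0 * dt) ** 0.5, 1.0 / 6.0),
               (0.0, 2.0 / 3.0)]
    return [sum(prob * x ** k for x, prob in support) for k in range(1, 6)]


def euler_maruyama(a, b, y0, dt, n_steps, increment=gaussian, seed=0):
    """Scalar version of (38): Y_{n+1} = Y_n + a(Y_n) dt + b(Y_n) xi_n."""
    rng = random.Random(seed)
    y = y0
    for _ in range(n_steps):
        y = y + a(y) * dt + b(y) * increment(dt, rng)
    return y


def weak_mean(a, b, y0, dt, n_steps, n_paths, increment):
    """Monte Carlo estimate of E Y_N using the chosen increment generator."""
    total = 0.0
    for k in range(n_paths):
        total += euler_maruyama(a, b, y0, dt, n_steps, increment, seed=k)
    return total / n_paths
```

The three-point variable reproduces the first five moments of a Gaussian increment exactly (odd moments zero, second moment $\Delta$, fourth moment $3\Delta^2$), which `three_point_moments` confirms. Only moment estimates such as `weak_mean` may use the simplified generators; pathwise comparisons require `gaussian`.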

Mil’shtein has made one of the first attempts at a systematic construction of numerical
methods and proved the convergence of the well-known Mil’shtein methods (with
$\gamma = 1.0$ and $\beta = 1.0$). In general, corresponding higher order methods are derived from
the stochastic Taylor expansion, resulting from iterative application of Ito’s lemma by
appropriate truncation. This approach, suggested first in Wagner & Platen (1978), is
described in Mil’shtein (1988) or in Kloeden & Platen (1992). For further details, see
Artemiev (1993), Clark & Cameron (1980), Newton (1991), Pardoux & Talay (1985),
Talay (1982, 1989, 1990) or Kloeden et al (1991, 1994).

As in deterministic analysis, asymptotic stability and the already mentioned
convergence together give reasonably robust solutions. For the sake of guaranteeing stable
numerical behaviour, in our experiments we used slightly changed schemes based on
the Euler method. Stability guarantees that small initial perturbations have no marked
influence on the dynamical behaviour. In deterministic analysis one suggests implicit
methods, i.e. methods which also involve locally the new value of the
approximation. These methods ensure numerically stable behaviour. In stochastic analysis too
there are such methods. However, the variety of stability concepts here is large, cf.
Kozin (1969). We suggest following the concept of moment stability. For a brief
exposition on mean square stability, i.e. the stability of second moments, see appendix
B. To some extent, stochastically stable behaviour can be guaranteed by following the
family of implicit Euler methods envisaged by the scheme

$$Y_{n+1} = Y_n + \{p\,a(Y_{n+1}) + (1 - p)\,a(Y_n)\}\Delta_n + \sum_{j=1}^{m} b^j(Y_n)\Delta W_n^j
\qquad (n = 0, 1, 2, \ldots) \tag{42}$$
with fixed implicitness parameter $p \in [0, 1]$. Of course, for the application of these
methods the corresponding system (42) must be locally resolvable. An alternative
means to control the stochastic stability behaviour is given by the balanced methods.
Following Mil’shtein et al (1992) these methods possess the form

$$Y_{n+1} = Y_n + a(Y_n)\Delta_n + \sum_{j=1}^{m} b^j(Y_n)\Delta W_n^j
+ \sum_{j=0}^{m} c^j(t_n, Y_n)(Y_n - Y_{n+1})|\Delta W_n^j|, \tag{43}$$

where $c^j(\cdot,\cdot)$ are bounded $d \times d$ matrices and $\Delta W_n^0 = \Delta_n$ ($n = 0, 1, 2, \ldots$). For their
existence and convergence it is convenient to require that the matrices

$$M(t, x) := U + \sum_{j=0}^{m} \alpha_j c^j(t, x) \tag{44}$$

are invertible. Additionally the norms of their inverses must be uniformly bounded for
all pairs $(t, x) \in [0, T] \times \mathbb{R}^d$ and sequences of non-negative real numbers $(\alpha_j)_{j=0,1,\ldots,m}$
with $\alpha_0 \in [0, \bar{\alpha}]$. Strong convergence of order $\gamma = 0.5$ has been established in Mil’shtein
et al (1992). Furthermore, one immediately concludes the weak convergence of order
$\beta = 0.5$ for general models of type (37).

In particular, methods (42) and (43) are appropriate to integrate systems interpreted 
as physically stiff. Stiffness often occurs in physical phenomena, e.g. in stochastic 
mechanics. One encounters stiffness if there are two dynamical components in the 
system where one is moving rapidly and very erratically while the other is relatively 
slow. Generally speaking, the methods (42) are recommended for moment stability and
in cases where the deterministic part plays the decisive role in the dynamics. In
cases where the stochastic aspect is more important one should rather adopt some
appropriately chosen balanced methods. They can even be implemented more simply
and can achieve better control of the dynamical behaviour of numerical solutions. The
problem of choosing the weight matrices $c^j$ in the balanced methods (43) is circumvented
by the simple choice $c^j = \alpha_j \|b^j\| I$ with positive scalars $\alpha_j$ in the case of finite norms.
Note that these methods are especially constructed to treat more accurately systems
with linear multiplicative noise or with diffusion vectors having only bounded linear-polynomial
growth (more cannot be expected in general, because of the assumptions
guaranteeing their strong convergence). However, the knowledge about them is still
inadequate.
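
As a minimal sketch of how a balanced method of type (43) can be implemented, consider the scalar test equation dX = lam*X dt + sigma*X dW. The test equation, the constant scalar weights c0 = alpha0*|lam| and c1 = alpha1*|sigma|, and all function and parameter names below are assumptions made for illustration; they are not taken from Mil’shtein et al (1992). With scalar weights the implicit relation can be solved for Y_{n+1} in closed form. For alpha0 = alpha1 = 1 every step multiplies Y_n by a positive factor, whereas the explicit Euler method can change sign at large step sizes.

```python
import math
import random


def balanced_path(x0, lam, sigma, dt, n_steps, alpha0=1.0, alpha1=1.0, seed=1):
    """Balanced method for dX = lam*X dt + sigma*X dW (illustrative test equation).

    The scalar weights c0 = alpha0*|lam| and c1 = alpha1*|sigma| are an assumed choice.
    """
    rng = random.Random(seed)
    y, path = x0, [x0]
    for _ in range(n_steps):
        dw = rng.gauss(0.0, math.sqrt(dt))
        # Solving (43) for Y_{n+1} with scalar weights gives a division by w.
        w = 1.0 + alpha0 * abs(lam) * dt + alpha1 * abs(sigma) * abs(dw)
        y = y + (lam * y * dt + sigma * y * dw) / w
        path.append(y)
    return path


def euler_path(x0, lam, sigma, dt, n_steps, seed=1):
    """Explicit Euler method driven by the same random increments."""
    rng = random.Random(seed)
    y, path = x0, [x0]
    for _ in range(n_steps):
        dw = rng.gauss(0.0, math.sqrt(dt))
        y = y + lam * y * dt + sigma * y * dw
        path.append(y)
    return path
```

Calling both functions with a deliberately coarse step, e.g. `lam=-5.0, sigma=3.0, dt=0.5`, shows the difference: the balanced path stays positive like the exact solution, while the Euler path oscillates in sign.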

With the methods (42) and (43) we have obtained reasonable results for our random 
oscillators. In this paper corresponding higher order methods are deliberately avoided. 
Extensive studies concern their stability. Furthermore, higher order methods would 
not always be applicable to nonlinear models. They require ‘too much smoothness,
boundedness’ of the drift and diffusion coefficients $a^i(\cdot)$, $b^i_j(\cdot)$ and ‘additional information’
on the noise (multiple Ito integrals with respect to the Wiener process). Their
generation involves massive systems and sets of multiple integrals. Sometimes these
methods cause explosions in their numerical solutions close to zero (due to unbounded
derivatives of drift and diffusion), or else they do not show stable numerical behaviour (e.g.
see Schurz 1993). As of now we cannot recommend higher order methods for solving general
nonlinear models, for visualizing stochastic dynamics or for estimating their qualitative 
characteristics (such as Lyapunov exponents which describe the long-term behaviour). 
However, in specific models they can be very effective. For example, if one can make use

stable iff

\[
\exists\, c_1, c_2 > 0 \;\; \forall\, t_0 \ge 0 : \quad E\,\|X_t(x_0, t_0)\|^2 \le c_1 \|x_0\|^2 \exp(-c_2 (t - t_0)). \tag{47}
\]

For results on exponential stability, see the works of Sasagawa (1981) and Khas’minskij
(1980). Further contributions on stability and different concepts can be found in Arnold
(1974), Kozin (1969) or Kushner (1967). For sophisticated approaches, see e.g. the
papers of Baxendale (1985) or the collection of papers in Arnold & Wihstutz (1986). The
following theorem forms the basis for further consideration of corresponding discrete
systems.

Theorem B.1. Assume that the matrix-valued functions $A(t)$ and $B^j(t)$ in (45) are
bounded on $[t_0, \infty)$. Then, for exponential stability of the null solution in the mean
square sense it is necessary that for any, and sufficient that for a particular symmetric,
positive definite, continuous and bounded $d \times d$ matrix $C(t)$ with $x^T C(t) x \ge k_1 |x|^2$
($k_1 > 0$) for all $t \ge 0$ the matrix differential equation

\[
\frac{dD(t)}{dt} + A^T(t) D(t) + D(t) A(t) + \sum_{j=1}^{m} B^j(t)^T D(t) B^j(t) = -C(t), \tag{48}
\]

possesses a matrix solution $D(t)$ with the same properties as the matrix $C(t)$.

This theorem and its proof can be found in a more general form in Khas’minskij (1980). 
The present formulation is from Arnold (1974). Similar to the above definition one can 
introduce the notion of mean square stability of the null solution for numerical 
methods. 

DEFINITION B.2 

A numerical solution $(Y_n)_{n \in \mathbb{N}}$ with fixed step size $\Delta$ starting in $y_0$ at time $t_0$ has an
(asymptotically) mean square stable null solution iff

\[
\exists\, \delta > 0 \;\; \forall\, t_0 \ge 0 \;\; \forall\, y_0 \in \mathbb{R}^d,\ \|y_0\| < \delta : \quad \lim_{n \to \infty} E\,\|Y_n\|^2 = 0, \tag{49}
\]

where we understand that the limit in (49) is taken only at discrete times $t_n$.

In contrast to stability analysis for continuous time systems, the stability analysis for 
numerical methods is still in its initial stage. A somewhat discrete counterpart to the 
above theorem is stated below for implicit Euler and Mil’shtein methods. For the 
construction and convergence of implicit Mil’shtein methods see Kloeden & Platen 
(1992) and Kloeden et al (1994). Let $P_n^{p,M}$ denote the matrix of second moments $E\,Y_n^i Y_n^j$
in the implicit Mil’shtein method for (45) at time $t_n$, and $P_n^{p,E}$ that for the corresponding
implicit Euler method (42) using the same step size $\Delta$ and implicitness $p \in [0,1]$. It is not
hard to verify that stability of the matrix sequences $P_n^p$ implies stability in the mean
square sense, as required in (49), and vice versa. Thus one can draw further inference
about the behaviour of the sequence $(P_n^p)_{n=0,1,2,\dots}$ for both the implicit methods. The
following result is stated for autonomous systems, i.e. for systems with time-independent
drift and diffusion parts.

Theorem B.2. Assume that both the discrete systems start with the same positive definite 
matrix $P_0$ of second moments for the autonomous form of SDE (45). Then the mean
square evolution of both discrete systems is given by the corresponding operators $\mathcal{L}_p^E$ and
$\mathcal{L}_p^M$ mapping from the space of positive definite $d \times d$ matrices into itself, and the
following relations hold:

(i) $P_n^{p,E} = (\mathcal{L}_p^E)^n (P_0) \le_{(+)} (\mathcal{L}_p^M)^n (P_0) = P_n^{p,M}$, i.e.

$\forall x \in \mathbb{R}^d : x^T P_n^{p,E} x \le x^T P_n^{p,M} x, \quad n = 0, 1, 2, \dots,$

(ii) $\forall S \in \mathbb{R}^{d \times d}$, $S$ positive definite solution of (48), $\forall p \ge 0.5$:

$\mathcal{L}_p^E S <_{(+)} S$, hence the eigenvalues of $\mathcal{L}_p^E$ are smaller than unity.

(iii) $p = 0.5 \Rightarrow$

The implicit Euler method possesses a mean square stable null solution iff the null solution
is mean square stable for the continuous time system.

(iv) $\forall S \in \mathbb{R}^{d \times d}$, $S$ positive definite, $\forall p_1, p_2$, $0 \le p_1 \le p_2$: $\mathcal{L}_{p_2} S \le_{(+)} \mathcal{L}_{p_1} S$

where $\mathcal{L}$ is the mean square operator of the implicit Euler or Mil’shtein method,
respectively.

These are the results of the paper by Schurz (1993) where one also finds the proofs. In
B.2(i) matrix inequalities are to be understood in terms of positive definiteness of the
difference matrix.

Remarks: The results on mean square stability in implicit Euler and Mil’shtein
methods are very effective. As a conclusion, the half-implicit Euler method ($p = 0.5$) can
be considered as a stability indicator of the continuous time system to be solved
numerically by this method. This method possesses a mean square stable null solution
iff the null solution is mean square stable for the corresponding continuous time system
(see B.2(iii)). Through B.2(iv) the ‘monotonic nesting’ of mean square stability
domains has been discovered. Thus the most mean square stable method (within the
class $0 \le p \le 1$) is the fully drift-implicit method, i.e. the numerical method with
implicitness $p = 1$. This holds for both the family of implicit Euler methods and the
family of implicit Mil’shtein methods.
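
The statements B.2(iii) and B.2(iv) can be checked by hand for a scalar test equation. The sketch below assumes the linear test equation dX = lam*X dt + sigma*X dW and the drift-implicit Euler scheme Y_{n+1} = Y_n + [p*lam*Y_{n+1} + (1 - p)*lam*Y_n]*dt + sigma*Y_n*dW as a scalar instance of (42); the test equation and the function names are illustrative assumptions, not part of Schurz (1993). Taking the expectation of Y_{n+1}^2 gives a per-step growth factor of the second moment, and the null solution is mean square stable iff this factor is below one. For the continuous time equation the second moment decays iff 2*lam + sigma^2 < 0.

```python
def ms_factor_implicit_euler(lam, sigma, dt, p):
    """Per-step growth factor of E|Y_n|^2 for the drift-implicit Euler method.

    Assumes the scalar test equation dX = lam*X dt + sigma*X dW and
    implicitness p in [0, 1]; requires 1 - p*lam*dt != 0.
    """
    numerator = (1.0 + (1.0 - p) * lam * dt) ** 2 + sigma ** 2 * dt
    denominator = (1.0 - p * lam * dt) ** 2
    return numerator / denominator


def continuous_ms_stable(lam, sigma):
    """Mean square stability of the null solution of the test equation."""
    return 2.0 * lam + sigma ** 2 < 0.0


def discrete_ms_stable(lam, sigma, dt, p):
    """Mean square stability of the numerical null solution."""
    return ms_factor_implicit_euler(lam, sigma, dt, p) < 1.0
```

For lam = -3, sigma = 2 and dt = 1 the explicit method (p = 0) has factor 8 and is unstable, while p = 0.5 and p = 1 give 0.68 and 0.3125; the factor decreases as p grows, which is the scalar picture of the monotonic nesting in B.2(iv).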

If one is not interested in indicating stability, one could also apply balanced methods
to achieve control of the discrete dynamical behaviour. As an advantage of the
balanced methods, the costly inversion of correction matrices is not really necessary.
One can use balanced methods with scalar corrections, e.g. in the case of bounded norms of
drift $a(\cdot)$ and diffusion $b^j(\cdot)$ take

\[
Y_{n+1} = Y_n + w_n^{-1} \Big( a(Y_n)\,\Delta_n + \sum_{j=1}^{m} b^j(Y_n)\,\Delta W_n^j \Big), \tag{50}
\]

where $w_n = 1 + \alpha_0 \|a(\cdot)\|\,\Delta_n + \sum_{j=1}^{m} \alpha_j \|b^j(\cdot)\|\,|\Delta W_n^j|$ with nonnegative real parameters
$(\alpha_j)_{j=0,1,\dots,m}$. It is not hard to verify that there exist balanced methods (50) which
provide mean square stable numerical solutions. Moreover, with (50) one can state
numerical methods which possess a more mean square stable null solution than any of
the implicit Euler or implicit Mil’shtein methods for bilinear systems. The only
disadvantage of balanced methods is that they slightly reduce the weak convergence
order and could be too stable (not applicable as stability indicator) for general
continuous time systems. Apart from these drawbacks the balanced methods seem to
be the richest and most robust class of numerical methods so far. For results concerning
them, see Schurz (1994).

Higher order Taylor methods, methods with higher convergence order and those 
derived from the stochastic Taylor formula do not seem to be appropriate to achieve 
stability. For example, Theorem B.2(i) shows that the implicit Mil’shtein method is
worse than the corresponding implicit Euler method as regards mean square stability. 
This indicates that one has to introduce terms carefully in the numerical method to 
achieve higher convergence order and stable behaviour simultaneously. However, the 
drawbacks of higher order methods still exist. There is no appropriate stochastic 
control on their stability behaviour up to now. 
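
The comparison in Theorem B.2(i) can be made concrete for a scalar example. The sketch below again assumes the test equation dX = lam*X dt + sigma*X dW, which is an illustrative choice and not part of the source, and compares the per-step growth factor of E|Y_n|^2 for the drift-implicit Euler and drift-implicit Mil’shtein methods. The Mil’shtein correction 0.5*sigma^2*Y_n*(dW^2 - dt) has zero mean and is uncorrelated with the Euler increment, so it only adds the nonnegative term 0.5*sigma^4*dt^2 to the numerator.

```python
def ms_factor_euler(lam, sigma, dt, p):
    """Second-moment growth factor of the drift-implicit Euler method."""
    return ((1.0 + (1.0 - p) * lam * dt) ** 2 + sigma ** 2 * dt) / (1.0 - p * lam * dt) ** 2


def ms_factor_milstein(lam, sigma, dt, p):
    """Second-moment growth factor of the drift-implicit Mil'shtein method.

    Uses E[(dW^2 - dt)^2] = 2*dt^2 for the extra term of the scheme.
    """
    extra = 0.5 * sigma ** 4 * dt ** 2
    return ((1.0 + (1.0 - p) * lam * dt) ** 2 + sigma ** 2 * dt + extra) / (1.0 - p * lam * dt) ** 2
```

With lam = -3, sigma = 2, dt = 1 and p = 0.5 the Euler factor is 0.68 while the Mil’shtein factor is 1.96: at this step size the lower order method is mean square stable and the higher order one is not.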


Part of the work was done during the visit of the second author to the Jawaharlal
Nehru University, New Delhi in December 1993, for which the authors would like to
thank the University.


Stochastic control of hysteretic structural systems

YOSHIYUKI SUZUKI 

Disaster Prevention Research Institute, Kyoto University, Gokasho, Uji, 
Kyoto 611, Japan 

Abstract. A method of stochastic optimal control of hysteretic structural
systems under earthquake excitations is presented. Stochastic estimation
and control problems are formulated in the form of Ito stochastic differential
equations on the basis of the theory of continuous Markov processes.
The conditional moment equations given observation data are derived for
nonlinear filtering, and are closed by introducing an appropriate analytical
form of the conditional probability density functions of the state variables.
Under the assumption that the admissible controls are expressed as functions
of the conditional moment functions the Bellman equation is derived.
If the spatial variables of the Bellman equation are defined by a part of the
full set of conditional moment functions appearing in the closed moment
equations, the resulting Bellman equation is coupled with conditional moment
equations both for filtering and for prediction. The Gaussian and non-Gaussian
stochastic linearization techniques combined with simple solution
techniques to the Bellman equation are examined to solve the Bellman
equation or extended Riccati equations without prediction procedures.

Keywords. Stochastic optimal control; stochastic estimate; hysteretic 
structure; stochastic linearization. 

1. Introduction 

In the last two decades there has been a growing interest in active structural control,
driven by the increasing demand to ensure not only the structural safety but also the
serviceability of civil and building structures subjected to intense earthquake and wind
excitations (e.g. Yao 1972, Yang 1975, Leipholz & Abdel-Rohman 1986). Since there
are many uncertainties and much randomness in modelling input excitations and nonlinear
controlled structural systems, the control force is to be stochastically determined on the
basis of observation data of output responses of the actual structural system. Moreover,
since the observations are made only at a limited number of points of the structural
system and are contaminated by noise, stochastic optimal control based on the
stochastic estimate is to be explored to obtain a reliable controlled structural system
(Kushner 1967; Jazwinski 1970). The stochastic estimate-based optimal control problem
has been investigated vigorously in the field of control engineering in the recent
past. It has been well recognized that the stochastic optimal estimate under the
minimum error variance criterion and the stochastic optimal control associated with
the quadratic cost functional can be solved separately for linear dynamic systems
(Wonham 1968). As for nonlinear dynamic systems, however, the stochastic estimation
and optimal control problems may not always be solved independently. As far as
hysteretic structural systems are concerned, further theoretical and computational
methods need to be explored to realize the idea of structural control for actual
structural systems.

The objective of this paper is to find a reasonable method for the stochastic 
estimate-based optimal control of hysteretic structural systems on the basis of the 
theory of continuous Markov vector processes (Gikhman & Skorokhod 1972). First, 
both system and observation equations of controlled hysteretic structural systems are 
expressed in the form of the Ito stochastic differential equations. The fundamental 
equations governing the time variations of the conditional probability density functions
and the conditional moment functions given observation data from an initial
time to the current time are derived. In order to evaluate the conditional expectations of
nonlinear functions appearing in the conditional moment equations and to close them,
the approximate conditional probability density function of the state variables is
discussed. On the other hand, if the performance criterion is given by the conditional
expectation of a cost functional given an initial condition and if the admissible control 
is supposed to belong to a class of functions of conditional moment functions for 
stochastic estimates, the Bellman equation may be derived according to Bellman’s 
optimality principle (Bellman 1957). 

For a general class of nonlinear dynamic systems including hysteretic structural 
systems, if the admissible control is restricted to a class of functions of a set of 
conditional moment functions which is a part of the full set of those appearing in the 
approximate conditional probability density function, the Bellman equation is not 
closed and has to be solved simultaneously with the stochastic estimate solution 
procedure including prediction, since the Bellman equation is conditioned at the end 
time of control. From both analytical and computational aspects, one of the most
readily available procedures for the above problems might be the stochastic linearization
technique, which combines the stochastic estimation with optimal control of
nonlinear dynamic systems (Sunahara 1970, 1987). From a practical point of view, in
order to deal with the stochastic estimate-based optimal control of hysteretic multi-degree-of-freedom
structural systems, simplified, approximate procedures are discussed, which
combine the Gaussian and non-Gaussian stochastic linearization in nonlinear
filter dynamics with simple solution techniques to the Bellman equation.


2. Formulation of stochastic estimation and control problems 

2.1 Basic equations for stochastic estimate-control 

The basic equations for stochastic estimation and optimal control problems of 
nonlinear structural systems may be expressed as the following pair of system and 
observation equations: 

\[ dZ_t = F_t^c(Z_t)\,dt + G_t\,dW_t, \tag{1} \]

\[ dY_t = H_t^c(Z_t)\,dt + R_t\,dV_t, \tag{2} \]

where

\[ F_t^c(Z_t) = F_t(Z_t) + B_t C_t(S_t), \tag{3} \]

\[ H_t^c(Z_t) = H_t(Z_t) + D_t C_t(S_t). \tag{4} \]


In the above equations, $Z_t$ and $Y_t$ are, respectively, $n$-dimensional state and $k$-dimensional
output vectors, $F_t(Z_t)$ and $H_t(Z_t)$ are, respectively, $n$- and $k$-dimensional
vector-valued nonlinear functions of time $t$ and state vector $Z_t$, $G_t$, $R_t$, $B_t$ and $D_t$ are
$n \times m$, $k \times l$, $n \times s$ and $k \times s$ matrix-valued functions of $t$, and $W_t$ and $V_t$ are, respectively,
normalized system and observation noises which are supposed to be independent
vector Wiener processes as follows:

\[
E\left\{ \begin{bmatrix} dW_t \\ dV_t \end{bmatrix} \begin{bmatrix} dW_t^T & dV_t^T \end{bmatrix} \right\} = \begin{bmatrix} I & 0 \\ 0 & I \end{bmatrix} dt, \tag{5}
\]

where the superscript $T$ denotes the transpose of a vector or matrix, and $I$ is the identity
matrix. The $s$-dimensional vector-valued admissible control function $C_t(S_t)$ is described
in terms of a vector-valued stochastic estimate $S_t$.

Here the stochastic estimate $S_t$ is given as a collection of conditional moment
functions given the observation $\mathcal{Y}_t$ during the time interval $[t_0, t]$; the admissible control
function $C_t(S_t)$ is then a measurable function with respect to $\mathcal{Y}_t$. Hence, if the functions
$F_t(Z_t)$ and $H_t(Z_t)$ are mathematically well-defined as single-valued nonlinear functions
of $t$ and $Z_t$, the system equation given by (1) is an Ito nonlinear vector stochastic
differential equation, and the simultaneous equations consisting of the system and
observation equations given by (1) and (2) are also Ito stochastic differential
equations.


2.2 Controlled hysteretic structural systems 

In modelling the hysteretic structural systems, it is required to describe hysteretic 
characteristics in well-defined differential forms in terms of relevant state variables in 
order to formulate the system and observation equations in the form of the Ito 
stochastic differential equations. The hysteretic restoring force is in general expressed
as

\[ \phi = r x + (1 - r) z, \tag{6} \]

where $\phi$ is the dimensionless hysteretic characteristic, $x$ is the deformation nondimensionalized
by a reference yield deformation, $z$ is the dimensionless hysteretic component,
$r$ is a weight of the linear component $x$ to $\phi$, and both $z$ and $\phi$ are normalized to have
unit initial rigidities. For the bilinear hysteretic model the hysteretic component $z$ is, in
particular, expressed in the following differential form (Kobori et al 1976):

\[ \dot{z} = \dot{x}\,[1 - U(\dot{x})\,U(z - \delta) - U(-\dot{x})\,U(-z - \delta)] = g(\dot{x}, z), \tag{7} \]

where $\delta > 0$ is a dimensionless yield deformation, the overdot means derivative with
respect to time and $U(\cdot)$ denotes the Heaviside unit step function. The unified
differential form of the general hysteretic constitutive laws is given for a class of
piecewise linear hysteresis including degrading or stiffening property (Suzuki & Minai
1988a) and for curved hysteresis (Wen 1976, Baber & Wen 1981). By making use of the
differential forms of hysteretic constitutive laws and by defining the whole state vector
$Z_t$ representing the controlled hysteretic structural system subjected to earthquake
excitations, the state space equation of the whole dynamic system may be written in
the form

\[ \dot{Z}_t = F_t(Z_t) + B_t C_t(S_t) + N_t, \tag{8} \]

where $N_t$ is an input excitation vector.
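
The bilinear law (6), (7) can be illustrated with a short numerical sketch. The code below integrates the differential form of the hysteretic component with an explicit Euler step for a prescribed sinusoidal deformation. The deformation history, the step size, the convention U(0) = 1 and all function names are assumptions made for illustration only; with a finite step the component z may overshoot the yield level by at most one increment.

```python
import math


def heaviside(s):
    """Unit step function with U(0) = 1 (a convention assumed for this sketch)."""
    return 1.0 if s >= 0.0 else 0.0


def g(xdot, z, delta):
    """Rate of the hysteretic component z in the bilinear law (7)."""
    return xdot * (1.0 - heaviside(xdot) * heaviside(z - delta)
                   - heaviside(-xdot) * heaviside(-z - delta))


def restoring_force(x, z, r):
    """Dimensionless hysteretic restoring force (6)."""
    return r * x + (1.0 - r) * z


def simulate(amplitude=3.0, delta=1.0, r=0.1, dt=1e-3, t_end=4.0 * math.pi):
    """Return a list of (x, z, phi) for the deformation x(t) = amplitude*sin(t)."""
    z = 0.0
    history = []
    for k in range(int(round(t_end / dt))):
        t = k * dt
        x = amplitude * math.sin(t)
        xdot = amplitude * math.cos(t)
        history.append((x, z, restoring_force(x, z, r)))
        z += g(xdot, z, delta) * dt  # explicit Euler step of (7)
    return history
```

Plotting phi against x from `simulate()` gives the familiar bilinear loop: unit slope while |z| < delta and slope r once the component has yielded.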

In the case where the input excitation is measurable with observation noise in
addition to the structural responses, the observation equation is expressed as the
following Ito stochastic differential equation:

\[ d\,{}^E Y_t = N_t\,dt + {}^E R_t\, d\,{}^E V_t, \tag{9} \]

where ${}^E Y_t$ is the measured input excitation, ${}^E R_t$ is a time-dependent matrix and ${}^E V_t$ is
a normalized Wiener process representing observation noise. From (8) and (9) the
system equation is then expressed as

\[ dZ_t = [F_t(Z_t) + B_t C_t(S_t)]\,dt + d\,{}^E Y_t - {}^E R_t\, d\,{}^E V_t. \tag{10} \]

By setting $G_t = -{}^E R_t$ and $dW_t = d\,{}^E V_t$ and by replacing $F_t(Z_t) + B_t C_t(S_t) + {}^E \dot{Y}_t$ with
$F_t^c(Z_t)$, (10) is reduced to the form of (1).


3. Stochastic estimate of nonlinear dynamic systems 


It is well-known that the optimal estimator under the minimum error variance criterion
is given by the conditional expectation of the state vector $Z_\tau$ at the estimation time
$\tau$ given observation data $\mathcal{Y}_t$ in a time interval $[t_0, t]$ (Jazwinski 1970). That is

\[ \langle Z_\tau \rangle_t = \int_{R^n} dZ_\tau\, Z_\tau\, p_\tau(Z_\tau | \mathcal{Y}_t), \tag{11} \]

where $p_\tau(Z_\tau | \mathcal{Y}_t)$ is the conditional probability density function of filtering, prediction or
smoothing according to the case where $\tau = t$, $\tau > t$ or $\tau < t$, and $R^n$ denotes the
$n$-dimensional full state space. Since it is difficult to determine directly the conditional
probability density functions, the conditional moment equations are solved numerically
in a nonstationary state to evaluate the time-dependent conditional probability
density function as well as conditional statistics of state variables including the optimal
estimator. By defining an arbitrary twice differentiable and integrable function of the
state vector $Z_\tau$ as $\phi(Z_\tau)$, the conditional expectation of $\phi(Z_\tau)$ and its differential with
respect to $\tau$ are respectively given by

\[ \langle \phi(Z_\tau) \rangle_t = \int_{R^n} dZ_\tau\, \phi(Z_\tau)\, p_\tau(Z_\tau | \mathcal{Y}_t), \tag{12} \]

and

\[ d_\tau \langle \phi(Z_\tau) \rangle_t = \int_{R^n} dZ_\tau\, \phi(Z_\tau)\, d_\tau p_\tau(Z_\tau | \mathcal{Y}_t). \tag{13} \]

In order to obtain the time evolution of $\langle \phi(Z_\tau) \rangle_t$ with time parameter $\tau$ it is necessary to
determine the differential form $d_\tau p_\tau(Z_\tau | \mathcal{Y}_t)$ of the conditional probability density function.
For controlled nonlinear dynamic systems the filtering and prediction problems are
essential. The filtering problem is discussed herein for the sake of simplicity.

Since the solution process $Z_t$ to (1) and the joint solution process $(Z_t, Y_t)$ to (1) and (2)
are continuous Markov vector processes, the associated Kolmogorov differential
operators are given as (Minai & Suzuki 1987)

\[ L_Z^* \varphi_t = \tfrac{1}{2} [(G_t G_t^T)_{\kappa\nu}\, \varphi_t]_{,\kappa\nu} - (F_{t\kappa}^c\, \varphi_t)_{,\kappa}, \tag{14} \]

\[ L_Z \varphi_t = \tfrac{1}{2} (G_t G_t^T)_{\kappa\nu}\, \varphi_{t,\kappa\nu} + F_{t\kappa}^c\, \varphi_{t,\kappa}. \tag{15} \]

In the above equations, $\varphi_t$ is considered as a scalar-valued function of $t$ and $Z_t$, and $L_Z^*$
and $L_Z$ are the forward and backward operators. It is noted that in these equations the
summation and differentiation conventions are used: the subscripts $\kappa$ and $\nu$ attached
directly to vectors and matrices denote the specified element and those following the
comma represent partial differentiations with respect to the specified elements of
the state vector $Z_t$. By making use of these differential operators the differential form of
the conditional probability density function is given by

\[ dp_t(Z_t | \mathcal{Y}_t) = L_Z^* p_t(Z_t | \mathcal{Y}_t)\,dt + \tilde{H}_t^T(Z_t) (R_t R_t^T)^{-1} p_t(Z_t | \mathcal{Y}_t)\, d\nu_t, \tag{16} \]

where

\[ d\nu_t = dY_t - \langle H_t^c(Z_t) \rangle_t\,dt, \tag{17} \]

\[ \tilde{H}_t(Z_t) = H_t(Z_t) - \langle H_t(Z_t) \rangle_t = \tilde{H}_t^c(Z_t). \tag{18} \]

Substituting (16) into (13) and taking account of the independence of $d\nu_t$ and $Z_t$ yields

\[ d\langle \phi(Z_t) \rangle_t = \langle L_Z \phi(Z_t) \rangle_t\,dt + \langle \phi(Z_t) \tilde{H}_t^T(Z_t) \rangle_t (R_t R_t^T)^{-1} d\nu_t. \tag{19} \]

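In particular, the first order conditional moment, namely the optimal estimator of
filtering, is written as

\[ d\langle Z_t \rangle_t = \langle F_t^c(Z_t) \rangle_t\,dt + K_t\,d\nu_t, \tag{20} \]

in which

\[ K_t = \langle \tilde{Z}_t \tilde{H}_t^T(Z_t) \rangle_t (R_t R_t^T)^{-1}, \tag{21} \]

\[ \tilde{Z}_t = Z_t - \langle Z_t \rangle_t, \tag{22} \]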

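where $\tilde{Z}_t$ is the estimation error vector. The differential form which governs the
conditional expectation of a scalar-valued function $\phi(\tilde{Z}_t)$ of the error vector is given as
follows:

\[
d\langle \phi(\tilde{Z}_t) \rangle_t = \langle \tilde{L}_Z \phi(\tilde{Z}_t) \rangle_t\,dt + \Big\{ \langle \phi(\tilde{Z}_t) \tilde{H}_t^T(Z_t) \rangle_t (R_t R_t^T)^{-1} - \Big\langle \frac{\partial \phi(\tilde{Z}_t)}{\partial \tilde{Z}_t^T} \Big\rangle_t K_t \Big\}\, d\nu_t, \tag{23}
\]

where

\[ \tilde{L}_Z \varphi_t = \tfrac{1}{2} G'_{t\kappa\nu}\, \varphi_{t,\kappa\nu} + F'_{t\kappa}\, \varphi_{t,\kappa}, \tag{24} \]

\[ G'_t = G_t G_t^T + K_t R_t R_t^T K_t^T, \tag{25} \]

\[ F'_t = \tilde{F}_t(Z_t) - K_t \tilde{H}_t(Z_t), \tag{26} \]

\[ \tilde{F}_t(Z_t) = F_t(Z_t) - \langle F_t(Z_t) \rangle_t = \tilde{F}_t^c(Z_t). \tag{27} \]

By defining the higher order conditional central moment function as

\[ M(l_1, l_2, \dots, l_n; t) = \Big\langle \prod_{i=1}^{n} \tilde{Z}_{ti}^{\,l_i} \Big\rangle_t \quad \text{for } \sum_{i=1}^{n} l_i \ge 2, \tag{28} \]

the higher order conditional central moment equation is derived from (23) as

\[
\begin{aligned}
dM(l_1, l_2, \dots, l_n; t) ={}& \sum_{i=1}^{n-1} \sum_{j=i+1}^{n} l_i l_j\, G'_{tij} \Big\langle \tilde{Z}_{ti}^{-1} \tilde{Z}_{tj}^{-1} \prod_{s=1}^{n} \tilde{Z}_{ts}^{\,l_s} \Big\rangle_t dt \\
&+ \frac{1}{2} \sum_{i=1}^{n} l_i (l_i - 1)\, G'_{tii} \Big\langle \tilde{Z}_{ti}^{-2} \prod_{s=1}^{n} \tilde{Z}_{ts}^{\,l_s} \Big\rangle_t dt \\
&+ \sum_{i=1}^{n} l_i \Big\langle F'_{ti}\, \tilde{Z}_{ti}^{-1} \prod_{s=1}^{n} \tilde{Z}_{ts}^{\,l_s} \Big\rangle_t dt \\
&+ \sum_{j=1}^{k} \Big\{ \Big[ \Big\langle \prod_{s=1}^{n} \tilde{Z}_{ts}^{\,l_s}\, \tilde{H}_t^T(Z_t) \Big\rangle_t (R_t R_t^T)^{-1} \Big]_j - \sum_{i=1}^{n} l_i K_{tij} \Big\langle \tilde{Z}_{ti}^{-1} \prod_{s=1}^{n} \tilde{Z}_{ts}^{\,l_s} \Big\rangle_t \Big\}\, d\nu_{tj},
\end{aligned} \tag{29}
\]

with

\[ M(l_1, l_2, \dots, l_n; t) = 0 \quad \text{for } \sum_{i=1}^{n} l_i = 1, \qquad M(0, 0, \dots, 0; t) = 1. \]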

In order to evaluate the conditional expectations in the right-hand sides of (21) and
(29) and to close the conditional moment equations given by (21) and (29), the
time-dependent joint conditional probability density function is required. An
approximate probability density function expressed in the form of a finite mixed type
orthogonal series expansion depending on the property and range of each state
variable may be efficient. Such approximate probability density functions have been
utilized in stochastic response analyses, reliability analyses and stochastic estimation
(Suzuki & Minai 1985, 1988b, 1989; Minai & Suzuki 1987). With the aid
of the appropriate conditional probability density function the closed set of conditional
moment equations can be written in the following vector-matrix form:

\[ dS_t^N = P_t^c(S_t^N)\,dt + Q_t(S_t^N)\,d\nu_t, \tag{30} \]

where the set $S_t^N$ denotes the full set of the conditional moment functions which are
needed to prescribe the approximate conditional probability density function, and
$P_t^c$ and $Q_t$ are considered as the drift vector and diffusion matrix, respectively.
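
For a linear scalar system the closure problem disappears and (20), (21) reduce to the Kalman-Bucy filter, which gives a convenient check of the general equations. The sketch below assumes F_t(Z) = a*Z, H_t(Z) = h*Z, constant scalar G_t = g and R_t = r, and no control; these simplifications and all names are assumptions for illustration. In this case K_t = P_t*h/r^2, where the error variance P_t obeys the Riccati equation dP/dt = 2*a*P + g^2 - h^2*P^2/r^2.

```python
import math


def gain(p_var, h, r):
    """Scalar version of (21) for H(Z) = h*Z: K = P*h / r^2."""
    return p_var * h / r ** 2


def variance_rate(p_var, a, g, h, r):
    """Right-hand side of the error-variance (Riccati) equation."""
    return 2.0 * a * p_var + g ** 2 - (h * p_var / r) ** 2


def filter_step(z_hat, p_var, dy, dt, a, g, h, r):
    """One Euler step of (20): dz_hat = a*z_hat dt + K (dY - h*z_hat dt)."""
    k = gain(p_var, h, r)
    z_new = z_hat + a * z_hat * dt + k * (dy - h * z_hat * dt)
    p_new = p_var + variance_rate(p_var, a, g, h, r) * dt
    return z_new, p_new


def steady_variance(a, g, h, r):
    """Positive root of 2*a*P + g^2 - h^2*P^2/r^2 = 0."""
    return r ** 2 * (a + math.sqrt(a ** 2 + (g * h / r) ** 2)) / h ** 2
```

In the linear case the variance equation does not depend on the observations, so `filter_step` can be iterated with any increment sequence `dy` and the variance still converges to `steady_variance`. For nonlinear F_t and H_t this decoupling is lost, which is why the closure by an approximate density is needed.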


4. Stochastic estimate-based optimal control

The stochastic optimum control of a nonlinear dynamic system may be formulated as
the problem of minimizing a prescribed cost functional subject to the constraints of the
system and observation equations in order to find the optimum control function
expressed in terms of the stochastic estimate $S_t$. A typical cost functional is given by

\[
J(u_s, \mathcal{Y}_0) = E\Big[ \Phi_T(Z_T) + \int_{t_0}^{T} L_s(Z_s, u_s)\,ds \,\Big|\, \mathcal{Y}_0 \Big], \quad s \in [t_0, T], \tag{31}
\]

where $u_s$ is a vector-valued control function, $T$ is the end time of control, and $L_s(Z_s, u_s)$
and $\Phi_T(Z_T)$ are a real-valued nonnegative convex cost density function and concentrated
cost function at $T$, respectively. The initial observation $\mathcal{Y}_0$ may be equivalent to the
initial condition $Z_0$ of the state vector. Here the control function is confined to be
a function of the stochastic estimate $S_t$ which is the full set $S_t^N$ of the conditional moment
functions or its subset, that is

\[ u_t = C_t(S_t), \tag{32} \]

for $S_t \subseteq S_t^N$. The minimum value function associated with the cost functional defined by
(31) is written in the following form:

\[
V_t(\mathcal{Y}_t) = \min_{C_s(S_s),\; s \in [t, T]} E\Big[ \Phi_T(Z_T) + \int_{t}^{T} L_s(Z_s, u_s)\,ds \,\Big|\, \mathcal{Y}_t \Big], \tag{33}
\]

for $t_0 \le t < T$. By supposing that the minimum value function $V_t(\mathcal{Y}_t)$ is denoted as
$V_t(S_t^N)$ and the admissible control function is given by $C_t(S_t^N)$, the Bellman equation
which governs the minimum value function as well as the optimality law of control is
derived in the following form:


\[
\min_{C_t(\xi)} \Big\{ \langle L_t(Z_t, C_t(\xi)) \rangle_t \big|_{S_t^N = \xi} + \Big( \frac{\partial}{\partial t} + L_\xi \Big) V_t(\xi) \Big\} = 0, \tag{34}
\]

with the end condition

\[ V_T(\xi) = \langle \Phi_T(Z_T) \rangle_T \big|_{S_T^N = \xi}, \tag{35} \]

where

\[ L_\xi \varphi_t = \tfrac{1}{2} (Q_t R_t R_t^T Q_t^T)_{\kappa\nu}\, \varphi_{t,\kappa\nu} + P_{t\kappa}^c\, \varphi_{t,\kappa}, \tag{36} \]

which is the Kolmogorov backward differential operator associated with (30).

In this case, since $S_t = S_t^N$, the Bellman equation (34) is closed in the sense that all the
coefficient functions are determined as definite functions of time, and the optimal
control function $C_t^*(S_t^N)$ and the minimal value of the cost function $V_{t_0}(\mathcal{Y}_0)$ may be
obtained by solving (34) backward independently from the stochastic estimation
procedure (Wonham 1968).

In the case where the admissible control functions are defined as functions of a subset
$S_t'$ of $S_t^N$ and the minimum value function is supposed to be expressed as $V_t(S_t')$, which
may contain unknown coefficient functions, the same derivation procedure as that of
(34) may formally yield the following equation:

\[
\min_{C_t(\xi)} \Big\{ \langle L_t(Z_t, C_t(\xi)) \rangle_t \big|_{S_t' = \xi} + \Big( \frac{\partial}{\partial t} + L_\xi \Big) V_t(\xi) \Big\} = 0, \tag{37}
\]

with the end condition

\[ V_T(\xi) = \langle \Phi_T(Z_T) \rangle_T \big|_{S_T' = \xi}, \tag{38} \]

in which $L_\xi$ takes the same form as in (36), but is a formal backward differential
operator concerning the following stochastic differential equation:

\[ dS_t' = P_t^c(S_t')\,dt + Q_t(S_t')\,d\nu_t. \tag{39} \]

For nonlinear system and observation equations, it is clear that (37) is not closed, in
general, for the case where $S_t = S_t' \subset S_t^N$. In order to complete the Bellman equation given
by (37) the unknown coefficient functions, which may depend on the complementary set of
$S_t'$ with respect to $S_t^N$, are to be determined in cooperation with both filtering and prediction
procedures.

The Bellman equation given by (34) or (37) concerns both the optimal control law
and the determination of the minimum value function. In the Bellman equation the control
function appears in the first term and in the drift term of $L_\xi V_t(\xi)$. By dividing the coefficient
function $P_t^c(\xi)$ appearing in $L_\xi$ into two parts,

\[ P_t^c(\xi) = {}^1P_t(\xi) + {}^2P_t(\xi), \tag{40} \]

where ${}^1P_t(\xi)$ is independent of the control function whereas ${}^2P_t(\xi)$ depends on the
control function, the optimality law of control may be given by

\[
\frac{\partial}{\partial C_t(\xi)} \Big\{ \langle L_t(Z_t, C_t(\xi)) \rangle_t \big|_{S_t = \xi} + {}^2P_t^T(\xi)\, \frac{\partial V_t(\xi)}{\partial \xi} \Big\} = 0. \tag{41}
\]

As indicated in (1) to (4), (20) and (29), the explicit form of ${}^2P_t(\xi)$ is $B_t C_t(\xi)$, which is
concerned only with the first order conditional moment equations. If the vector-valued
optimal control function $C_t^*(\xi)$ is found from (41) in terms of the minimum value
function or its derivatives, substitution of this optimal control function into (34) or (37)
may yield the reduced Bellman equation, which is a nonlinear partial differential
equation governing the minimum value function $V_t(\xi)$. By solving the reduced Bellman
equation backward to obtain $V_t(\xi)$ and by replacing $\xi$ with $S_t$ the optimal control
function is determined as a function of conditional moment functions concerning
filtering. Further substitution of the optimum control function in (1) to (4) completes
the system and observation equations of the controlled structural system.

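As a simple but most practical performance criterion, consider the case where the cost
functional is a quadratic form and $S_t = \langle Z_t \rangle_t$. That is, the cost functional is written as

\[
J(u_s, \mathcal{Y}_0) = E\Big[ Z_T^T \psi Z_T + \int_{t_0}^{T} (Z_s^T M_s Z_s + u_s^T N_s u_s)\,ds \,\Big|\, \mathcal{Y}_0 \Big], \quad s \in [t_0, T], \tag{42}
\]

and an admissible control function belongs to a class of functions of the optimal
estimator as follows:

\[ u_s = C_s(\langle Z_s \rangle_s), \tag{43} \]

where $M_s$ and $N_s$ are $n \times n$ nonnegative and $s \times s$ positive definite weight matrices for
the state and control vectors, respectively, and $\psi$ is an $n \times n$ concentrated weight matrix at
the end time of control. For this special case, the solution to the optimality condition
given by (41) is determined as

\[ C_t^*(\xi_1) = -\tfrac{1}{2} N_t^{-1} B_t^T \frac{\partial V_t(\xi_1)}{\partial \xi_1}, \tag{44} \]

and the Bellman equation is reduced to

\[
\frac{\partial V_t}{\partial t} + \tfrac{1}{2} \mathrm{tr}\Big[ Q_t R_t R_t^T Q_t^T \frac{\partial^2 V_t}{\partial \xi_1 \partial \xi_1^T} \Big] + {}^1P_t^T \frac{\partial V_t}{\partial \xi_1} - \tfrac{1}{4} \frac{\partial V_t}{\partial \xi_1^T} B_t N_t^{-1} B_t^T \frac{\partial V_t}{\partial \xi_1} + \xi_1^T M_t \xi_1 + \mathrm{tr}[M_t \langle \tilde{Z}_t \tilde{Z}_t^T \rangle_t] = 0, \tag{45}
\]

with the end condition

\[ V_T(\xi_1) = \xi_1^T \psi\, \xi_1 + \mathrm{tr}[\psi \langle \tilde{Z}_T \tilde{Z}_T^T \rangle_T], \tag{46} \]

where

\[ {}^1P_t(\xi_1) = \langle F_t(Z_t) \rangle_t \big|_{\langle Z_t \rangle_t = \xi_1}, \tag{47} \]

\[ Q_t(\xi_1) = K_t \big|_{\langle Z_t \rangle_t = \xi_1} = \langle \tilde{Z}_t \tilde{H}_t^T(Z_t) \rangle_t \big|_{\langle Z_t \rangle_t = \xi_1} (R_t R_t^T)^{-1}. \tag{48} \]

In the above equations, $\xi_1$ is the $n$-dimensional spatial vector corresponding to the first
order conditional moment functions, $\partial/\partial \xi_1$ means the gradient operator with respect to
$\xi_1$ and tr[·] means the trace of a matrix.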

The reduced Bellman equation and the end condition given by (45) and (46) contain 
undetermined coefficient functions. If the approximate conditional probability density 
function is described in terms of the conditional moment functions up to the Nth order 
to solve the stochastic estimation, the order N is generally larger than one which is the 
highest order in S t for this special case. Therefore the undetermined functions, which are 
expressed in terms of the higher order conditional moment functions of the order from 
2 to N, may be determined by means of the stochastic estimation including both 
filtering and prediction scheme. By referring to the solution technique for linear 
quadratic systems, the Bellman equation for the nonlinear quadratic systems may be 
converted to extended Riccati equations if the coefficient functions in the Bellman 
equation could be expressed by polynomials or by expanding them in a finite power 
series of the spatial variables. 
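
The reduction to a Riccati equation can be sketched for the simplest case. The code below assumes a linear scalar system F_t(Z) = a*Z with scalar B_t = b and constant weights M = m, N = n_w; these simplifications, the trial form V_t(xi) = Pi(t)*xi^2 + c(t) and all names are assumptions for illustration. Inserting the trial form into the reduced Bellman equation and collecting the xi^2 terms gives -dPi/dt = 2*a*Pi + m - b^2*Pi^2/n_w with Pi(T) = psi, while c(t) collects the terms independent of xi and does not affect the control.

```python
import math


def riccati_backward(a, b, m, n_w, psi, horizon, dt):
    """Integrate -dPi/dt = 2*a*Pi + m - (b*Pi)**2/n_w backward from Pi(T) = psi.

    Returns Pi at the initial time, a distance `horizon` before the end time.
    """
    pi = psi
    for _ in range(int(round(horizon / dt))):
        pi = pi + dt * (2.0 * a * pi + m - (b * pi) ** 2 / n_w)
    return pi


def optimal_control(xi, pi, b, n_w):
    """Scalar form of (44) with dV/dxi = 2*Pi*xi."""
    return -0.5 * b / n_w * (2.0 * pi * xi)


def steady_pi(a, b, m, n_w):
    """Positive root of 2*a*Pi + m - b^2*Pi^2/n_w = 0 (long-horizon limit)."""
    return n_w * (a + math.sqrt(a ** 2 + b ** 2 * m / n_w)) / b ** 2
```

For a = b = m = n_w = 1 and psi = 0 the backward solution approaches 1 + sqrt(2) for a long horizon, and the resulting feedback u = -Pi*xi stabilizes the otherwise unstable drift a = 1.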



5. Simplified, approximate solution procedures 

In applying the stochastic estimate-based optimal control techniques in the preceding 
sections to large complex structural systems such as hysteretic multi-degree-of- 
freedom structural systems, it may be necessary to simplify the whole procedure, 
specifically the closure technique of the conditional moment equations and the 
solution method for the reduced Bellman equation or its associated extended Riccati 
equations. 

As a practical method, the stochastic linearization of nonlinear functions appearing 
in the system and observation equations is considered. The vector-valued nonlinear 
function $F_t(Z_t)$ in (3) is linearized as follows:

$$F_t(Z_t) = f_t Z_t + f_t^0, \tag{49}$$

where $f_t$ and $f_t^0$ are, respectively, the $n \times n$ matrix and $n$-vector linearization coefficients.
These coefficients are determined so that the conditional expectation of the squared
norm of the $n$-vector difference between the actual nonlinear function and the equivalent
linear function is minimized (Sunahara 1987). That is,

$$\min_{f_t,\, f_t^0} \langle \|e_t\|^2 \rangle_t, \tag{50}$$

where

$$e_t = F_t(Z_t) - (f_t Z_t + f_t^0). \tag{51}$$

Then the linearization coefficients $f_t$ and $f_t^0$ are given by

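$$f_t = \langle F_t(Z_t) Z_t^T \rangle_t \langle Z_t Z_t^T \rangle_t^{-1}, \tag{52}$$

$$f_t^0 = \langle F_t(Z_t) \rangle_t - \langle F_t(Z_t) Z_t^T \rangle_t \langle Z_t Z_t^T \rangle_t^{-1} \langle Z_t \rangle_t. \tag{53}$$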

The stochastic linearization under the Gaussian assumption is effective for weakly nonlinear
dynamic systems. In this case the coefficient $f_t$ is given, in particular, by

$$f_t = \langle \partial F_t(Z_t) / \partial Z_t^T \rangle_t. \tag{54}$$


In the same way, the nonlinear function $H_t(Z_t)$ in (4) is linearized as $h_t Z_t + h_t^0$.
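
As an illustration of the minimization in (50), the following sketch fits the linearization coefficients of a scalar nonlinearity by sample averages and compares them with the Gaussian closed form. The cubic function, the parameter values and the helper names are assumptions made for this example only, and central sample moments are used in place of the conditional moments of the text.

```python
import random

def linearize_by_sampling(F, mean, std, n=100_000, seed=0):
    """Least-squares fit F(Z) ~ f*Z + f0 for Z ~ N(mean, std^2) (scalar case)."""
    rng = random.Random(seed)
    zs = [rng.gauss(mean, std) for _ in range(n)]
    fs = [F(z) for z in zs]
    mz = sum(zs) / n
    mf = sum(fs) / n
    cov_fz = sum((f - mf) * (z - mz) for f, z in zip(fs, zs)) / n
    var_z = sum((z - mz) ** 2 for z in zs) / n
    f = cov_fz / var_z          # slope minimizing the mean square error
    f0 = mf - f * mz            # offset minimizing the mean square error
    return f, f0

def linearize_gaussian_cubic(eps, mean, std):
    """Closed form for F(z) = z + eps*z^3 under a Gaussian density: f = E[dF/dz]."""
    f = 1.0 + 3.0 * eps * (mean ** 2 + std ** 2)
    mean_F = mean + eps * (mean ** 3 + 3.0 * mean * std ** 2)
    f0 = mean_F - f * mean
    return f, f0

# Hypothetical values chosen only for the illustration.
EPS, MEAN, STD = 0.5, 0.3, 1.0
f_mc, f0_mc = linearize_by_sampling(lambda z: z + EPS * z ** 3, MEAN, STD)
f_cf, f0_cf = linearize_gaussian_cubic(EPS, MEAN, STD)
print(f_mc, f_cf, f0_mc, f0_cf)
```

The sampled and closed-form coefficients should agree to within sampling error, which is the scalar counterpart of the Gaussian result quoted above.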

In particular, consider the nonlinear function $g_i(\dot{x}_i, z_i)$, given by (7), which appears in
the differential representation of the bilinear hysteretic constitutive law. Here the suffix
$i$ denotes the $i$th bilinear hysteretic component, and $g_i(\dot{x}_i, z_i)$ is linearized in terms of $\dot{x}_i$
and $z_i$. Since the range of the hysteretic component $z_i$ is $[-\delta_i, \delta_i]$, it is evident that the
probability density function of $z_i$ is quite dissimilar to the normal density function,
whereas the probability distributions of the displacement $x_i$ and velocity $\dot{x}_i$ are
relatively close to the normal distribution. A non-Gaussian linearization is to be
performed to take account of the non-Gaussian property of $z_i$. The conditional joint
probability density function $p_t(\dot{x}_i, z_i \mid y_t)$ may be approximated by the following form
(Kobori et al 1976; Suzuki & Minai 1989):


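$$p_t(\dot{x}_i, z_i \mid y_t) = D_t \cdot w_N(\dot{x}_i, z_i \mid y_t), \tag{55}$$

$$D_t(\cdot) = \Big[ U(z_i + \delta_i) - U(z_i - \delta_i) + \delta(z_i + \delta_i) \int_{-\infty}^{-\delta_i} d\zeta_i + \delta(z_i - \delta_i) \int_{\delta_i}^{\infty} d\zeta_i \Big](\cdot), \tag{56}$$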

where $D_t$ denotes the folding operator, which is introduced to take account of the spikes
of the probability density function of the state variable $z_i$ at the boundaries,
and $w_N$ is the joint normal density function. In this case the conditional expectations
including the nonlinear term $g_i$ can be expressed in terms of the conditional moment
functions up to the second order. It is noted that the covariance matrix prescribing the
normal probability density function is different from the conditional covariance matrix
with respect to $\dot{x}_i$ and $z_i$, and a transformation between them is needed.

If the admissible control function is described as a function of the optimal estimator, 
the linearized system and observation equations are written as 

$$dZ_t = \{ f_t Z_t + f_t^0 + B_t C_t(\langle Z_t \rangle_t) \}\,dt + G_t\,dW_t, \tag{57}$$

$$dY_t = \{ h_t Z_t + h_t^0 + D_t C_t(\langle Z_t \rangle_t) \}\,dt + R_t\,dV_t, \tag{58}$$

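where, as in (5), $W_t$ and $V_t$ are assumed to be incrementally uncorrelated normalized
Wiener processes. The conditional moment equations for filtering are closed within the
second order, and are obtained simply from (20) and (29) as follows:

$$d\langle Z_t \rangle_t = \{ f_t \langle Z_t \rangle_t + f_t^0 + B_t C_t(\langle Z_t \rangle_t) \}\,dt + \langle Z_t Z_t^T \rangle_t h_t^T (R_t R_t^T)^{-1} d\nu_t, \tag{59}$$

$$d\langle Z_t Z_t^T \rangle_t = \{ f_t \langle Z_t Z_t^T \rangle_t + \langle Z_t Z_t^T \rangle_t f_t^T + G_t G_t^T - \langle Z_t Z_t^T \rangle_t h_t^T (R_t R_t^T)^{-1} h_t \langle Z_t Z_t^T \rangle_t \}\,dt, \tag{60}$$

where

$$d\nu_t = dY_t - \{ h_t \langle Z_t \rangle_t + h_t^0 + D_t C_t(\langle Z_t \rangle_t) \}\,dt. \tag{61}$$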
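
A minimal scalar sketch of how (59)-(61) might be advanced in time with an Euler step is given below. The function name, the scalar setting, the step size and the numerical values are illustrative assumptions; the control contributions $B_t C_t$ and $D_t C_t$ enter only as given numbers.

```python
import math

def filter_step(mean, var, dY, f, f0, h, h0, G, R, dt, bu=0.0, du=0.0):
    """One Euler step of the scalar linearized filter.

    mean, var : conditional mean and variance before the step
    dY        : observation increment over dt
    bu, du    : control contributions B*C and D*C (assumed known here)
    """
    innovation = dY - (h * mean + h0 + du) * dt                      # eq. (61)
    gain = var * h / (R * R)
    new_mean = mean + (f * mean + f0 + bu) * dt + gain * innovation  # eq. (59)
    new_var = var + (2.0 * f * var + G * G
                     - (var * h) ** 2 / (R * R)) * dt                # eq. (60)
    return new_mean, new_var

# Hypothetical coefficients; dY = 0 is used because only the variance is checked.
f, f0, h, h0, G, R, dt = -1.0, 0.0, 1.0, 0.0, 1.0, 0.5, 1e-3
mean, var = 0.0, 1.0
for _ in range(10_000):
    mean, var = filter_step(mean, var, dY=0.0, f=f, f0=f0, h=h, h0=h0,
                            G=G, R=R, dt=dt)

# Stationary variance: positive root of 2 f P + G^2 - (h P / R)^2 = 0.
a = (h / R) ** 2
var_stationary = (f + math.sqrt(f * f + a * G * G)) / a
print(var, var_stationary)
```

With constant coefficients the variance equation does not depend on the observations, so the integrated variance settles to the stationary root computed in the last lines.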
The functional form of $V_t(\xi_1)$ for the linearized system can be expressed in the
quadratic form

$$V_t(\xi_1) = \xi_1^T \alpha_t \xi_1 + 2 \xi_1^T \beta_t + \gamma_t. \tag{62}$$

The associated Riccati equations are given by

$$\dot{\alpha}_t + M_t + f_t^T \alpha_t + \alpha_t f_t - \alpha_t B_t N_t^{-1} B_t^T \alpha_t = 0, \tag{63}$$

$$\dot{\beta}_t + f_t^T \beta_t + \alpha_t f_t^0 - \alpha_t B_t N_t^{-1} B_t^T \beta_t = 0, \tag{64}$$

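$$\dot{\gamma}_t + \mathrm{tr}[M_t \langle Z_t Z_t^T \rangle_t] + \mathrm{tr}[\langle Z_t Z_t^T \rangle_t h_t^T (R_t R_t^T)^{-1} h_t \langle Z_t Z_t^T \rangle_t \alpha_t] + 2 (f_t^0)^T \beta_t - \beta_t^T B_t N_t^{-1} B_t^T \beta_t = 0, \tag{65}$$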

with the end conditions

$$\alpha_T = \Gamma, \tag{66}$$

$$\beta_T = 0, \tag{67}$$

$$\gamma_T = \mathrm{tr}[\Gamma \langle Z_T Z_T^T \rangle_T]. \tag{68}$$

The optimal control function is given by

$$C_t(\langle Z_t \rangle_t) = -N_t^{-1} B_t^T (\alpha_t \langle Z_t \rangle_t + \beta_t). \tag{69}$$


It is noted that the linearization coefficients $f_t$, $f_t^0$, $h_t$ and $h_t^0$ are to be determined on the
basis of the stochastic estimate. Hence, in order to solve the Riccati equations given by (63)
through (65) in the backward direction from the specified end time $T$ to the current
time, the linearization coefficients as well as the conditional moment functions $\langle Z_t \rangle_t$
and $\langle Z_t Z_t^T \rangle_t$ are to be determined in the future time interval $(t, T)$ through filtering and
prediction.

To avoid the prediction procedure, the Riccati equations given by (63) and (64) are
solved in the quasi-stationary state, in which the time-rate terms are neglected. This
approximation may be supported by the slowly varying excitation and system parameters
and the fast converging properties of $\alpha_t$ and $\beta_t$ with respect to time. Under the
quasi-stationary assumption, the matrix $\alpha_t$ is reduced to the solution to the following
time-dependent nonlinear algebraic equation:

$$M_t + f_t^T \alpha_t + \alpha_t f_t - \alpha_t B_t N_t^{-1} B_t^T \alpha_t = 0. \tag{70}$$
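
For a scalar system the quasi-stationary equation (70) has a closed-form positive root, which can be compared with a backward integration of (63). The sketch below does this; the numerical values and the zero end condition are assumptions chosen only for illustration.

```python
import math

def alpha_quasi_stationary(M, f, b, N):
    """Positive root of M + 2 f a - a^2 b^2 / N = 0 (scalar form of eq. (70))."""
    return N * (f + math.sqrt(f * f + M * b * b / N)) / (b * b)

def alpha_backward(M, f, b, N, alpha_end, horizon, dt=1e-3):
    """Integrate the scalar form of eq. (63) backward from the end time."""
    a = alpha_end
    for _ in range(int(round(horizon / dt))):
        # With tau = T - t, eq. (63) reads d(alpha)/d(tau) = M + 2 f a - a^2 b^2 / N.
        a += (M + 2.0 * f * a - a * a * b * b / N) * dt
    return a

# Hypothetical scalar data.
M, f, b, N = 1.0, -1.0, 1.0, 0.1
a_qs = alpha_quasi_stationary(M, f, b, N)
a_bw = alpha_backward(M, f, b, N, alpha_end=0.0, horizon=20.0)
print(a_qs, a_bw)
```

Far from the end time the backward solution coincides with the algebraic root, which is the fast-convergence property that the quasi-stationary approximation relies on.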


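If the solution $\alpha_t$ to (70) is obtained, the quasi-stationary solution $\beta_t$ and the associated
solution $\gamma_t$ to (65) are successively given by

$$\beta_t = (\alpha_t B_t N_t^{-1} B_t^T - f_t^T)^{-1} \alpha_t f_t^0, \tag{71}$$

$$\gamma_t = \mathrm{tr}[\Gamma \langle Z_T Z_T^T \rangle_T] + \int_t^T \{ \mathrm{tr}[M_s \langle Z_s Z_s^T \rangle_s] + \mathrm{tr}[\langle Z_s Z_s^T \rangle_s h_s^T (R_s R_s^T)^{-1} h_s \langle Z_s Z_s^T \rangle_s \alpha_s] + 2 (f_s^0)^T \beta_s - \beta_s^T B_s N_s^{-1} B_s^T \beta_s \}\,ds. \tag{72}$$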


The optimal control function is determined from (69) by making use of the solutions to 
the quasi-stationalized Riccati equations. 

The above-mentioned method for stochastic estimate-based optimal control has
the advantage of simplicity in the analytical and computational procedures. On
the other hand, the method may be inapplicable to strongly nonlinear
dynamic systems, since the stochastic linearization may not be valid (in particular, the
Gaussian assumption on the conditional probability density function may no longer
hold), and since the slowly varying and rapidly converging properties of the solution to
the Riccati equations may be lost in this case. From the practical point of view,
however, the combined method based on the stochastic linearization and
quasi-stationalization techniques may be sufficiently effective, since most cases of structural
control aim at suppressing the dynamic responses within or around the elastic
limit deformations.


6. Concluding remarks 

The stochastic estimate-based optimal control for hysteretic structural systems has 
been presented from a methodological point of view. The stochastic estimate and 
optimal control system may be described as a closed-loop system consisting of 
a nonlinear dynamic system including hysteretic characteristics subjected to the 
optimal control forces and stochastic excitations, an observer with a stochastic 
estimation scheme to identify the full state of system and excitations and a controller 
having a stochastic optimization scheme to minimize a prescribed cost function. The 
system and observation equations are expressed in the form of the Ito stochastic 
differential equations on the basis of the theory of continuous Markov vector processes. 
The admissible control functions are supposed to belong to a class of functions of
stochastic estimates, that is, a set of conditional moment functions given observation
data, and the cost function is assumed to be the conditional expectation of
an integral-type functional with a concentrated cost at the end time of control, given
initial data.

For a general class of nonlinear dynamic systems, if the admissible control functions 
are defined as functions of the full set of conditional moment functions appearing in the 
approximate conditional probability density function, the Bellman equation as well as 
the reduced Bellman equation are closed in the sense that they could be solved 
independently of the stochastic estimation procedure. Otherwise the Bellman equation 
is coupled with the conditional moment equations for filtering and prediction. As 
a rule, the coefficient functions appearing in the Bellman equation or reduced Bellman 
equation could be expressed as analytic functions of the conditional moment functions. 
For a general class of nonlinear dynamic systems including hysteretic structural 
systems, however, the solution to the Bellman equation may not be obtained without 
any approximation. A possible solution procedure to this problem is to approximate
the coefficient functions in the Bellman equation by polynomials or by a finite power
series expansion of the spatial variables corresponding to a set of conditional moment
functions. The reduced Bellman equation is then converted to the associated extended
Riccati equations which are, in general, to be solved simultaneously with the conditional
moment equations for filtering and prediction. Only in the case of the closed
Bellman equation are the associated extended Riccati equations solved independently
of the conditional moment equations, as in the case of linear quadratic systems.

From a practical point of view, in order to apply the stochastic estimate-based
optimal control scheme to large complex structural systems, an approximate procedure
based on the stochastic linearization and quasi-stationalization techniques has
been discussed and found to be effective for a class of weakly nonlinear dynamic
systems (Kawamoto et al 1990). For strongly nonlinear dynamic systems, more
accurate approximate methods are to be explored on the basis of conditional
moment equations of order higher than two. If the excitation is measurable with observation noise,
the measured excitation term is added to the drift term of the system equation and
its noise term is replaced by the observation noise of the excitation. This slight modification
maintains almost the same solution procedures as in the case of stochastic
excitations. The observation of the excitation is effective in simplifying the stochastic estimation
procedure, since it is not necessary to identify the dynamic systems generating the
excitation.

The main concerns of the present paper have been in the formulation of the 
stochastic estimate-based optimal control of nonlinear dynamic systems including 
hysteretic structural systems, and the solution techniques of the conditional moment 
equations concerning the observer and of the Bellman equation or the extended Riccati 
equations associated with the controller. There might remain, however, a number of 
matters to be further examined from analytical and computational aspects, for example 
stochastic modeling of excitations and structural system, effective non-Gaussian 
closure methods for conditional moment equations depending on the properties of 
nonlinear functions in system and observation equations, and appropriate solution
techniques for the Bellman equation or the extended Riccati equations under constraint
performance criteria.


Active control of non-stationary response of a two-degree
of freedom vehicle model with nonlinear suspension

Abstract. Active control of non-stationary response of a two-degree of
freedom vehicle model with nonlinear passive suspension elements is
considered in this paper. The method of equivalent linearization is used to
derive an equivalent linear model and optimal control laws are obtained by
using stochastic optimal control theory based on full state information.
Velocity squared quadratic damping and hysteretic type of stiffness
nonlinearities are considered. The effect of the nonlinearities on the active
system performance is studied. The performance of active suspensions with
nonlinear passive elements is found to be superior to the corresponding
passive suspension systems.

Keywords. Nonlinear vehicle model; stochastic linearization; optimal 
control; nonstationary response. 


1. Introduction 

In recent years considerable interest has been generated in the use of active vehicle 
suspensions which can improve the performance of the vehicle compared to the passive 
suspension systems. Many of the works reported in the literature are concerned with 
the active control of the stationary response of vehicle models with a constant velocity 
traverse. There are many situations in which the response of the vehicle becomes 
non-stationary, examples being (a) when it traverses a nonhomogeneous road profile at 
constant or variable velocity, (b) when the vehicle traverses a homogeneous rough road 
with variable velocity, and (c) when the vehicle traversing a smooth road suddenly 
encounters a homogeneous rough road at constant velocity. The problem of control of 
non-stationary response of linear single degree of freedom (sdf) and two-degree of
freedom (2-dof) vehicle models with active suspensions, including full state and partial
state feedback, in the cases (b) and (c) mentioned above has been treated by Narayanan &
Raju (1990, 1991).

The suspension systems of a large class of road vehicles consist of a combination of
coil and/or leaf springs and dampers. Normally in the vibration analysis of vehicle
systems, these are modelled as linear springs and dampers for simplicity of analysis. In
reality, however, the force-deformation characteristics of these springs exhibit nonlinear
behaviour, especially of the hysteretic type. Therefore, for a realistic vehicle
dynamic model it is necessary to take into consideration the nonlinearities in the 
passive suspension systems. In vehicular vibration problems nonlinearities in the 
suspensions have been taken into consideration in the vibration analysis by Kirk & 
Perry (1970), Kirk (1971), Yadav & Nigam (1978), in the context of landing gear 
response to runway unevenness, and by Rill (1983), Harrison & Hammond (1986a), 
Narayanan (1987), and El Madany (1987) in the context of response of road vehicles to 
random road undulations. All these works considered only passive suspensions. The 
active control of non-stationary response of a single degree of freedom vehicle model 
with nonlinear passive suspension elements has been considered for the first time by 
Narayanan & Raju (1991). 

In this paper, the active control of non-stationary response of a 2-dof quarter car
vehicle model with nonlinear passive elements such as quadratic dampers and hysteretic
type stiffness elements, due to variable velocity traverse, is considered. The
method of equivalent linearization is used to derive the linearized equations of motion
for the vehicle model. The hysteretic nature of the suspension spring is modelled by
a modified Bouc's (1967) model in which the restoring force of the spring is assumed to
be a combination of pre-yielding and hysteretic components. Stochastic optimal
control theory is applied to the linearized vehicle model to derive the optimal control
law based on full state information. Since the equivalent linear spring and damping
coefficients of the linearized model are functions of the response statistics of the vehicle, 
an iterative scheme is required in computing the optimal control gains and response 
statistics. The performance of the active system is compared with that of a passive 
system in terms of ride comfort, suspension stroke, roadholding and control efforts. 

2. Road input model 

Taking into account the rolling contact of the tyre with the road surface, the variable
$h(s)$ representing the road input displacement in the space domain is governed by the
second-order filter in the spatial domain given by (Harrison & Hammond 1986a)

$$h''(s) + (\alpha + \beta) h'(s) + \alpha\beta h(s) = \Gamma w(s), \tag{1}$$

where the number of primes denotes the order of differentiation with respect to the space
variable $s$ representing the traverse along the rough road, $\alpha$ and $\beta$ are, respectively,
the cutoff wave numbers of the road profile and rolling contact filters, $\sigma^2$ is the variance
of the true road profile and $w(s)$ is the spatial white noise excitation with autocorrelation
function given by

$$E[w(s_2) w(s_1)] = Q\,\delta(s_2 - s_1), \tag{2}$$

where $Q$ is the intensity of the white noise, taken to be unity, $\delta(\cdot)$ is the Dirac delta
function and $\Gamma = \beta\sigma(2\alpha)^{1/2}$.

Defining $h_1(s) = h(s)$ and $h_2(s) = h'(s)$, (1) can be put in state variable form in the space
domain as

$$h_1'(s) = h_2(s), \tag{3a}$$

$$h_2'(s) = -(\alpha + \beta) h_2(s) - \alpha\beta h_1(s) + \Gamma w(s). \tag{3b}$$

Considering $s$ as a function of time, the variables $h_1(s)$ and $h_2(s)$, which are functions
of the space variable $s$, can be regarded as functions of time, that is, $h_1(s(t)) = h_1(t)$,
$h_2(s(t)) = h_2(t)$. It is assumed that the functional form of $s$ is restricted such that $\dot{s}(t) \ge 0$
and $\dot{s}(t)$ vanishes only at isolated times $t_i$.

Noting that $d(\cdot)/ds = (dt/ds)\,(d(\cdot)/dt)$, one can express (3a) and (3b) in the form

$$\dot{h}_1(t) = \dot{s}(t) h_2(t), \tag{4a}$$

$$\dot{h}_2(t) = -\dot{s}(t)(\alpha + \beta) h_2(t) - \dot{s}(t)\alpha\beta h_1(t) + (\dot{s}(t))^{1/2} \Gamma w_1(t). \tag{4b}$$

In (4b) the delta function property $\delta(s(t)) = \delta(t - t_i)/|\dot{s}(t)|$ (Zadeh & Desoer 1963) has
been made use of, with the parameterized white noise process $w(s(t))$ being equivalent in
a covariance sense to the white noise process $w_1(t)/|\dot{s}(t)|^{1/2}$ (Harrison & Hammond
1986b), with autocorrelation function $E[w_1(t_1) w_1(t_2)] = Q\,\delta(t_2 - t_1)$.

For variable velocity traverse of the vehicle, (4a) and (4b) have time-varying
coefficients. Consequently the road input $h_1(t)$ becomes non-stationary, and so do the
response statistics of the vehicle model.


3. Vehicle equations of motion 

The 2-dof quarter car vehicle model with quadratic damping and hysteretic stiffness
between the sprung and unsprung masses is shown in figure 1. $M$ is the sprung mass,
$m$ is the unsprung mass, $c$ is the coefficient of linear viscous damping, $c_1$ is the coefficient
of the nonlinear quadratic damper, $\alpha_1$ and $k$ are the hysteretic suspension spring parameters,
and $k_t$ is the tyre stiffness.

Figure 1. Two-degree of freedom nonlinear vehicle model.

The dynamics of the nonlinear 2-dof vehicle model can be represented by the
equations of motion:

$$M\ddot{y}_1 + c\dot{q} + c_1|\dot{q}|\dot{q} + \alpha_1 k q + (1 - \alpha_1) k z - u = 0, \tag{5}$$

$$m\ddot{y}_2 - c\dot{q} - c_1|\dot{q}|\dot{q} - \alpha_1 k q - (1 - \alpha_1) k z + k_t(y_2 - h_1) + u = 0, \tag{6}$$

where $q = y_1 - y_2$ is the relative displacement between the sprung and unsprung
masses, $y_1$ and $y_2$, respectively, are the absolute displacements of the sprung and unsprung
masses, $h_1$ is the filtered road input displacement to the vehicle model, and $u$ is the
control force. The parameter $z$ representing the hysteretic displacement is given by the
nonlinear differential equation (Wen 1980)

$$\dot{z} = -\gamma|\dot{q}|z|z|^{n-1} - \eta\dot{q}|z|^{n} + A\dot{q}. \tag{7}$$

In the above equation $\gamma$, $\eta$, $A$ and $n$ are parameters fitting the force-displacement
characteristic of the hysteretic system. Parameters $\gamma$ and $\eta$ control the shape of the
hysteretic loop, $A$ the restoring force amplitude, and $n$ the smoothness of the transition
from elastic to plastic response. In the further analysis the exponent $n$ is assumed to be
unity without loss of generality. In the absence of hysteretic stiffness the equations of
motion are obtained by setting $\alpha_1 = 1$ in (5) and (6). If the quadratic damping is absent
we set $c_1 = 0$ in (5) and (6).
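
The behaviour of the hysteretic displacement in (7) with $n = 1$ can be sketched by integrating the equation for a prescribed relative displacement. In the example below the values $\gamma = \eta = 0.5$ and $A = 1.5$ are those quoted later in the paper, while the sinusoidal displacement history, its amplitude and frequency, the Euler scheme and the function name are assumptions made for illustration.

```python
import math

def simulate_hysteresis(gamma, eta, A, q_amp, freq_hz, t_end, dt=1e-4):
    """Euler integration of eq. (7) with n = 1 for q(t) = q_amp * sin(2 pi f t)."""
    omega = 2.0 * math.pi * freq_hz
    z, t = 0.0, 0.0
    history = []
    for _ in range(int(round(t_end / dt))):
        q_dot = q_amp * omega * math.cos(omega * t)
        z_dot = -gamma * abs(q_dot) * z - eta * q_dot * abs(z) + A * q_dot
        z += z_dot * dt
        t += dt
        history.append(z)
    return history

zs = simulate_hysteresis(gamma=0.5, eta=0.5, A=1.5,
                         q_amp=5.0, freq_hz=1.0, t_end=2.0)
z_bound = 1.5 / (0.5 + 0.5)   # A / (gamma + eta): limiting value of |z| for n = 1
z_peak = max(abs(z) for z in zs)
print(z_peak, z_bound)
```

For large displacement amplitudes the hysteretic displacement saturates near $A/(\gamma + \eta)$, which is why its probability density departs strongly from the Gaussian form.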


4. Equivalent linear system 

A multidegree of freedom nonlinear dynamic system can be represented in the form

$$M\ddot{X} + C\dot{X} + KX + g(X, \dot{X}) = f(t), \tag{8}$$

where $X$ is the state vector which includes $z$, the hysteretic displacement given by (7),
and $g(X, \dot{X})$ is the vector containing nonlinear elements. Let the system of equations (8)
be replaced by the equivalent linear system of equations represented by

$$M\ddot{X} + C^{*}\dot{X} + K^{*}X = f(t), \tag{9}$$

where

$$C^{*} = C + C' \quad \text{and} \quad K^{*} = K + K', \tag{10}$$

$C$ and $K$ representing, respectively, the linear parts of the damping and stiffness matrices,
and $C'$ and $K'$ are obtained by minimizing the mean square error $E[e^T e]$, where the
vector $e$ is defined by

$$e = g(X, \dot{X}) - C'\dot{X} - K'X. \tag{11}$$

Assuming the excitation vector $f(t)$ to be a zero mean Gaussian random vector, and
since the system of equations (9) is linear, the response of the equivalent linear system
can also be considered Gaussian. Under this condition, using the results of Iwan
& Yang (1972) and Atalik & Utku (1976), it can be shown that the elements of the
matrices $C'$ and $K'$ are given by

$$C'_{ij} = E[\partial g_i / \partial \dot{X}_j], \tag{12}$$

$$K'_{ij} = E[\partial g_i / \partial X_j]. \tag{13}$$

Using the above results the linearized equations of motion of the 2-dof vehicle model
can be obtained as

$$M\ddot{y}_1 + (c + c_{eq})\dot{q} + \alpha_1 k q + (1 - \alpha_1) k z - u = 0, \tag{14}$$

$$m\ddot{y}_2 - (c + c_{eq})\dot{q} - \alpha_1 k q - (1 - \alpha_1) k z + k_t(y_2 - h_1) + u = 0, \tag{15}$$

$$\dot{z} + c_h\dot{q} + k_h z = 0, \tag{16}$$

where

$$c_{eq} = 2(2/\pi)^{1/2} c_1 \sigma_{\dot{q}}, \tag{17}$$

$$c_h = (2/\pi)^{1/2} [\gamma (E(\dot{q}z)/\sigma_{\dot{q}}) + \eta\sigma_z] - A,$$

$$k_h = (2/\pi)^{1/2} [\eta (E(\dot{q}z)/\sigma_z) + \gamma\sigma_{\dot{q}}].$$
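
The equivalent coefficients above are simple functions of the response statistics, as the following sketch shows. The function name and the sample statistics passed to it are hypothetical; only the parameter values $c_1$, $\gamma$, $\eta$ and $A$ are taken from the numerical data quoted later in the paper.

```python
import math

ROOT_2_OVER_PI = math.sqrt(2.0 / math.pi)

def equivalent_coefficients(c1, gamma, eta, A, sigma_qdot, sigma_z, cov_qdot_z):
    """Equivalent linear coefficients of eq. (17) and the two relations after it.

    sigma_qdot, sigma_z : standard deviations of the relative velocity and of z
    cov_qdot_z          : E[q_dot * z]
    """
    c_eq = 2.0 * ROOT_2_OVER_PI * c1 * sigma_qdot
    c_h = ROOT_2_OVER_PI * (gamma * cov_qdot_z / sigma_qdot + eta * sigma_z) - A
    k_h = ROOT_2_OVER_PI * (eta * cov_qdot_z / sigma_z + gamma * sigma_qdot)
    return c_eq, c_h, k_h

# The response statistics below are illustrative, not results from the paper.
c_eq, c_h, k_h = equivalent_coefficients(
    c1=1000.0, gamma=0.5, eta=0.5, A=1.5,
    sigma_qdot=0.1, sigma_z=0.02, cov_qdot_z=0.0)
print(c_eq, c_h, k_h)
```

Because the coefficients depend on the statistics they are used to compute, they have to be updated as the response statistics change.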

5. Optimal control of linearized vehicle model 

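Defining an augmented state vector as

$$X_a(t) = [x_1\ x_2\ x_3\ x_4\ x_5\ x_6\ x_7]^T,$$

where

$$x_1 = y_1,\ x_2 = \dot{y}_1,\ x_3 = y_2,\ x_4 = \dot{y}_2,\ x_5 = z,\ x_6 = h_1,\ x_7 = h_2,$$

the linearized equations of motion of the vehicle, (14)-(16), and the equations
representing the filtered road input, (4a, b), can be combined to yield

$$\dot{X}_a(t) = F(t)X_a(t) + G(t)U(t) + D(t)W(t),$$

where

$$F(t) = \begin{bmatrix} F_1 & D_1 \\ 0 & F_d(t) \end{bmatrix}, \quad G(t) = \begin{bmatrix} G_1 \\ 0 \end{bmatrix}, \quad D(t) = \begin{bmatrix} 0 \\ D_d(t) \end{bmatrix},$$

$$D_1^T = \begin{bmatrix} 0 & 0 & 0 & k_t/m & 0 \\ 0 & 0 & 0 & 0 & 0 \end{bmatrix}, \quad F_d(t) = \begin{bmatrix} 0 & \dot{s}(t) \\ -\alpha\beta\dot{s}(t) & -(\alpha + \beta)\dot{s}(t) \end{bmatrix},$$

$$G_1(t) = [0\ \ 1/M\ \ 0\ \ -1/m\ \ 0]^T, \quad D_d(t) = [0\ \ \beta\sigma(2\alpha\dot{s}(t))^{1/2}]^T. \tag{22}$$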


Optimal control law based on full state feedback is obtained by minimizing a weighted
sum of the mean square values of sprung mass acceleration ($J_1$), suspension stroke ($J_2$),
roadholding ($J_3$) and control forces ($J_4$) with corresponding weighting factors $\rho_1$, $\rho_2$,
$\rho_3$ and $\rho_4$. Thus the performance index can be expressed as

$$J = E\left[ \int_{t_0}^{t_1} (\rho_1 \ddot{y}_1^2 + \rho_2 (y_1 - y_2)^2 + \rho_3 (y_2 - h_1)^2 + \rho_4 u^2)\,dt \right]. \tag{23}$$

The overall performance index can be expressed in the standard form as (Kwakernaak
& Sivan 1972)

$$J = E\left[ X_a^T(t_1) S_1 X_a(t_1) + \int_{t_0}^{t_1} [X_a^T(t)\ \ U^T(t)] \begin{bmatrix} A(t) & N(t) \\ N^T(t) & B(t) \end{bmatrix} \begin{bmatrix} X_a(t) \\ U(t) \end{bmatrix} dt \right], \tag{24}$$

where $E[\cdot]$ denotes the expectation operator, $A(t)$ is symmetric positive semi-definite, $B(t)$
is positive definite, $S_1 = S(t_1)$ is the symmetric final state weighting matrix, and $[t_0, t_1]$ is
the control period. The matrices $A(t)$, $N(t)$ and $B(t)$, which are functions of the system
parameters and the weighting factors, are given in Raju (1991).

The optimal control law based on full state information has the form (Kwakernaak &
Sivan 1972)

$$U(t) = -C(t)X_a(t), \tag{25}$$

in which $C(t)$ is the control gain vector given by

$$C(t) = B^{-1}[N^T + G^T S]. \tag{26}$$

$S(t)$ is symmetric and is the solution of the matrix differential Riccati equation

$$-\dot{S} = S(F - GB^{-1}N^T) + (F - GB^{-1}N^T)^T S - SGB^{-1}G^T S + (A - NB^{-1}N^T), \tag{27}$$

with the terminal condition $S(t_1)$.

The mean square value of the state, $P(t) = E[X_a(t)X_a^T(t)]$, is given by the solution of
the matrix differential Lyapunov equation

$$\dot{P}(t) = (F - GC)P(t) + P(t)(F - GC)^T + DQD^T, \tag{28}$$

with initial conditions $P(t_0)$.

In the problem considered here the system matrix $F(t)$ is a function of the response
statistics of the vehicle, and the Riccati equation in turn is a function of the system matrix
$F(t)$. This means that in order to be able to solve the Riccati equation, the response
statistics of the vehicle should be known a priori. This problem is overcome by an
iterative procedure described in the sequel.

Iterative procedure 

In this scheme control gains corresponding to the linear vehicle model are first 
computed over the entire period of control. These control gains are used in obtaining 
the response statistics of the nonlinear vehicle model. Then using the current response 
statistics of the nonlinear vehicle model, control gains are computed again over the 
period of control. This iterative procedure is repeated until the values of control gains 
and the response statistics at any particular time converge to a specified degree of 
accuracy. In the example considered here convergence was achieved with respect to the 
control gains and response statistics within 4 iterations. 
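
The fixed-point character of this iteration can be illustrated on a much simpler passive problem: a single degree of freedom oscillator with linear plus quadratic damping under white noise, where the equivalent damping depends on the velocity standard deviation and vice versa. This is only an analogue under assumed values; it omits the Riccati step for the control gains described above, and the function name and parameters are hypothetical.

```python
import math

def iterate_equivalent_damping(m, c, c1, Q, tol=1e-10, max_iter=100):
    """Fixed-point iteration between c_eq and the stationary velocity std.

    Model (assumed for illustration): m*x'' + (c + c_eq)*x' + k*x = w(t),
    E[w(t1) w(t2)] = Q*delta(t2 - t1), for which E[x'^2] = Q / (2*m*(c + c_eq)).
    """
    sigma_v = math.sqrt(Q / (2.0 * m * c))        # start from the linear system
    for iteration in range(1, max_iter + 1):
        c_eq = 2.0 * math.sqrt(2.0 / math.pi) * c1 * sigma_v   # as in eq. (17)
        new_sigma_v = math.sqrt(Q / (2.0 * m * (c + c_eq)))
        if abs(new_sigma_v - sigma_v) < tol:
            return new_sigma_v, c_eq, iteration
        sigma_v = new_sigma_v
    raise RuntimeError("equivalent linearization did not converge")

sigma_v, c_eq, n_iter = iterate_equivalent_damping(m=1.0, c=0.5, c1=1.0, Q=1.0)
print(sigma_v, c_eq, n_iter)
```

In the full problem each pass would also recompute the control gains from the Riccati equation before the response statistics are updated.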

6. Evaluation of performance measures 

The mean square values of the sprung mass acceleration $J_1$, suspension deflection $J_2$,
tyre deflection $J_3$, and control force $J_4$ can be evaluated from the following relations:

$$J_1 = \sum_{i=1}^{7} \sum_{j=1}^{7} \{F(2,i) - C(i)/M\}\{F(2,j) - C(j)/M\} P_{ij}, \tag{29}$$

$$J_2 = P_{11} - 2P_{13} + P_{33}, \tag{30}$$

$$J_3 = P_{33} - 2P_{36} + P_{66}, \tag{31}$$

$$J_4 = \sum_{i=1}^{7} \sum_{j=1}^{7} C(i)C(j)P_{ij}. \tag{32}$$
7. Initial conditions 


The initial conditions required in solving the Lyapunov equation (28) are obtained as in 
Raju & Narayanan (1991). The case of the vehicle starting from rest and moving with 
constant acceleration is considered in this study. The initial conditions for this case are 
given in the sequel. 

For the homogeneous filtered road profile,

$$E[h_1^2] = \sigma^2\beta/(\alpha + \beta) = P_{66}(0), \tag{33}$$

$$E[h_2^2] = \sigma^2\alpha\beta^2/(\alpha + \beta) = P_{77}(0). \tag{34}$$

Other initial conditions are

$$P_{11}(0) = P_{13}(0) = P_{16}(0) = P_{36}(0) = \sigma^2\beta/(\alpha + \beta). \tag{35}$$

Since the zero-lag covariance matrix $P(t)$ is symmetric,

$$P_{13}(0) = P_{31}(0), \quad P_{16}(0) = P_{61}(0), \quad P_{63}(0) = P_{36}(0). \tag{36}$$

All other $P_{ij}(0)$ are identically equal to zero.
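
The road-profile variances in (33) and (34) can be checked numerically by integrating the Lyapunov equation of the road filter (4a, b) at constant speed until it becomes stationary. The sketch below does so with an Euler scheme; the speed, time step and function name are assumptions, while $\alpha$, $\beta$ and $\sigma^2$ are the values quoted in the results section.

```python
def road_covariance_constant_speed(alpha, beta, sigma2, v,
                                   t_end=10.0, dt=1e-3, Q=1.0):
    """Euler integration of the Lyapunov equation for the road filter (4a, b)
    at constant speed v, starting from zero covariance."""
    gain2 = beta * beta * sigma2 * 2.0 * alpha * v * Q   # (Gamma * sqrt(v))^2 * Q
    p11 = p12 = p22 = 0.0
    for _ in range(int(round(t_end / dt))):
        d11 = 2.0 * v * p12
        d12 = v * p22 - v * alpha * beta * p11 - v * (alpha + beta) * p12
        d22 = (-2.0 * v * alpha * beta * p12
               - 2.0 * v * (alpha + beta) * p22 + gain2)
        p11 += d11 * dt
        p12 += d12 * dt
        p22 += d22 * dt
    return p11, p12, p22

alpha, beta, sigma2 = 0.15, 2.0, 75.69e-6
p11, p12, p22 = road_covariance_constant_speed(alpha, beta, sigma2, v=10.0)
var_h1 = sigma2 * beta / (alpha + beta)                 # eq. (33)
var_h2 = sigma2 * alpha * beta ** 2 / (alpha + beta)    # eq. (34)
print(p11, var_h1, p22, var_h2)
```

The stationary values do not depend on the chosen speed, as expected for a road profile that is homogeneous in space.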


8. Results and discussion 

The optimal control problem of the non-stationary response of a nonlinear 2-dof
vehicle model is solved for the variable velocity run corresponding to constant
acceleration.

The numerical data used for the vehicle and road profile model are $M = 267$ kg,
$m = 36.6$ kg, $k = 18.7$ kN/m, $c = 1398$ Ns/m, $c_1 = 1000$ Ns/m, $k_t = 184$ kN/m, $\alpha = 0.15$ m$^{-1}$,
$\beta = 2.0$, $\sigma^2 = 75.69 \times 10^{-6}$ m$^2$.

The values of the parameters for the hysteretic type suspension spring model are
chosen as $\alpha_1 = 0.2$, $\eta = \gamma = 0.5$, $A = 1.5$ and $n = 1.0$. The weighting factors used in the
performance index are $\rho_1 = 1.0$, $\rho_2 = 10^3$, $\rho_3 = 10^4$, $\rho_4 = 10^{-6}$.

To bring out clearly the influence of different types of nonlinearities on the 
performance of passive and active systems, the results are plotted for the passive and 
active systems for the cases of (i) only quadratic damping, (ii) only hysteretic spring and 
(iii) a combination of both quadratic damping and hysteretic spring. The curves 
corresponding to these three cases in the case of passive suspension are respectively 
designated by (a), (b) and (c) and in case of corresponding active suspensions by (d), (e), 
and (f) where the active system is obtained based on full state feedback. 

The performance with respect to ride comfort for the cases (a)-(f) is shown in
figure 2. From the figure it is clear that the active suspension can substantially improve
the ride quality as compared to the corresponding passive system. Also, while in the case
of the passive systems the suspension with quadratic damping performs better than
that with hysteretic spring or a combination of both, in the case of active systems the
performance with only the hysteretic spring (case (e)) is better in comparison to the
cases (d) or (f).

Figure 2. Ride performance of nonlinear passive and active systems. Passive: (a) quadratic, (b) hysteretic, (c) quadratic and hysteretic; (d), (e) and (f) corresponding active systems.

The performance of the vehicle with respect to rms stroke response (figure 3) is
somewhat reversed between the active and passive systems. In all three cases the
rms stroke response is worse for the active suspensions than for the passive suspensions.
However, when this performance criterion (stroke response) is compared separately for
the passive and active suspension schemes with respect to the type of nonlinearities
present, it is observed that the performance is best with only quadratic damping ((a)
and (d)), followed by the use of both quadratic damping and hysteretic stiffness ((c) and
(f)). The performance with nonlinearity corresponding to the hysteretic spring alone ((b)
and (e)) is below that for the cases (a) and (d), and (c) and (f).

Figure 3. rms stroke response of nonlinear passive and active systems.

The performance with respect to roadholding (figure 4) is similar to the
case of suspension stroke for the different forms of nonlinearities in the suspension. In
this case also the performance of the active suspensions is poor compared to the
passive suspensions. Also, the performance of the active suspension with only the
hysteretic spring type of nonlinearity (case (e)) is worse than that of the other two active
schemes (d) and (f). In the case of passive suspensions the types of nonlinearities do not
influence the performance with respect to roadholding very much. The poor performance
of the active suspensions in comparison with the passive suspensions with respect
to roadholding and suspension stroke may be attributed to the relatively large
weighting of the term corresponding to ride comfort in the overall performance
index.

The overall performance index for the active control schemes and for the passive
suspensions corresponding to the three types of nonlinearities considered for the
passive device is plotted in figure 5. Even though the performance of the active systems
may be better or worse than that of the corresponding passive suspensions with respect to
individual performance criteria considered separately, with respect to the overall performance
of the vehicle the active suspensions perform much better than the corresponding
passive suspensions for all the cases of nonlinearities. The variation of the
overall performance index with time in figure 5 is very similar to the
variation of the rms acceleration in figure 2. This corroborates the fact that the ride
comfort criterion is weighted relatively more in the performance index than the
roadholding and suspension stroke criteria.



Figure 4. Roadholding performance of nonlinear passive and active systems.

Figure 5. Overall performance of nonlinear passive and active systems.


The control force effort of the active schemes with full state feedback corresponding
to the three nonlinear suspensions is shown in figure 6. The control force requirement
for the case (e) of only hysteretic spring nonlinearity is the least as time increases, even
though it is slightly more for initial times than for the case (d) of only the quadratic
damping nonlinearity.

Figure 6. Control expenditure ($J_4$) for nonlinear active systems with full state feedback.


9. Conclusions

Active control of non-stationary response of an accelerating 2-dof vehicle model with
nonlinear passive elements of the quadratic type, hysteretic spring type, and a combination
of quadratic damping and hysteretic spring has been considered, using the
equivalent linearization method to formulate the application of linear stochastic
optimal control theory of time-varying systems to the problem. It has been demonstrated
that with the active control scheme with full state feedback one can control
the non-stationary response of the nonlinear vehicle model very effectively.


Nonlinear dynamics of offshore structures under sea wave
and earthquake forces

Abstract. A study of dynamic response of offshore structures in random
seas to inputs of earthquake ground motions is presented. Emphasis is
placed on the evaluation of nonlinear hydrodynamic damping effects due to
sea waves for the earthquake response. The structure is discretized using the
finite element method. Sea waves are represented by Bretschneider's power
spectrum and the Morison equation defines the wave forcing function.
Tajimi-Kanai's power spectrum is used for the horizontal ground acceleration
due to earthquakes. The governing equations of motion are obtained
by the substructure method. Response analysis is carried out using the
frequency-domain random-vibration approach.

It is found that the hydrodynamic damping forces are higher in random 
seas than in still water and sea waves generally reduce the seismic response 
of offshore structures. Studies on the first passage probabilities of response 
indicate that small sea waves enhance the reliability of offshore structures
against earthquake forces.

Keywords. Offshore structure, dynamic response, nonlinear hydrodynamic 
damping, first passage probability. 


1. Introduction 

The analysis and design of offshore structures is a complicated process that requires
several assumptions and approximations. The structures have to resist a hostile
environment and must stand on soils whose properties can be predicted only
approximately. Many loads act on offshore structures in a typical ocean environment:
wind, waves, tides, currents, ice, earthquakes, temperature loads, operational loads and
so on. An offshore structure must be designed so that it can respond to moderate
environmental loads without damage and can resist severe environmental loads without
seriously endangering the occupants. The offshore structures presently in use include
jacket-type fixed platforms, submersible or semisubmersible floating platforms, steel or
concrete gravity platforms, guyed towers, tension leg platforms, storage tanks and
submersible pipelines. The choice of a particular type of offshore structure depends on
environmental and geotechnical conditions, functional objectives, safety requirements,
and economic and political considerations. Most of the offshore structures located in
water depths of 30 m to 200 m for oil and gas exploration are jacket-type platforms. These
platforms usually consist of space frame structures with piles driven through their legs.

Offshore structures may be analyzed using static or dynamic analysis methods. 
Static analysis methods are sufficient for structures which are rigid enough to neglect 
the dynamic forces associated with the motion under the time-dependent environmental 
loadings. On the other hand, structures which are flexible due to their particular form 
and which are to be used in deep sea must be checked for dynamic loads. Dynamic 
analysis is particularly important for waves of moderate heights as they make the 
greatest contribution to fatigue damage and reliability of offshore structures. The 
dynamic analysis methods may be based on either a deterministic or a stochastic
approach. In the deterministic approach, time records of dynamic loads are used to
calculate the time-dependent structural responses. In the stochastic approach, dynamic 
loads are treated as random processes and are represented by their spectral densities. 
The structural responses are also expressed accordingly. Thus, in the deterministic 
approach, all calculations are carried out mostly in the time domain whereas in the 
stochastic approach, they are carried out mostly in the frequency domain. Stochastic 
methods have become increasingly popular as they are better suited to study the 
long-term behaviour of offshore structures for a range of sea states. 

For offshore structures assembled from slender members, evaluation of wave forces is
one of the most important steps in any dynamic analysis. A rigorous mathematical
solution has yet to be developed, and the empirical force model proposed by Morison
et al (1950) is widely used. The Morison equation is an approximate solution to a complex
problem. Numerous researchers have investigated the validity and applicability of this
equation. It appears preferable to attempt to improve this equation rather than
formulate a new one (Bea & Lai 1978). The Morison equation was originally proposed for
stationary cylindrical piles in waves. It is generalized for a non-stationary (moving)
structure by considering the relative motion between the water particle and the
structure. Since the motion of water particles in the ocean has statistical
characteristics, wave forces are also represented as stochastic processes, namely as
Gaussian and ergodic processes. Using the power spectral density function for the variations of
wave heights, power spectral density functions of wave forces can be easily determined
through the spectral approach. Weigel et al (1957) observed that close proportionality 
exists between measured wave force and sea surface spectral densities. Borgman (1967) 
confirmed their results and proposed a spectral method of formulation of wave forces 
due to random wave action on vertical piles fixed at sea bottom. Deo & Narasimhan 
(1988) have proposed a generalized spectral method of formulation of wave forces due 
to multi-directional random waves on inclined members of offshore structures. Once 
the wave forces are expressed in the spectral form, dynamic analysis can be carried out
using the principles of random vibration and the structural responses may be
determined. Foster (1970) developed a stochastic model for dynamic analysis of fixed-base
offshore towers. One of the major problems involved in the stochastic analysis is the 
efficient modelling of nonlinear hydrodynamic drag and damping forces. The studies 
by Malhotra & Penzien (1970a, 1970b) provided an improved method for stochastic 
dynamic analysis of offshore structures. A linearization technique was proposed in these
studies to tackle the nonlinear hydrodynamic drag and damping forces, the essence of
which is to alter damping coefficients in an optimal manner. The applicability of
this linearization technique for random waves, in comparison with other possible
procedures, has been discussed by Dao & Penzien (1982) and Gudmestad & Connor
(1983). Eatock Taylor & Rajagopalan (1983) showed that this linearization technique
predicts good responses only when the structural velocities are quite small
compared to water particle velocities. Nonlinear dynamic analysis methods have been
proposed by Laya et al (1984) and Jain & Dutta (1987) to deal with larger structural
velocities under regular or random waves.

For a typical offshore structure located in a seismically active region, earthquake 
loading should also be considered in the dynamic analysis. Earthquake motions are 
generated through numerous random phenomena and are essentially random in
nature. Several methods are available for the dynamic analysis of structures on land 
subjected to earthquake ground motions. However, these methods cannot be directly 
applied to offshore structures due to the presence of surrounding water and sea waves. 
Dynamic analysis is complicated due to the fact that hydrodynamic forces are coupled 
to the dynamic response. Penzien et al (1972) have presented a stochastic method of 
analysis of fixed-base offshore towers subjected to random sea waves and strong motion
earthquakes. They observed that the hydrodynamic drag effects become important 
with increasing tower period or water depth. Bea (1979) developed the design criteria 
for offshore platforms subjected to these loads and compared his results with onshore 
building structures. Anagnostopoulos (1982) carried out studies on the evaluation of 
modal solutions of offshore platforms. All these investigations treat the effects of wave 
and earthquake forces independently assuming that they do not act simultaneously. 
However, in normal seas subjected to earthquake ground motions, there may be wind-generated
waves at least 1 m to 3 m high. Therefore, it is important to investigate
whether the earthquake response of offshore structures is significantly affected by these 
sea waves. 

This paper discusses the dynamic response of offshore structures in random seas 
subjected to earthquake ground motions. Emphasis is placed on the evaluation of 
nonlinear hydrodynamic damping effects of sea waves for the seismic response. The 
input ground acceleration is horizontal and is expressed by Tajimi-Kanai’s power 
spectrum for the stationary filtered white noise. The dynamic equations of motion are 
derived using the substructure method and the response analysis is carried out using 
the frequency-domain random-vibration approach. Numerical results are presented 
for various combinations of typical sea states and earthquake ground motions. The 
response quantities are compared using the principles of first passage probabilities 
across specific barriers. 

2. Dynamic response analysis method 

Figure 1. Analytical model of offshore structure: Model A.

Figure 2. Analytical model of offshore structure: Model B.

Figures 1 and 2 show the elevations of offshore structure models with pile-soil
foundation systems. The wave force is expressed using the Morison equation, and the
nonlinear relative-velocity-squared drag term is replaced by an equivalent linearized
drag term assuming that the probability density function for the relative particle velocity
is Gaussian. In the present problem, relative velocity is the difference between the water
particle velocity due to wave motion and the structural velocity due to wave motion as
well as due to earthquake motion. The linearized Morison equation has the form:

$$\{F\} = [C_M]\{\dot v_a\} - [C_A]\{\ddot u_a\} + [C_D]\{r\}, \qquad (1)$$

where

$$[C_M] = \mathrm{diag}(\rho C_m V), \quad [C_A] = \mathrm{diag}(\rho (C_m - 1) V),$$

$$[C_D] = \mathrm{diag}\left(\rho C_d (A/2) (8/\pi)^{1/2} \sigma_r\right), \quad r = v_a - \dot u_a,$$

in which $\rho$ is the mass density of water, $V$ is the volume of the displaced water, $A$ is the area
projected in the direction of flow, $C_m$ is the inertia coefficient, $C_d$ is the drag coefficient,
$\{u_a\}$ is the displacement vector of the structure, $\{v_a\}$ is the water particle velocity vector
and $\{\dot v_a\}$ is the water particle acceleration vector at the undeflected structure
coordinate locations, and $\sigma_r$ is the rms value of the relative particle velocity, which is
obtained by an iterative procedure.
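The factor $(8/\pi)^{1/2}\sigma_r$ in the linearized drag coefficient can be checked numerically. The following is a minimal sketch, not the authors' implementation: it assumes a zero-mean Gaussian relative velocity and confirms that the slope which best fits $|r|r$ by a linear term in the mean-square sense is $E[|r|^3]/E[r^2] = (8/\pi)^{1/2}\sigma_r$. The function names and the numerical values (water density, projected area, rms velocity) are hypothetical and chosen only for illustration.

```python
import math

def linearized_drag_coefficient(rho, c_d, area, sigma_r):
    """Equivalent linear drag coefficient rho*C_d*(A/2)*sqrt(8/pi)*sigma_r."""
    return rho * c_d * (area / 2.0) * math.sqrt(8.0 / math.pi) * sigma_r

def least_squares_slope(sigma_r, n=20001, span=8.0):
    """Slope c minimizing E[(|r| r - c r)^2] for zero-mean Gaussian r.

    The minimizer is E[|r|^3] / E[r^2]; both moments are evaluated with the
    trapezoidal rule on [-span*sigma_r, span*sigma_r].
    """
    lo = -span * sigma_r
    h = 2.0 * span * sigma_r / (n - 1)
    num = 0.0
    den = 0.0
    for k in range(n):
        r = lo + k * h
        weight = 0.5 if k in (0, n - 1) else 1.0
        pdf = math.exp(-0.5 * (r / sigma_r) ** 2) / (sigma_r * math.sqrt(2.0 * math.pi))
        num += weight * abs(r) ** 3 * pdf * h
        den += weight * r * r * pdf * h
    return num / den

sigma_r = 0.8  # m/s, hypothetical rms relative velocity
slope = least_squares_slope(sigma_r)
closed_form = math.sqrt(8.0 / math.pi) * sigma_r
# Hypothetical member data: sea water density, C_d = 1.0, projected area 2.8 m^2.
c_eq = linearized_drag_coefficient(rho=1025.0, c_d=1.0, area=2.8, sigma_r=sigma_r)
```

Because $\sigma_r$ itself depends on the response, the coefficient has to be updated as the response estimate changes, which is why an iterative procedure is needed.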

The dynamic response characteristics of the structure depend on the vibrational
behaviour of the structure as well as on the dynamic characteristics of the
foundation-soil system. In the present study, the equation of motion is obtained by the
substructure method, in which the structure-pile-soil system is hypothetically divided into two
substructures: the structure and the pile-soil foundation. The equation of motion for
the total system is obtained by combining the equations of motion for the structure
subsystem and for the pile-soil subsystem, and by satisfying the compatibility
conditions of displacements and the equilibrium conditions of forces at the base nodal points.
The nodal points except at the base have three degrees of freedom: horizontal
displacement, vertical displacement, and in-plane rotation. The soil-structure
interaction is idealized as a system of spring-dashpot elements. The dynamic
stiffness coefficient of the pile-soil foundation is obtained by the impedance method.
The total displacement of the structure is treated as the sum of the dynamic
displacement of the structure for the rigidly supported base condition and the quasi-static
displacement due to the interactions with the foundation. The dynamic displacement of
the structure is treated as a linear combination of the first few vibration modes for the
rigidly supported base condition that have significant effects on the response. The
quasi-static displacement is expressed as the superposition of a few generalized
displacements.

The equation of motion for the structure-pile-soil system is finally expressed as 
(Venkataramana et al 1989): 


where

in which the subscript $a$ denotes the unrestrained nodal points above the base, the
subscript $b$ denotes the nodal points at the base, the subscript $p$ denotes the pile-soil
foundation, the subscripts $w$ and $e$ denote the wave and earthquake respectively, $q$ is the
modal displacement vector for dynamic displacements, $\Phi$ is the undamped eigenvector
(mode shape) for the structure with rigidly supported base condition, $\omega_{fj}$ is the
corresponding natural frequency for the $j$th vibration mode of the structure with rigidly
supported base condition, $\beta_{fj}$ is the corresponding damping ratio, which includes both
the structural damping and the hydrodynamic damping, and $\ddot u_g$ is the seismic ground
acceleration.

Equation (2) contains a nonproportional damping matrix and hence complex
eigenvalue analysis must be carried out for exact response analysis. However, when the
lower natural frequencies are well separated, the modal coupling effects due to
off-diagonal terms are negligible. Therefore a simplifying approximation is made in the
present study by considering only the diagonal terms of the damping matrix.

The response quantities in (2) can be expressed as a linear combination of generalized
coordinates {y} using a new eigenvector [T] for the structure-pile-soil system.
Therefore, (2) can be transformed as

$$\{\ddot y\} + \mathrm{diag}(2\beta_j\omega_j)\{\dot y\} + \mathrm{diag}(\omega_j^2)\{y\} = \cdots \qquad (3)$$

in which $\omega_j$ is the natural frequency for the $j$th vibration mode of the structure-pile-soil
system and $\beta_j$ is the corresponding damping ratio, which includes the structural damping,
the radiation damping through the pile-soil foundation and the hydrodynamic
damping.

Responses may be obtained from (3) by substituting for water particle velocities,
water particle accelerations and ground accelerations. However, these variables take
random values and require the application of the random vibration approach. Assuming that
the sea waves have the stochastic properties of a zero-mean Gaussian process, they are
represented by the one-dimensional power spectrum of the Bretschneider type, which is
a function of the statistically known mean wave height $\bar H$ and mean wave period $\bar T$. The
associated wave kinematics are computed by small amplitude wave theory. On the
other hand, the seismic characteristics of the ground motion for stationary conditions
are represented by Tajimi-Kanai’s expression for the power spectrum of ground acceleration.
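As an illustration of a Bretschneider-type wave spectrum, the sketch below uses the widely quoted two-parameter form written in terms of significant wave height and peak frequency, $S(\omega) = \frac{5}{16} H_s^2 \omega_p^4 \omega^{-5} \exp[-1.25(\omega_p/\omega)^4]$. This parameterization is an assumption made here for illustration; the paper states its spectrum in terms of mean wave height and mean wave period, and the conversion between the two sets of parameters is not given in this text. The function names and the sample sea state are hypothetical.

```python
import math

def bretschneider(omega, h_s, omega_p):
    """One-sided Bretschneider spectrum in m^2 s/rad (Hs, omega_p form, assumed)."""
    if omega <= 0.0:
        return 0.0
    ratio = omega_p / omega
    return 0.3125 * h_s ** 2 * omega_p ** 4 / omega ** 5 * math.exp(-1.25 * ratio ** 4)

def spectral_moment(spectrum, order, omega_max, n):
    """Trapezoidal estimate of the integral of omega**order * S(omega) on [0, omega_max]."""
    h = omega_max / n
    total = 0.0
    for k in range(n + 1):
        w = k * h
        weight = 0.5 if k in (0, n) else 1.0
        total += weight * (w ** order) * spectrum(w) * h
    return total

# Hypothetical sea state: Hs = 3 m, peak period 7 s.
h_s = 3.0
omega_p = 2.0 * math.pi / 7.0
m0 = spectral_moment(lambda w: bretschneider(w, h_s, omega_p), 0, 30.0, 60000)
```

For this form the zeroth moment equals $H_s^2/16$, which gives a quick consistency check on the numerical integration, and the spectrum peaks at $\omega_p$.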

Assuming that the sea wave and the input ground acceleration are stationary and mutually
independent random processes, the power spectrum of the generalized modal forces
is obtained by summing the two power spectra. The variance of the modal
responses is then expressed as

in which $[H(\omega)]$ is the complex frequency response function and $[H(\omega)^*]$ is its
conjugate. Using the results of (4), the relative response of the superstructure is determined
by transforming the modal response with the modal matrix of the superstructure.
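The frequency-domain step can be sketched for a single mode. The example below is a simplified illustration and not the paper's equation (4), which is not reproduced in this text: it assumes one uncoupled mode with unit modal mass, a one-sided force spectrum, and the standard single-mode frequency response $H(\omega) = 1/(\omega_j^2 - \omega^2 + 2i\beta_j\omega_j\omega)$. The white-noise case is used only because its variance has a known closed form, $\pi S_0 / (4\beta_j\omega_j^3)$. All names and values are hypothetical.

```python
import math

def frequency_response(omega, omega_j, beta_j):
    """Complex frequency response of one mode with unit modal mass."""
    return 1.0 / complex(omega_j ** 2 - omega ** 2, 2.0 * beta_j * omega_j * omega)

def modal_variance(force_psd, omega_j, beta_j, omega_max, n):
    """Trapezoidal estimate of the integral of |H|^2 * S_f over [0, omega_max]."""
    h = omega_max / n
    total = 0.0
    for k in range(n + 1):
        w = k * h
        weight = 0.5 if k in (0, n) else 1.0
        gain = abs(frequency_response(w, omega_j, beta_j)) ** 2
        total += weight * gain * force_psd(w) * h
    return total

# Hypothetical mode: 2 rad/s natural frequency, 5% damping, unit white-noise spectrum.
omega_j, beta_j, s0 = 2.0, 0.05, 1.0
var_numeric = modal_variance(lambda w: s0, omega_j, beta_j, 200.0, 400000)
var_exact = math.pi * s0 / (4.0 * beta_j * omega_j ** 3)
```

Because the wave and earthquake excitations are treated as independent, `force_psd` can be the sum of the two force spectra, and the resulting variances add.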

The durations of earthquake ground motions and sea waves are very different.
Therefore the responses due to these forces and their inter-relationships may be better
interpreted using the principles of first passage probabilities. The reliability $L(X)$ of the
structure corresponds to the probability of the response $x(t)$ not exceeding the value of
a critical barrier $X$ and is given as (Vanmarcke 1975):

$$L(X) = \exp[\,\cdots\,] \qquad (5)$$

where

$$\lambda_i = \int \omega^i G_x(\omega)\, d\omega \quad (i = 0, 1, 2)$$

in which $t_0$ is the duration of the input excitation, $\gamma$ is the ratio of the critical barrier to
the rms response, $G_x(\omega)$ is the power spectral density function of the response and $q_x$ is
the bandwidth parameter which represents the degree of dispersion of the spectral
density function about its central frequency.
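A first passage calculation of this kind can be sketched as follows. The exact expression (5) is not recoverable from this text, so the code uses a commonly quoted form of Vanmarcke's approximation for a double-sided barrier, $L = A \exp(-\alpha t_0)$ with $A = 1 - e^{-\gamma^2/2}$, $\alpha = 2\nu_0 [1 - \exp(-\sqrt{\pi/2}\, q_x^{1.2} \gamma)] / (e^{\gamma^2/2} - 1)$, $\nu_0 = \sqrt{\lambda_2/\lambda_0}/(2\pi)$ and $q_x = \sqrt{1 - \lambda_1^2/(\lambda_0\lambda_2)}$. This form is an assumption and may differ in detail from the one used in the paper. The function name and sample moments are hypothetical.

```python
import math

def vanmarcke_reliability(lam0, lam1, lam2, barrier, duration):
    """Probability that |x(t)| stays below `barrier` for `duration` (assumed form).

    lam0, lam1, lam2 are the spectral moments of the response; barrier must be
    positive so that the barrier-to-rms ratio is nonzero.
    """
    sigma = math.sqrt(lam0)
    gamma = barrier / sigma                                   # barrier-to-rms ratio
    nu0 = math.sqrt(lam2 / lam0) / (2.0 * math.pi)            # mean zero-upcrossing rate
    q = math.sqrt(max(0.0, 1.0 - lam1 ** 2 / (lam0 * lam2)))  # bandwidth parameter
    start_below = 1.0 - math.exp(-0.5 * gamma ** 2)           # starts below the barrier
    clumping = 1.0 - math.exp(-math.sqrt(math.pi / 2.0) * q ** 1.2 * gamma)
    alpha = 2.0 * nu0 * clumping / (math.exp(0.5 * gamma ** 2) - 1.0)
    return start_below * math.exp(-alpha * duration)

# Hypothetical spectral moments of a fairly narrow-band response.
lam0, lam1, lam2 = 1.0, 1.9, 4.0
rel_short = vanmarcke_reliability(lam0, lam1, lam2, barrier=3.5, duration=100.0)
rel_long = vanmarcke_reliability(lam0, lam1, lam2, barrier=3.5, duration=1000.0)
rel_low_barrier = vanmarcke_reliability(lam0, lam1, lam2, barrier=2.5, duration=100.0)
```

The sketch reproduces the qualitative behaviour relied on in the text: reliability falls as the excitation lasts longer and rises as the barrier is raised relative to the rms response.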

3. Numerical results and discussions 

The dynamic response analysis is carried out for model A and model B shown in 
figures 1 and 2 respectively. Model A has a lower fundamental frequency compared 
to that of model B. The depth of water is 110 m from mean sea level. The main members 
have an outer diameter of 2.8 m and a thickness of 27 mm. Each leg of the tower rests on
a pile-soil foundation. The structural members as well as the piles in the foundation are
made of steel. Table 1 shows the details of pile-soil foundation. The shear wave velocity
$V_s$ in the soil is assumed to be 100 m/s. The natural frequencies and vibrational mode
shapes are computed by eigenvalue analysis. The values of natural frequencies are 
shown in table 2 for up to the third mode of vibration. The structural damping for the 
first vibration mode of the structure subsystem is assumed to be 2%. When the 
soil-structure interaction is considered, by incorporating the pile-soil foundation in 
the offshore structure model, the structural and radiation damping is found to be
1.35% for model A and 1.95% for model B.

Figure 3 shows the example of Bretschneider’s wave energy spectrum for mean wave 
height of 3 m, and mean wave periods of 5 s and 7 s. Higher frequency components have 
smaller energy while lower frequency components have greater energy. Figure 4 is 
a plot of Tajimi-Kanai’s ground acceleration spectrum for rms ground accelerations of 
30 gal (moderate earthquakes) and 100 gal (severe earthquakes). Marine soil is usually 
soft and therefore a characteristic ground frequency of 10 rad/s and damping ratio of
0.6 are assumed. It is seen that sea wave motion is a narrow band process whereas
earthquake ground motion is relatively a broad band process. The dominant peak 
frequencies of these two types of loadings are also well separated. 
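The ground acceleration spectrum can be sketched with the standard Kanai-Tajimi filtered white-noise form, $S(\omega) = S_0 [1 + 4\zeta_g^2(\omega/\omega_g)^2] / \{[1 - (\omega/\omega_g)^2]^2 + 4\zeta_g^2(\omega/\omega_g)^2\}$, using the ground frequency of 10 rad/s and damping ratio of 0.6 quoted above. How the paper scales the intensity $S_0$ to a given rms acceleration is not stated in this text; the sketch assumes a one-sided spectrum integrated up to a finite cut-off frequency, which is a modelling choice made here because the spectrum tail decays slowly. Function names and the cut-off value are hypothetical.

```python
import math

def tajimi_kanai(omega, s0, omega_g=10.0, zeta_g=0.6):
    """Filtered white-noise ground acceleration spectrum (Kanai-Tajimi form)."""
    x2 = (omega / omega_g) ** 2
    num = 1.0 + 4.0 * zeta_g ** 2 * x2
    den = (1.0 - x2) ** 2 + 4.0 * zeta_g ** 2 * x2
    return s0 * num / den

def intensity_for_rms(rms_acc, omega_g=10.0, zeta_g=0.6, omega_max=500.0, n=100000):
    """Scale S0 so the one-sided spectrum integrates to rms_acc**2 on [0, omega_max]."""
    h = omega_max / n
    area = 0.0
    for k in range(n + 1):
        weight = 0.5 if k in (0, n) else 1.0
        area += weight * tajimi_kanai(k * h, 1.0, omega_g, zeta_g) * h
    return rms_acc ** 2 / area

s0_moderate = intensity_for_rms(30.0)   # rms ground acceleration of 30 gal
s0_severe = intensity_for_rms(100.0)    # rms ground acceleration of 100 gal
```

With a damping ratio of 0.6 the peak near the ground frequency is low and broad, which is consistent with the description of earthquake ground motion as a relatively broad band process.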

Figure 3. Wave energy spectrum.

Figure 4. Ground acceleration spectrum.

The Morison equation is used to compute the hydrodynamic forces due to the
surrounding water and sea waves. The values of the inertia coefficient $C_m$ and the drag
coefficient $C_d$ are assumed to be 2.0 and 1.0 respectively. The linearized drag term is
computed by a cyclic procedure by altering the damping coefficients in an optimal
manner. In the present study, reasonable convergence was attained in about three
cycles of iteration.
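The cyclic procedure can be illustrated with a deliberately simple stand-in for the full structural model. The sketch below assumes a one-degree-of-freedom oscillator in still water driven by a white-noise force with two-sided spectral density $S_0$, for which the velocity variance is $\pi S_0/(c\,m)$. The drag damping depends on the rms velocity through the $(8/\pi)^{1/2}$ linearization factor, so the damping and the rms velocity are updated in turn until they agree. The function name, the parameter `drag_factor` (standing for $\rho C_d A/2$) and all numerical values are hypothetical, and the number of cycles needed by this toy model need not match the paper's.

```python
import math

def iterate_linearized_damping(mass, c_struct, drag_factor, s0, tol=1e-6, max_iter=50):
    """Fixed-point iteration on the rms velocity of a 1-dof oscillator.

    drag_factor stands for rho*C_d*A/2; s0 is a two-sided white-noise force
    spectral density. Returns (total damping, rms velocity, cycles used).
    """
    c_total = c_struct
    sigma_v = math.sqrt(math.pi * s0 / (c_total * mass))
    for cycle in range(1, max_iter + 1):
        # Update the equivalent damping from the current rms velocity.
        c_total = c_struct + drag_factor * math.sqrt(8.0 / math.pi) * sigma_v
        # Recompute the rms velocity with the updated damping.
        sigma_new = math.sqrt(math.pi * s0 / (c_total * mass))
        if abs(sigma_new - sigma_v) <= tol * sigma_new:
            return c_total, sigma_new, cycle
        sigma_v = sigma_new
    return c_total, sigma_v, max_iter

# Hypothetical data: 100 t mass, structural damping 2e4 N s/m, drag factor 1435 kg/m.
c_total, sigma_v, cycles = iterate_linearized_damping(1.0e5, 2.0e4, 1435.0, 1.0e8)
```

When waves are present, the rms of the relative velocity between water and structure replaces the rms structural velocity in the update step, which is what makes the hydrodynamic damping larger in a random sea.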

Figures 5 and 6 show the rms response displacement at node 1 (top node) of models
A and B respectively, plotted against rms ground acceleration. The
response due to earthquake input generally increases with the ground
acceleration. When sea waves are neglected, hydrodynamic damping has a small effect
on the response and the response increases linearly with the input
ground acceleration. For rms ground accelerations above about 30 gal,
the displacement responses due to earthquake forces in random seas are lower than
those due to earthquake forces in still water. The difference between these responses
tends to increase with the mean wave height and the mean wave period.
However, the probability of the simultaneous occurrence of severe earthquake
motions and very strong sea waves is very small. In moderate wave conditions, which
correspond to mean wave heights under about 3 m, the probability that the offshore
structure is hit by severe earthquake motions is considerable. Also, when the offshore
structure is subjected to earthquake motions under about 20 gal, the responses due to
simultaneous loadings of earthquake motions and sea wave forces are generally larger
than those due to earthquake motions only.

Figure 5. RMS displacement at node 1 for model A (curves: earthquakes in still water;
earthquakes in a random sea).

Figure 6. RMS displacement at node 1 for model B (curves: earthquakes in still water;
earthquakes in a random sea).

The response characteristics are very sensitive to the intensity of earthquake motions
and sea wave forces. Further, in the absence of sea waves, nonlinear hydrodynamic
damping forces, which are proportional to the square of structural velocities, are
relatively small. On the other hand, if sea waves are included, the hydrodynamic
damping forces are proportional to the square of the relative velocities between the
waves and the structure. Since the wave velocities are very much higher than the
structural velocities, nonlinear hydrodynamic damping forces become larger than those
without waves. Examples of hydrodynamic damping ratios for the first vibration mode of the
structure-pile-soil system are shown in table 3 for mean wave height of 3 m, mean wave
period of 7 s and rms ground acceleration of 30 gal.

Figures 7 to 10 show the rms response displacement at node 1 for sea waves of mean
heights ranging from 3 m to 7 m under specified earthquake forces. When the rms
ground acceleration is 30 gal, responses due to simultaneous loadings of earthquake
motion and small sea waves are generally smaller than those due to
earthquake motions in still water. But when the earthquake motions are severe, the
responses due to wave and earthquake loadings are very much smaller than those due
to earthquake forces in still water, even for large wave conditions. Therefore the
responses are mainly controlled by the earthquake forces, the proximity of the
excitation frequency to the fundamental frequency of the offshore structure system and
the nonlinear hydrodynamic damping due to sea waves.

Figure 7. RMS displacement at node 1 for model A (curves: earthquakes in still water;
earthquakes in a random sea).

Figure 8. RMS displacement at node 1 for model B (curves: earthquakes in still water;
earthquakes in a random sea).

The reliabilities on the first passage of responses, which are functions of the duration
times of sea waves and earthquake ground motions, are calculated using Vanmarcke’s
expression (5), and are plotted in figure 11 for model A and in figure 12 for model B. The
numbers in the figures indicate the rms ground accelerations. As a rough estimate,
$3.5\sigma_x$, where $\sigma_x$ is the rms response displacement at node 1 and 3.5 is the peak factor,
is used as the mean extreme response or barrier level $X$. The duration time $t_0$ is expressed in
multiples of the first natural period $T_0$. The reliabilities of the structure for earthquake
forces are generally higher when sea waves are included in the analysis since the
hydrodynamic forces reduce the response values.

Figure 9. RMS displacement at node 1 for model A, plotted against mean wave period
(curves: earthquakes in still water; earthquakes in a random sea).

Figure 10. RMS displacement at node 1 for model B, plotted against mean wave period
(curves: earthquakes in still water; earthquakes in a random sea).

Figure 11. Reliabilities on level crossing of response displacement: Model A
(curves: earthquakes in still water; earthquakes in a random sea).

The two loads considered in the present research are the wave and earthquake loads. 
Sea wave is a narrow-band steady-state process of several hours duration and 
earthquake is a broad-band transient process occurring for a few minutes. Comparing 
only the rms response of structure due to these loads is not sufficient for understanding 
the mechanism of hydrodynamic damping effects of sea waves on the seismic response. 
Therefore a first passage probability analysis, which takes into account the differences 
in their duration times, is carried out in this paper. Of course, one may question the 
validity of such an approach. But this method will serve as a tool for understanding the 
basic concepts and for realizing the extent of damping effects of sea waves on the 
earthquake response evaluation. 


4. Conclusions 

The dynamic response analysis of offshore structures in random seas has been carried
out for the input of earthquake loadings. Two models are considered for
numerical study: one is a relatively flexible structure and the other is a relatively rigid
structure. The results of analysis indicate that responses due to earthquake motions
vary with the intensity of the input ground acceleration. In the absence of sea waves, the
effects of nonlinear hydrodynamic damping are small. When sea waves are considered,
hydrodynamic damping forces are proportional to the square of the relative velocities
between the waves and the structure. Since the wave velocities are very much higher
than the structural velocities, these damping forces become larger than those without
waves.

In the present paper, the results are mainly based on frequency-domain analysis. As
a next step, time-domain analysis is being carried out. The findings of that study will be
published later.

Figure 12. Reliabilities on level crossing of response displacement: Model B
(curves: earthquakes in still water; earthquakes in a random sea).

Stochastic response of guyed tower under second-order
wave force

T K DATTA, R S BISHT and A K JAIN

Civil Engineering Department, Indian Institute of Technology, Hauz Khas, 
New Delhi 110016, India 

Abstract. Guyed towers have low natural frequencies compared with
generally encountered wave frequencies. They are therefore highly
susceptible to low frequency second-order wave forces resulting from the
nonlinear phenomenon in the integrated fluid pressure on the submerged
structure. A simplified approach is presented here to obtain the
second-order wave forces encountered in a random sea state for offshore guyed
towers using a simulation procedure. The nonlinear response of a guyed tower
to second-order and first-order wave forces, and to a combination of the two,
is obtained in the time domain. The time histories of responses and the wave
forces are analysed to investigate their stochastic characteristics.

Keywords. Offshore guyed tower; second-order wave force; drift force; 
nonlinear vibrations; stochastic characteristics. 


1. Introduction 

The basic feature of a guyed tower platform is that it is designed to move with the waves, in 
at least some of its degrees of freedom. The system opposes wave forces predominantly by 
inertial effects. This is in contrast to the fixed platform which opposes wave forces primarily 
by the stiffness effect in the structure-foundation system. By their very nature, guyed 
towers have low fundamental frequency and, therefore, are susceptible to excitation at 
much lower frequencies than those of waves. Analysis of these structures for first-order 
wave forces would then invariably predict very low responses. The effect of second- 
order wave forces, which have generally significant energy content at low frequencies, 
on the response of such platforms must be included in their overall analysis. 

Investigation of the second-order drift forces, exciting low frequency responses, has been
extensively conducted for floating structures like buoys, semisubmersibles etc. Potential
theory was used to derive the second-order drift forces by Pinkster (1979), Pinkster &
Huijsmans (1982), Standing et al (1981), Chakrabarti & Cotter (1984), Eatock Taylor (1986)
and Karppinen (1979), where the wave drift force was defined as the second-order term in
the integrated fluid pressure on the submerged body and was found to be proportional
to the square of the wave amplitude. Salvesen et al (1982), Wahab (1974), Pijfers
& Brink (1977) and Ferretti & Berta (1980) studied the slowly varying motion caused by
the second-order viscous drift force due to interaction between current, wind and wave
force. The drift force was defined as the mean value of the total force over a wave period
and was derived by using Morison’s equation. The time histories of wave drift force for
random waves were generated with the help of the square of the slowly varying wave
envelope (hence, second order) and the drift force operator (i.e. drift force coefficient
plotted against frequency of encounter).

Hsu & Blenkarn (1970) analysed the peak mooring forces caused by
the vessel’s slow drift oscillation in a random sea. They calculated the wave drift force
based on conservation of momentum principles and the concept of “radiation stress”
introduced by Longuet-Higgins & Stewart (1964). Remery & Hermans (1971) also
investigated the slow drift oscillations of a moored object in random seas. The drift
force was calculated as a function of the square of the reflected and scattered wave
elevation. The reflection coefficient was determined from experimental tests on
a rectangular barge in head waves.

Kim & Yue (1989) have presented a new method for predicting slowly varying wave 
drift excitations in multidirectional seas. The method retains the assumption of narrow 
bandedness in frequency but treats the wave directional spreading exactly. Naess (1989) 
has emphasised the importance of estimating the extreme values of the combined first order 
and slow drift response of an offshore structure, subjected to a stationary irregular sea 
wave. The study indicates a strong coupling between the first order and the slow drift 
response. Kato et al (1990) presented an approximate solution for calculating the power 
spectral density function of total responses including first-order as well as second-order 
motions. Also, they showed that there exists a coupling between the first- and second-order 
responses. Grue (1988) has examined wave drift damping and low frequency oscillations of 
a moored elliptic cylinder. For long incoming waves he has obtained positive damping. 
But for short incoming waves damping forces were found to be negative. Langley (1987) 
has derived the probability density function for low frequency second-order forces and
motions using a method which represents the random sea state as a sum of regular 
waves. Grue & Palm (1986) have analysed the influence of a uniform current on slowly 
varying forces and displacements and the results show that current may have a great 
impact on the slowly varying motion of a moored body. 

Jain & Datta (1991) have carried out investigations on articulated towers for the 
response due to viscous drift forces. They have calculated the slowly varying drift force 
by assuming that the drift force is proportional to the product of the square of the wave 
envelope and the drift force coefficient in regular waves. 

Herein, the second-order wave force encountered
in the nonlinear oscillation of a guyed tower in random sea states is determined by
a simplified procedure similar to that proposed by Pijfers & Brink (1977). The guyed tower is
analysed in the time domain for the second-order wave force, the first-order wave force and
a combination of the two. The time histories of the nonlinear response as well as the
wave force are analysed for a particular sea state in order to highlight the stochastic
behaviour of the guyed tower in low frequency oscillation.

2. Analysis 

2.1 Guyed tower model 

The Exxon guyed offshore tower model has been considered to carry out the present" 
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Figure 1. Schematic Exxon 
prototype guyed tower. 


investigations. The tower is shown in figure 1. The bottom of the tower is provided with 
a restricted rotational restraint. For the present study the tower has been idealized as 
an equivalent uniform shear beam with a rotational spring at the base of the tower as 
shown in figure 2. Varying degrees of rotational movement of the tower may be 
achieved by varying the rotational spring stiffness. The guylines have been idealized by 
a nonlinear spring. The contribution of buoyancy to the restoring force in these
structures is generally small compared with that from the guyline stiffness and is
therefore neglected in this study (Dutta 1984). Further, it is assumed that the dynamics
of the guylines does not significantly influence the global tower motion. Thus, the
stiffness of the nonlinear spring is obtained from a separate static analysis of the
guylines under their own weight and current induced forces. The typical force excursion
relationship of the guylines for the example problem is shown in figure 3.

The offshore tower is discretized into a number of plane 2D beam elements. At each 
node, the dynamic degrees of freedom consist of sway translations and in plane 
rotations. The equation of motion in structural co-ordinates takes the form 

$$[M]\{\ddot{x}\} + [C]\{\dot{x}\} + [K]\{x\} = F(t). \tag{1}$$

In the mass matrix [M], added mass of the element is considered up to still water 
level. The effect of variable added mass due to the instantaneous position of the tower is 
assumed to be negligible and is ignored in the formulation of the problem. The damping 
matrix [C] of the system is assumed to be a linear combination of (constant) mass and
stiffness matrices; $\{\ddot{x}\}$, $\{\dot{x}\}$ and $\{x\}$ are, respectively, the vectors of structural
acceleration, velocity and displacement. The stiffness matrix [K] is made time invariant by
considering in it only the initial tangent stiffness of the nonlinear spring representing
the guy system. As a result, a correction to the resistive force is required at every instant
of time, and it is incorporated in the right-hand side load vector F(t). The equations
of motion are finally uncoupled using normal mode theory and an iterative technique.
As a result, the damping matrix [C] need not be explicitly known; only the modal damping
ratios are required for the solution of the equation of motion. F(t) depends upon the
type of analysis being performed. For the first-order wave force analysis, F(t) is due to
the hydrodynamic loading calculated from the inertia and drag effects produced by the
water particle kinematics. For the second-order wave force analysis, the time history of
F(t) is simulated by considering low frequency wave forces resulting from nonlinear
phenomena in the integrated fluid pressure acting on the structure. For the complete
analysis, F(t) is a combination of first-order and second-order wave forces. It is to be
noted that the first-order wave force F(t) depends on both structural velocity (due to
nonlinear drag) and structural displacement (due to the nonlinear force excursion
relationship of the guylines) apart from the water particle kinematics. The second-order
wave force depends upon the nonlinear response of the guyed tower in regular waves of
different periods, as explained later. Thus (1) becomes highly nonlinear, especially where
first- and second-order wave forces are considered together.

Figure 2. Finite element model of guyed tower.

Figure 3. Force excursion relationship for prototype guyed tower.

2.2 Simulation of second-order wave forces 

The time histories of the drift forces are obtained using a method proposed by 
Pijfers & Brink (1977). According to this method, a mean hydrodynamic force at 
various nodes is calculated, which is the time average of all hydrodynamic forces acting 
on the structure, obtained through the Morison’s concept. The wave drift force is then 
defined as the mean force in consequence of the waves and can be calculated by 
subtracting the true current force from the hydrodynamic force. The method is based 
on the assumption that the wave spectrum is sufficiently narrow. A random, long 
crested sea may then be written as an amplitude modulated signal: 

$$\delta(t) = \mathrm{Re}\,\Omega(t) = \mathrm{Re}\left[R(t)\exp\{i\tau(t)\}\right], \tag{2}$$

where $\delta(t)$ = wave amplitude, $R(t)$ = slowly varying wave envelope, $\Omega(t)$ = a complex
variable whose real part provides the wave amplitude, and $\tau(t)$ = the phase angle, equal to
$\omega_c t + \varepsilon(t)$, where $\omega_c$ is the central frequency of the narrow band spectrum and $\varepsilon(t)$ is
a slowly varying phase angle.

The above formulation leads to the expression for a momentary frequency of the
spectrum as:

$$\dot{\tau}(t) = \omega_c + \dot{\varepsilon}(t). \tag{3}$$

The slowly varying drift force is then calculated assuming that the drift force is
proportional to the product of the square of the wave envelope $R(t)$ and the drift force
coefficient in regular waves at the momentary frequency $\dot{\tau}(t)$. The drift force $D_i(t)$ at any
node i of the tower is then given by

$$D_i(t) = 0.5\,\rho_w g\, C_i(t)\, R^2(t). \tag{4}$$

Evaluation of $D_i(t)$ from (4) requires the time histories of $R^2(t)$ and $\dot{\tau}(t)$, and the drift
force coefficient $C_i$ at each momentary frequency $\dot{\tau}(t)$.

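2.3 Determination of time histories of $R^2(t)$ and $\dot{\tau}(t)$

The wave envelope R is the modulus of a complex variable, i.e.

$$R^2 = (\mathrm{Re}\,\Omega)^2 + (\mathrm{Im}\,\Omega)^2. \tag{5}$$

Also,

$$\tau = \tan^{-1}\left[\mathrm{Im}\,\Omega / \mathrm{Re}\,\Omega\right], \tag{6}$$

and

$$\dot{\tau} = \frac{(\mathrm{Re}\,\Omega)(\mathrm{Im}\,\dot{\Omega}) - (\mathrm{Im}\,\Omega)(\mathrm{Re}\,\dot{\Omega})}{(\mathrm{Re}\,\Omega)^2 + (\mathrm{Im}\,\Omega)^2}
= \frac{1}{R^2}\left[(\mathrm{Re}\,\Omega)(\mathrm{Im}\,\dot{\Omega}) - (\mathrm{Im}\,\Omega)(\mathrm{Re}\,\dot{\Omega})\right]. \tag{7}$$

The irregular wave amplitude $\delta(t)$ is defined as $\mathrm{Re}\,\Omega(t)$ and is given by the sum of a large
number of regular waves, i.e.

$$\delta(t) = \mathrm{Re}\,\Omega(t) = \sum_n A_n \cos(\omega_n t + \theta_n), \tag{8}$$

in which $A_n$ is obtained according to the well-known formula

$$A_n = \left[2 S_a(\omega_n)\,\Delta\omega_n\right]^{1/2}. \tag{9}$$

$S_a(\omega_n)$ is the spectral density ordinate of the wave spectrum at frequency $\omega_n$; $\theta_n$ is the
random phase angle between 0 and $2\pi$ radians; $\omega_n$ is the randomly varying frequency
and $\Delta\omega_n$ is the frequency interval. Since $\delta(t)$ is considered as $\mathrm{Re}\,\Omega(t)$, the imaginary part
of $\Omega(t)$ is found to be

$$\mathrm{Im}\,\Omega(t) = \sum_n A_n \sin(\omega_n t + \theta_n). \tag{10}$$

The time derivatives of $\mathrm{Re}\,\Omega$ and $\mathrm{Im}\,\Omega$ are therefore

$$\mathrm{Re}\,\dot{\Omega} = -\sum_n A_n \omega_n \sin(\omega_n t + \theta_n), \tag{11}$$

$$\mathrm{Im}\,\dot{\Omega} = \sum_n A_n \omega_n \cos(\omega_n t + \theta_n). \tag{12}$$

Using (8) to (12), the time histories of $R^2(t)$ and $\dot{\tau}(t)$ can be determined from (5) and (7).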
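
The steps in (5) and (7) to (12) can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' program: the function names, the use of a caller-supplied spectrum function and the equally spaced frequencies are assumptions made here. Note that the momentary frequency is undefined at instants where the envelope vanishes.

```python
import math
import random


def wave_components(spectrum, omegas, d_omega, rng):
    """Amplitudes A_n = sqrt(2 S(w_n) dw) as in (9), with random phases in [0, 2*pi].

    `spectrum` is any callable returning the one-sided spectral ordinate
    (a hypothetical interface chosen for this sketch).
    """
    amps = [math.sqrt(2.0 * spectrum(w) * d_omega) for w in omegas]
    phases = [rng.uniform(0.0, 2.0 * math.pi) for _ in omegas]
    return amps, phases


def envelope_and_frequency(t, amps, omegas, phases):
    """Return (R^2, momentary frequency) at time t using (5), (7), (8), (10)-(12)."""
    re = im = re_dot = im_dot = 0.0
    for a, w, th in zip(amps, omegas, phases):
        c, s = math.cos(w * t + th), math.sin(w * t + th)
        re += a * c            # Re(Omega), equation (8)
        im += a * s            # Im(Omega), equation (10)
        re_dot -= a * w * s    # d/dt Re(Omega), equation (11)
        im_dot += a * w * c    # d/dt Im(Omega), equation (12)
    r2 = re * re + im * im                     # equation (5)
    tau_dot = (re * im_dot - im * re_dot) / r2  # equation (7)
    return r2, tau_dot
```

For a single regular wave the sketch returns a constant envelope square equal to the squared amplitude and a momentary frequency equal to the wave frequency, which is a convenient sanity check before using many components.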

2.4 Determination of drift force coefficient

The drift force coefficient $C_i$ at any given momentary frequency $\dot{\tau}$ is obtained from the
plot of $C_i$ vs $\sigma_e$, where $\sigma_e$ is the frequency of encounter. For a given regular wave of
amplitude $\delta_a$ and frequency $\sigma$, the drift force coefficient $C_i$ is given by

$$C_i = D_i / (0.5\,\rho_w g\,\delta_a^2), \tag{13}$$

where $D_i$ is the mean drift force and is found by subtracting the true current force from
the mean hydrodynamic force, i.e.

$$D_i = \bar{F}_i - F_c, \tag{14}$$

where $F_c$ is the current force due to a constant current profile across the depth, $\bar{F}_i$ is the mean
hydrodynamic force and i is the node number.

The frequency of encounter is given by:

$$\sigma_e = \sigma(1 + \sigma U_c / g). \tag{15}$$

The mean hydrodynamic force $\bar{F}_i$ at node i is obtained by averaging the
hydrodynamic force over one period of encounter T, i.e.

$$\bar{F}_i = (1/T)\int_0^T f_i(t)\,dt, \tag{16}$$

where $f_i(t)$ = hydrodynamic force at node i.

The hydrodynamic force $f_i(t)$ is calculated from Morison's equation, viz.

$$f_i(t) = 0.5\,\rho_w C_d D_d\,|\tilde{U}_i|\,\tilde{U}_i + 0.25\,C_m \rho_w \pi D_m^2\,\dot{U}_i - 0.25\,\pi \rho_w (C_m - 1) D_m^2\,\ddot{X}_i. \tag{17}$$

The relative velocity $\tilde{U}_i$ comprises the mass transport velocity $V_{mi}$, the water particle velocity
$U_i$, the current velocity $U_c$ and the structural velocity $\dot{X}_i$, and is given by

$$\tilde{U}_i = V_{mi} + U_i + U_c - \dot{X}_i, \tag{18}$$

in which $V_{mi}$, in deep water, according to Stokes theory, is given by

$$V_{mi} = \delta_a^2 K \sigma \exp(-2Ky), \tag{19}$$

where K is the wave number and y is a function of the water depth.
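
To make the chain (13) to (19) concrete, the following Python sketch evaluates the drift force coefficient for one regular wave. It is a simplified illustration under assumptions that are not in the paper: the node is held fixed (structural velocity and acceleration set to zero), deep-water linear kinematics are used with the dispersion relation K = σ²/g, the force is per unit length (nodal lumping is omitted), and the water density value and all function names are hypothetical.

```python
import math

RHO_W = 1025.0  # sea water density in kg/m^3 (assumed value)
G = 9.81        # gravitational acceleration in m/s^2


def encounter_frequency(sigma, u_c):
    """Frequency of encounter, equation (15)."""
    return sigma * (1.0 + sigma * u_c / G)


def morison_force(u_rel, u_dot, x_ddot, c_d, c_m, d_drag, d_inertia):
    """Morison force per unit length, equation (17)."""
    drag = 0.5 * RHO_W * c_d * d_drag * abs(u_rel) * u_rel
    inertia = 0.25 * math.pi * RHO_W * c_m * d_inertia ** 2 * u_dot
    added_mass = 0.25 * math.pi * RHO_W * (c_m - 1.0) * d_inertia ** 2 * x_ddot
    return drag + inertia - added_mass


def drift_coefficient(delta_a, sigma, u_c, y, c_d, c_m, d_drag, d_inertia,
                      n_steps=2000):
    """Drift force coefficient (13) for a FIXED node (simplifying assumption)."""
    k = sigma ** 2 / G                          # deep-water wave number (assumed)
    sigma_e = encounter_frequency(sigma, u_c)
    period = 2.0 * math.pi / sigma_e            # period of encounter
    decay = math.exp(-k * y)
    v_m = delta_a ** 2 * k * sigma * math.exp(-2.0 * k * y)  # equation (19)
    total = 0.0
    for j in range(n_steps):                    # midpoint rule for (16)
        t = (j + 0.5) * period / n_steps
        u = delta_a * sigma * decay * math.cos(sigma_e * t)
        u_dot = -delta_a * sigma * sigma_e * decay * math.sin(sigma_e * t)
        u_rel = v_m + u + u_c                   # equation (18) with zero structural velocity
        total += morison_force(u_rel, u_dot, 0.0, c_d, c_m, d_drag, d_inertia)
    f_mean = total / n_steps
    f_current = 0.5 * RHO_W * c_d * d_drag * abs(u_c) * u_c
    return (f_mean - f_current) / (0.5 * RHO_W * G * delta_a ** 2)  # (14), (13)
```

In the paper the structural motion is not neglected; it enters through the response analysis of the tower, so this sketch only shows the bookkeeping of the averaging and normalisation.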


Since the computation of $f_i(t)$ as given by (17) requires $\dot{X}_i$ and $\ddot{X}_i$, the response of the
tower is to be obtained for different combinations of the regular wave of amplitude $\delta_a$
and frequency $\sigma$. The response analysis of the guyed tower for regular waves of
amplitude $\delta_a$ and frequency $\sigma$ is performed by solving (1) in time domain using the Newmark
$\beta$ method. Because of the dependence of F(t) on structural velocity and displacement,
the equation of motion is solved by numerical integration using iteration at each time
step to tackle all the nonlinearities. The efficiency of the analysis is augmented by the
use of the normal mode theory (Bathe & Wilson 1976).
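
The idea of keeping a constant stiffness on the left-hand side and iterating on a response-dependent load at each time step can be illustrated for a single degree of freedom. The Python sketch below uses the constant average acceleration Newmark scheme (β = 1/4, γ = 1/2) with a simple fixed-point iteration; the function name, the tolerance and the single-mode simplification are assumptions made here, and the paper's own implementation (modal coordinates, many modes) is not reproduced.

```python
def newmark_iterative(m, c, k0, load, x0, v0, dt, n_steps,
                      beta=0.25, gamma=0.5, tol=1e-10, max_iter=50):
    """Integrate m*a + c*v + k0*x = load(t, x, v) with Newmark's beta method.

    `load` may depend on the current displacement and velocity, so it is
    re-evaluated iteratively within each step (fixed-point iteration).
    Returns a list of (t, x, v) tuples.
    """
    x, v = x0, v0
    a = (load(0.0, x0, v0) - c * v0 - k0 * x0) / m
    k_eff = k0 + gamma * c / (beta * dt) + m / (beta * dt ** 2)
    history = [(0.0, x, v)]
    for n in range(1, n_steps + 1):
        t = n * dt
        # Contributions of the known state at the start of the step.
        hist_m = m * (x / (beta * dt ** 2) + v / (beta * dt)
                      + (0.5 / beta - 1.0) * a)
        hist_c = c * (gamma * x / (beta * dt) + (gamma / beta - 1.0) * v
                      + dt * (0.5 * gamma / beta - 1.0) * a)
        x_new, v_new, a_new = x, v, a
        for _ in range(max_iter):
            p_eff = load(t, x_new, v_new) + hist_m + hist_c
            x_next = p_eff / k_eff
            a_next = ((x_next - x) / (beta * dt ** 2) - v / (beta * dt)
                      - (0.5 / beta - 1.0) * a)
            v_next = v + dt * ((1.0 - gamma) * a + gamma * a_next)
            converged = abs(x_next - x_new) < tol
            x_new, v_new, a_new = x_next, v_next, a_next
            if converged:
                break
        x, v, a = x_new, v_new, a_new
        history.append((t, x, v))
    return history
```

A nonlinear restoring force r(x) is handled by passing `load = lambda t, x, v: F(t) - (r(x) - k0 * x)`, which mirrors the correction to the resistive force placed in the right-hand side load vector.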

2.5 Simulation of the first-order wave force 

For a given random sea state the time history of the first-order wave force at any node is
obtained by using (17). The time histories of the water particle kinematics $\dot{U}_i$ and $U_i$ are
obtained from the PSDFs of water particle kinematics using the Monte Carlo
simulation technique (8).

2.6 Response analysis

The responses of the guyed tower for the first-order and second-order wave forces and
for the combination of the two are obtained in time domain using Newmark's $\beta$ method
with iteration required at each time step to tackle the nonlinearities (Datta & Jain
1988). The equations of motion are solved in normal coordinates, thereby decoupling
them. The details of the method are available in the reference cited above.

Table 1. Details of the idealised guyed tower.

Deck height                            480 m
Mean sea level                         457 m
Height at guy node                     442 m
Guy diameter                           9 cm
Number of guys                         20
Guy system initial tension             1680 kN
EI                                     50000 GNm
Deck weight                            69000 kN
Structural weight                      3790 kN/m
Base restraint                         20 GNm/rad
Effective drag diameter                35.05 m
Effective inertial diameter            5.79 m
Cd                                     0.90
Cm                                     2.00
Structural critical damping ratio      2%
Height at node 17                      447 m
Structure's fundamental frequency      0.2689 rad/s
Structure's second frequency           1.3260 rad/s


3. Numerical results and discussion 

The details of the idealised guyed tower are given in table 1. The tower is analysed for a
random sea state with $H_{1/3}$ = 15.0 m and $T_z$ = 9.0 s. The sea state is characterised by the
one-sided Pierson-Moskowitz (P-M) spectrum and has been simulated for a duration
of about 17.5 minutes. Typical time histories of the wave envelope square $R^2(t)$ and the
momentary frequency $\dot{\tau}(t)$ for the sea state are shown in figures 4 and 5.

Figure 4. Time history of wave envelope square.

Figure 5. Time history of momentary frequency.

Figure 6 shows the variation of the drift force coefficient with encounter frequency
computed at node 17 of the tower, which is the node just above the fairlead point (very
near to the mean sea level). The nonlinearly varying drift force coefficients do not differ
significantly for different wave amplitudes up to an encounter frequency of 1.0
rad/s. Since the oscillation of the tower is predominantly guided by the low frequency
response, the range of instantaneous frequencies generally falls within 1.0 rad/s.
Therefore the values of the drift force coefficient $C_i(\dot{\tau})$ for any value of $\dot{\tau}(t)$ may be
obtained from any of the curves corresponding to wave amplitudes $\delta_a$ = 1 m, 2 m
and 3 m.

Figure 7 shows the power spectral density function (PSDF) of second order wave 
force at node 17. The PSDF has significant energy at lower frequencies, as expected. On 
the other hand, the low frequency contents of the PSDF of the first-order wave force are 
small (figure 8). 







Figure 8. PSDF of 1st order wave force (at node 17).

Figure 9. Time history of tip displacement due to 1st order wave forces.

Figure 10. Time history of tip displacement due to 2nd order wave forces.


The PSDFs of tip displacements and the corresponding time histories of response 
due to first- and second-order wave forces are shown in figures 9-12. Low frequency 
excitation of the structure at the fundamental frequency due to a second-order wave 
force is evident from the plot of PSDF shown in figure 12. It is interesting to note that 
the frequency contents of the response spectrum due to a second-order wave force are 
practically insignificant in the vicinity of predominant wave frequency. On the other 
hand, the response characteristics due to the first-order wave force are such that the 
spectrum has large frequency content at the predominant wave frequency. Low 
frequency contents of the spectrum are not large. 

Figure 13 shows the PSDF of tip displacement due to the combined first- and second-
order wave forces. The spectrum is characterised by two peaks, one occurring near the
fundamental frequency of the structure (0.269 rad/s) and the other at the predominant
wave frequency (0.7 rad/s). The first peak shows the low frequency resonating effect due
to the second-order wave force while the second peak is predominantly the effect of the
first-order wave force. Comparing figures 12 and 13, it is seen that the response peak
(the first peak in figure 13) due to the second-order wave force is reduced when first- and
second-order wave forces are considered together. This reduction is presumably caused
by nonlinear hydrodynamic damping associated with the first-order wave forces.

Figure 11. PSDF of tip displacement due to 1st order wave force.

Figure 12. PSDF of tip displacement due to 2nd order wave force.

Table 2 compares the mean square values of the tip displacements due to the two 
wave forces and their combinations. The response due to the second-order wave force 
alone is quite significant (about 26% of that due to the first-order one). If the effect of 
the second-order wave force is not taken into consideration, the overall response is 
underestimated by about 13%. The difference in the overall response, however, 
depends upon the amount of hydrodynamic damping present in the first-order wave 
force. 
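
The two percentages quoted above follow directly from the mean square values listed in table 2. A minimal Python check is given below; the variable names are illustrative, and the last quantity (the shortfall of the combined value relative to the plain sum of the two separate values) is an extra diagnostic added here rather than a figure reported in the paper.

```python
# Mean square tip displacements from table 2, in m^2.
first_order = 6.492
second_order = 1.705
combined = 7.476

# Second-order response relative to the first-order one (about 26%).
ratio_second_to_first = second_order / first_order

# Underestimate of the overall response if the second-order force is ignored (about 13%).
underestimate = (combined - first_order) / combined

# How far the combined value falls short of the simple sum of the two separate values.
shortfall = first_order + second_order - combined
```

The positive shortfall is consistent with the reduction of the low frequency peak noted when the two forces act together.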

Figure 14 shows the cumulative distribution function (CDF) of the first-order wave
force, the second-order wave force and the combination of the two at node 17. It is seen
that the CDFs of the forces deviate from the normal distribution, especially near the tail
ends. This clearly shows the effect of nonlinearities on the distribution of the
hydrodynamic forces. The CDFs of responses corresponding to different types of wave
forces are shown in figure 15. The distributions deviate considerably from the Gaussian
distribution near the ends, because of considerable nonlinearities present in the wave
forces. However, the distribution of positive peaks of the responses (figure 16) tends to
(mildly) follow the Rayleigh distribution.

Figure 13. PSDF of tip displacement due to combined 1st & 2nd order wave forces.

Table 2. Comparison of mean square values of tip displacement response for different sea states.

Sea state                       First order (m²)   Second order (m²)   Combined wave force (m²)
H_s = 15.0 m, T_z = 9.0 s       6.492              1.705               7.476

Figure 14. Distribution of wave forces (in Gaussian plot).

Figure 15. Distribution of tip displacement responses (in Gaussian plot).

Figure 16. Distribution of positive peaks of tip displacement response (in Rayleigh plot).


4. Conclusions 

A simplified approach for obtaining the second-order wave force in a random sea
state is presented for offshore guyed towers. The responses due to first- and second-
order wave forces, and their combination, are obtained in time domain considering
hydrodynamic nonlinearity and the nonlinearity present in the restoring force
provided by the guy lines. The responses of the tower are analysed to investigate their
statistical characteristics. The results of the numerical study lead to the following
conclusions.

(a) The second-order drift force has significant energy in the low frequency range.

(b) The response due to the second-order wave force alone shows a resonating effect at the
structure's fundamental frequency and could be quite significant.

(c) When the second-order wave force is combined with the first-order wave force, the
low frequency response caused by the second-order wave force is
reduced because of hydrodynamic damping associated with the first-order wave
force.

(d) Both first-order and second-order wave forces are non-Gaussian due to significant
nonlinearity present in the integrated fluid structure system.

(e) The distributions of the responses are non-Gaussian. However, the peak
distribution tends to (mildly) follow the Rayleigh distribution.


Chaos in mechanical systems - A review

P SEKAR and S NARAYANAN 

Machine Dynamics Laboratory, Department of Applied Mechanics, 
Indian Institute of Technology, Madras 600036, India 

Abstract. In this paper, a review of the various developments in the field of
chaotic dynamics with specific emphasis on chaos in structural and mechanical
systems is presented. The paper discusses some known chaotic
systems such as the Lorenz, Rössler, Ueda and Hénon attractors as well as
chaos in Duffing and Van der Pol oscillators. The paper also covers chaos
in piecewise linear systems, impacting oscillators and flow induced vibrating
systems. Topics such as bifurcations and routes to chaos, different ways
of characterising chaos, domains of attraction and control of chaos are also
discussed.

Keywords. Chaos; bifurcations; nonlinear systems; strange attractors. 


1. Introduction 

Nonlinear dynamical systems are encountered in many areas of science and engineering,
such as chemical kinetics, celestial mechanics, fluid flow problems, structural
dynamics and mechanical vibration. Because of the numerous interesting phenomena
exhibited by nonlinear systems and also because of the lack of a general method of
analysis for all nonlinear systems, the theory of nonlinear oscillations has fascinated
physicists, mathematicians and engineers for a long time.

Recent years have witnessed a significant increase in research efforts in understanding
chaotic motions of nonlinear dynamical systems in many fields of science and
engineering. Chaos may be described as bounded random-like motion exhibited by
deterministic nonlinear systems characterised by their sensitivity to initial conditions.
Not all nonlinear systems exhibit chaos, nor does chaos occur for all combinations of
system parameters and initial conditions for a given system. In fact no general
conditions have as yet been clearly established to identify chaos a priori for a given set of
parameters and initial conditions. Chaotic behaviour is represented by highly irregular
motion and the power spectrum of the associated time history is continuous. In many
cases of chaotic motion, the phase plane plot shows a complicated pattern without
being closed, and the Poincaré map, which is the stroboscopic projection of phase
trajectories sampled at equal intervals of time (usually equal to the period of excitation),
shows the existence of a strange attractor where the phase points tend to occupy certain
regions of phase space for dissipative systems. In the case of regular periodic motion of
the system the Poincaré map contains a single point for period 1 motion, two points for
period 2 motion etc. The most reliable way of characterising chaos is through the
Lyapunov exponent, which is a measure of the sensitivity of the system behaviour to
initial conditions.

Even though such irregular behaviour of nonlinear systems had been alluded to in
the pioneering works of Poincaré (1921), Birkhoff (1927), Andronov et al (1966) and
Lyapunov (1949), it is only in the last two decades that a systematic study of chaos has
been carried out, aided by the massive computing power provided by present day
computers and sophisticated graphic facilities.

In this paper, we present a review of the literature in the area of chaotic vibrations 
in nonlinear dynamical systems. As the subject has grown rapidly in the last decade, 
the literature available is very vast and covers many different systems like convective 
flows, aeroelastic systems, nonlinear electrical circuits, feedback control devices, 
mechanical systems with clearances and backlashes, rotating machines, nonlinear 
acoustic systems etc. The literature abounds in the numerical aspects of characterising
chaotic motions in terms of phase plane diagrams, Poincaré maps, time histories and
associated power spectra, Lyapunov exponents and fractal dimensions. It also covers
many of the theoretical and mathematical concepts, especially the geometric theory of 
nonlinear differential equations, which are used to predict the existence of chaos 
for a set of system parameters as well as investigate the different routes to chaos by 
use of bifurcation theory and stability analysis which also help to understand the 
phenomenon of chaos. There are papers which employ conventional methods of 
nonlinear analysis like the averaging and harmonic balance methods to approximately 
predict certain bifurcation behaviour leading to chaos. Some other works are devoted 
to experimental methods to demonstrate chaos in physical nonlinear systems. In 
addition to the vast literature available on the identification of chaos in various 
systems, control of chaos is also drawing considerable attention. 

It is a formidable task to review the entire literature on the subject in view of the sheer 
magnitude of the information available. Therefore we confine our literature survey 
mainly to those concerning chaos in mechanical and structural systems. 

Important developments in this area can be found in the books by Guckenheimer & 
Holmes (1983), Lichtenberg & Liberman (1983), Iooss & Joseph (1980), Arnold (1983), 
Kubicek & Mareck (1983), Berge et al (1984), Hao Bai-Lin (1984), Thompson & 
Stewart (1986), Moon (1992), Devaney (1987), Hsu (1987), Seydel (1988), Cvitanovic
(1989), Baker & Gollub (1990), Parker & Chua (1989), Wiggins (1990), Kapitaniak
(1991) and also in the papers by Takens (1980), Ott (1981), Holmes & Moon (1983), 
Salam et al (1983), Mees & Sparrow (1987), Grebogi et al (1987), Dowell (1988a, b), 
Holmes (1990a, 1990b), Ott et al (1990), Argyris (1991), Thompson & Soliman (1991), 
Stewart & Ueda (1991) and Thompson (1992). 

1.1 Early history 

Though the ideas, methods and rigorous mathematical theories regarding chaos have
crystallised only in the last two decades as an important discipline in many mathematical,
physical and engineering fields, Poincaré (1921) was aware of the existence of
irregular motions in nonlinear systems and their sensitivity to initial conditions even at
the beginning of this century. The foundation laid by Poincaré on the geometrical
theory of nonlinear systems was strengthened by Birkhoff (1927), who was also aware of
aperiodic motions in nonlinear systems, as were Van der Pol & Van der Mark (1927),
who had also observed random-like behaviour in experiments with nonlinear electrical
circuits.

Lorenz (1963), a student of Birkhoff, was one of the earliest to observe chaotic
behaviour in the study of atmospheric turbulence and its effects on long range weather
predictions. He considered a simplified version of the convective equation in
nondimensional form as

$$\dot{x} = \sigma(y - x), \quad \dot{y} = \rho x - y - xz, \quad \dot{z} = xy - \beta z, \tag{1}$$

where the overdot denotes differentiation with respect to nondimensional time,
$\sigma$ and $\rho$ are related to the Prandtl number and Rayleigh number respectively, and
$\beta$ is a geometric factor. Equations (1) were derived by a scheme of
modal approximations of the governing nonlinear partial differential equations
(Navier-Stokes equations), with xz and xy being the nonlinear terms. Even in this simple
form, for the parameter values $\sigma$ = 10, $\rho$ = 28, $\beta$ = 8/3, Lorenz observed chaotic behaviour.

The Lorenz equations are invariant under the symmetry $(x, y, z) \to (-x, -y, z)$ and, over
a large region of parameter space, there are three unstable stationary points: the origin
of the phase space, (0, 0, 0), and the pair $((\beta(\rho - 1))^{1/2}, (\beta(\rho - 1))^{1/2}, \rho - 1)$ and
$(-(\beta(\rho - 1))^{1/2}, -(\beta(\rho - 1))^{1/2}, \rho - 1)$, which are mapped onto each other by the symmetry. The origin
is a saddle point while the other two equilibrium points are unstable foci. The location
and the type of equilibrium points for a range of parameter values are concisely
discussed by Thompson & Stewart (1986). Much of our present understanding of
these equations and their solutions is summarized in the excellent book by Sparrow
(1982).
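
The stationary points and the sensitivity to initial conditions can be checked numerically. The Python sketch below is only an illustration: the fourth-order Runge-Kutta integrator, the step size, the initial conditions and the function names are choices made here and are not taken from the works cited.

```python
import math


def lorenz(state, sigma=10.0, rho=28.0, beta=8.0 / 3.0):
    """Right-hand side of the Lorenz equations (1)."""
    x, y, z = state
    return (sigma * (y - x), rho * x - y - x * z, x * y - beta * z)


def rk4_step(f, state, dt):
    """One classical fourth-order Runge-Kutta step for an autonomous system."""
    k1 = f(state)
    k2 = f(tuple(s + 0.5 * dt * k for s, k in zip(state, k1)))
    k3 = f(tuple(s + 0.5 * dt * k for s, k in zip(state, k2)))
    k4 = f(tuple(s + dt * k for s, k in zip(state, k3)))
    return tuple(s + dt * (a + 2.0 * b + 2.0 * c + d) / 6.0
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))


def fixed_points(rho=28.0, beta=8.0 / 3.0):
    """The origin and the symmetric pair of stationary points (valid for rho > 1)."""
    r = math.sqrt(beta * (rho - 1.0))
    return [(0.0, 0.0, 0.0), (r, r, rho - 1.0), (-r, -r, rho - 1.0)]


def separation_after(t_end, dt=0.01, eps=1e-8):
    """Distance between two trajectories started eps apart, after time t_end."""
    a = (1.0, 1.0, 1.0)
    b = (1.0 + eps, 1.0, 1.0)
    for _ in range(int(round(t_end / dt))):
        a = rk4_step(lorenz, a, dt)
        b = rk4_step(lorenz, b, dt)
    return math.dist(a, b)
```

Calling `separation_after(25.0)` shows that an initial difference of 1e-8 grows by many orders of magnitude, which is the numerical signature of the sensitivity described above.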

Originally it was hoped that these equations describe the Rayleigh-Bénard flow of
a fluid between heated plates. That no longer seems likely because of the poor
convergence properties of the modal expansion. But the equations have retained
interest for their possible prototypical value (Dowell 1988a). For the parameters specified
above, this system of equations exhibits chaotic behaviour and the corresponding
phase space plot is given in figure 1. Such behaviour is termed a strange attractor to
distinguish the motion from classic limit cycle motions, forced periodic motions as well
as fixed equilibrium points to which the system eventually comes to rest. The Lorenz
model may be considered as a self excited autonomous system exhibiting chaos and has
led to the oft repeated condition that there should be at least three variables for the
existence of chaos in an autonomous system.

Figure 1. Lorenz attractor.

1.2 Structure of the paper 

This paper is organised into eleven sections. In §2 we discuss the dynamics relating to 
some of the celebrated chaotic attractors in the literature. Section 3 discusses various 
types of nonlinearities which are of the Duffing and Van der Pol type of oscillators, 
nonlinear pendulum and systems with dry friction and associated chaotic phenomena 
encountered in mechanical systems. Section 4 is devoted to the dynamical systems 
where the nonlinearities arise due to piecewise linear functions such as those encoun¬ 
tered in impacting systems. Chaos in flow induced problems is discussed in § 5. Various 
types of bifurcations and the different routes to chaos are presented in § 6. Section 
7 deals with the control of chaos. The computational methods for obtaining periodic 
solutions are discussed in §8. In §9, various methods of obtaining the domains of 
attraction of coexistent solutions are presented. The various methods of characterisa¬ 
tion of chaos are presented in § 10. The paper ends with some of the important 
conclusions which are presented in § 11. 


2. Some other chaotic attractors 

Chaos and the various phenomena associated with nonlinear systems have been found 
to occur both in continuous time dynamical systems represented by differential 
equations as well as discrete time dynamical systems represented by maps. While in the 
review we concentrate mainly on continuous time systems we also make some 
references to maps. Some of the strange attractors other than the Lorenz attractor
which are quite extensively studied in the literature on chaos are the Rössler, Ueda,
double scroll and Hénon attractors.

2.1 Rössler attractor (spiral chaos)

Rössler (1976) introduced asymmetry into the Lorenz system and obtained equations
of the following form

$$\dot{x} = -(y + z), \quad \dot{y} = x + ay, \quad \dot{z} = b + z(x - c). \tag{2}$$

Using the equations Rössler explained the dynamics of chemical reactions in a stirred
tank. This apparently simple nonlinear autonomous system has only a single nonlinear
term xz. For negligibly small values of z and for 0 < a < 2 the system possesses an
unstable focus at the origin and hence the trajectories spiral outwards and would
eventually be attracted to infinity if the nonlinear term were absent. But as z increases the
effect of nonlinearity becomes predominant and the trajectories are confined within
a bounded attractor. The constant c acts as a threshold for switching of the nonlinear
term. For some values of the parameters a, b and c, it can happen that only a periodic
attractor is obtained. However, for substantial ranges of parameter values no such
periodic attractor exists and in these ranges all trajectories eventually resemble figure 2.
Figure 2 represents the phase plane plot for the following parameter values: a = 0.398,
b = 2, c = 4. Any two trajectories starting at nearby points on this attracting set will
diverge exponentially, which is primary evidence of chaotic motion (Thompson &
Stewart 1986).
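
The statement about the unstable focus can be verified with a short linear calculation. As a sketch, assuming z is small enough to be dropped from the first two equations of (2), the motion in the (x, y) plane is governed by

```latex
\begin{aligned}
\dot{x} &= -y, \qquad \dot{y} = x + a y
\quad\Longrightarrow\quad
\ddot{y} - a\,\dot{y} + y = 0, \\
\lambda^{2} - a\lambda + 1 &= 0
\quad\Longrightarrow\quad
\lambda = \frac{a}{2} \pm i\left(1 - \frac{a^{2}}{4}\right)^{1/2}, \qquad 0 < a < 2 .
\end{aligned}
```

The eigenvalues are complex with positive real part a/2, so trajectories near the origin spiral outwards in this plane, in agreement with the description above.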

2.2 Ueda's (Japanese) attractor

Even though Ueda obtained aperiodic solutions during an analog computer simulation
of a forced self oscillatory system as early as 1961, he could publish his results only
after he met Professor Hiromu Momota and Professor David Ruelle (Ueda 1985). It
was generally believed that multiple equilibrium points are required for a system to
exhibit chaos. But for the first time Ueda (1979) observed strange chaotic behaviour
in a single well potential problem with the origin as the only equilibrium point of centre
type. Ueda (1979, 1985) studied a nonlinear electrical circuit which is governed by the
following differential equation

$$\ddot{x} + c\dot{x} + x^3 = f\cos(t). \tag{3}$$

It has been found that for the parameter ranges f = 9.9 to 13.3 for c = 0.1 and
c = 0.0 to 0.31 for f = 12.0, the Duffing equation in the above form exhibits chaotic
behaviour. A typical strange attractor is given in figure 3; Ruelle calls this the
Japanese attractor.
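
A stroboscopic (Poincaré) section of (3) is obtained by integrating the equation and recording the state once per forcing period 2π. The Python sketch below is a minimal illustration under assumptions made here: a fixed-step fourth-order Runge-Kutta integrator, 200 steps per period and a hypothetical function name. It does not reproduce Ueda's analog computation.

```python
import math


def poincare_points(c, f, x0=0.0, v0=0.0, n_periods=100, n_transient=20,
                    steps_per_period=200):
    """Sample (x, dx/dt) of equation (3) once per forcing period.

    The first `n_transient` periods are discarded so that the recorded
    points lie close to the attractor.
    """
    dt = 2.0 * math.pi / steps_per_period

    def rhs(t, x, v):
        return v, -c * v - x ** 3 + f * math.cos(t)

    x, v, t = x0, v0, 0.0
    points = []
    for period in range(n_transient + n_periods):
        for _ in range(steps_per_period):
            k1x, k1v = rhs(t, x, v)
            k2x, k2v = rhs(t + 0.5 * dt, x + 0.5 * dt * k1x, v + 0.5 * dt * k1v)
            k3x, k3v = rhs(t + 0.5 * dt, x + 0.5 * dt * k2x, v + 0.5 * dt * k2v)
            k4x, k4v = rhs(t + dt, x + dt * k3x, v + dt * k3v)
            x += dt * (k1x + 2.0 * k2x + 2.0 * k3x + k4x) / 6.0
            v += dt * (k1v + 2.0 * k2v + 2.0 * k3v + k4v) / 6.0
            t += dt
        if period >= n_transient:
            points.append((x, v))
    return points
```

Plotting the points returned by `poincare_points(0.1, 12.0)` gives a scattered but bounded set, whereas a periodic response would collapse onto one or a few points.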

Figure 3. Ueda's attractor.

2.3 Double scroll attractor

A simple autonomous electrical circuit known as Chua's circuit (Matsumoto 1984) is
given below,

$$\dot{x} = \alpha(y - h(x)), \quad \dot{y} = x - y + z, \quad \dot{z} = -\beta y, \tag{4}$$

where h(x) is the piecewise linear function given by

$$h(x) = \begin{cases} m_1 x + (m_0 - m_1), & x > 1, \\ m_0 x, & |x| \le 1, \\ m_1 x - (m_0 - m_1), & x < -1. \end{cases}$$

The phase trajectories of this system of equations also exhibit chaotic behaviour and
have been extensively analysed. For parameters $\alpha$ = 9.0, $\beta$ = 100/7, $m_0$ = -1/7 and
$m_1$ = 2/7, (4) shows chaotic behaviour and the corresponding phase plane plots are
shown in figure 4. The existence of chaos in the double scroll attractor has been explained
using the Shilnikov theorem (Shilnikov 1970).
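
The piecewise linear characteristic is easy to get wrong by a sign, so a small Python sketch may help. It assumes the commonly used dimensionless form of Chua's equations, in which the third equation is dz/dt = -βy; the function names and default parameter values are illustrative choices.

```python
def chua_h(x, m0=-1.0 / 7.0, m1=2.0 / 7.0):
    """Piecewise linear characteristic h(x): slope m0 inside |x| <= 1, m1 outside."""
    if x > 1.0:
        return m1 * x + (m0 - m1)
    if x < -1.0:
        return m1 * x - (m0 - m1)
    return m0 * x


def chua_rhs(state, alpha=9.0, beta=100.0 / 7.0):
    """Right-hand side of the double scroll equations (assumed standard form)."""
    x, y, z = state
    return (alpha * (y - chua_h(x)), x - y + z, -beta * y)
```

With these parameter values h(x) vanishes at x = 1.5 and x = -1.5, so besides the origin the system has the two outer equilibria (1.5, 0, -1.5) and (-1.5, 0, 1.5), around which the two scrolls are organised.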



Figure 4. Double scroll attractor.


2.4 Hénon attractor

Hénon (1976) studied a two-dimensional quadratic map of the following form:

$$X_{n+1} = 1.0 - aX_n^2 + Y_n, \qquad Y_{n+1} = bX_n. \tag{5}$$

In this form the map has two fixed points, at $X = [-(1 - b) \pm ((1 - b)^2 + 4a)^{1/2}]/(2a)$,
$Y = bX$; the one with negative X is a saddle. For b = 0.3 and
'a' small the other fixed point is a sink. As 'a' increases it undergoes a sequence of period
doubling bifurcations of periods 2, 4, 8, 16, ... Ultimately this sequence of bifurcations
culminates in chaos. The mapping as given in (5) has been studied extensively and
possesses a strange attractor for a range of values of a and b. For a = 1.4 and b = 0.3 this
system of mapping equations exhibits chaos and the corresponding strange attractor is
shown in figure 5.

Figure 5. Hénon attractor.
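
The map (5) is simple enough to explore directly. The following Python sketch iterates it and computes its fixed points by setting $X_{n+1} = X_n$ and $Y_{n+1} = Y_n$ in (5), which gives the quadratic $aX^2 + (1 - b)X - 1 = 0$; the function names and the choice of starting point are illustrative assumptions.

```python
import math


def henon(x, y, a=1.4, b=0.3):
    """One application of the map (5)."""
    return 1.0 - a * x * x + y, b * x


def henon_fixed_points(a, b):
    """Fixed points of (5): roots of a*X^2 + (1 - b)*X - 1 = 0, with Y = b*X."""
    disc = math.sqrt((1.0 - b) ** 2 + 4.0 * a)
    xs = [(-(1.0 - b) + disc) / (2.0 * a), (-(1.0 - b) - disc) / (2.0 * a)]
    return [(x, b * x) for x in xs]


def iterate(x, y, n, a=1.4, b=0.3):
    """Apply the map n times and return the final point."""
    for _ in range(n):
        x, y = henon(x, y, a, b)
    return x, y
```

For a small value such as a = 0.2 with b = 0.3, an orbit started at the origin settles on the fixed point with positive X, while for a = 1.4 the iterates stay bounded but never settle, tracing out the strange attractor.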

3. Structural systems and chaos 

3.1 Structural systems and nonlinearities 

Presence of nonlinearities is essential for a dynamical system to exhibit chaotic
behaviour. Structural and mechanical systems also exhibit a certain measure of
nonlinearity in their dynamic behaviour. In most cases, when the motions are small, the
system dynamics can be adequately represented by linear models. However, modern
structural design practices such as optimum design, fully stressed and limit state design
and design for post buckling behaviour, which utilize material to its fullest load carrying
capacity, together with the use of composite materials, greater speeds of operation of
machinery, lighter constructions and extreme environments like earthquakes, heavy
storms, ocean wave turbulence and fluid flow, have led to many of these systems
operating in the nonlinear range. In these situations, the geometric or material
nonlinearities become significant and dominate system behaviour. In mechanical
systems, these nonlinearities are generally manifested in the stiffness and damping
terms and have to be taken into account in the dynamic analysis. They are usually
associated with the presence of clearance, impact, preload, dry friction, hysteresis, fluid
flow, dissimilar compression and tension behaviour etc. (Schmidt & Tondl 1986).

In structural dynamical systems, there are a number of examples which exhibit 
chaotic behaviour. Some of them are (1) oscillations of beams with nonlinear restraints 
(Moon & Shaw 1983; Gudmundson 1989), (2) forced vibrations of a buckled beam 
(Holmes 1979; Moon 1980), (3) nonlinear flutter of plates under combined axial load 
and fluid flow (Dowell 1982), (4) parametrically forced pendulum (McLaughlin 1981; 
Leven et al 1985), (5) mechanisms with dry friction (Popp & Stelter 1989; Narayanan & 
Jayaraman 1991), (6) gear trains with clearance and impacts (Kunert & Pfeiffer 1989; 
Kahraman & Singh 1990b; Sato et al 1991), (7) heat exchanger tubes subjected to 
external flow (Païdoussis & Li 1992), (8) fluid conveying pipes with large flow velocity 
and loose baffles (Païdoussis & Moon 1988a), (9) rolling element bearings with 
clearance (Mevel & Guyader 1993), (10) vehicle systems (Kaas-Petersen & True 1985; 
Meijaard & De Pater 1989; Palkovics & Venhovens 1992), (11) large amplitude 
oscillations of arches, cables, strings (Johnson & Bajaj 1989; Szemplinska-Stupnicka 
et al 1989; Molteno & Tufillaro 1990; O’Reilly & Holmes 1992; Perkins 1992; Thomsen 
1992), (12) rotor dynamics (Ehrich 1992), (13) rolling motion and capsizing of ships 
(Nayfeh & Hydeir 1986; Virgin 1987; Nayfeh & Sanchez 1988; Thompson et al 1990; 
Kan & Taguchi 1993), (14) slider crank mechanism (Peurach & Tongue 1991), (15) time 
delay systems (Plaut & Hsieh 1987; Tsuda et al 1992), (16) helicopter blade dynamics 
(Flowers & Tongue 1992), (17) structural cracks with fractal geometry (Theocaris & 
Panagiotopoulos 1993), (18) machinery acoustics (Moon & Broschart 1991). 

3.2 Chaos in Duffing type nonlinearities 

Duffing (1918) introduced a nonlinear oscillator which contains a positive cubic term. 
This equation in different forms has been used to model the dynamics of many systems 
in various fields of science and engineering. 

3.2a Chaos in buckled beam (two-well potential): The equation of motion of the 
buckled beam in its first mode under harmonic excitation, as derived by Tseng & 
Dugundji (1971), is a Duffing equation with negative linear stiffness and 
positive cubic stiffness, of the form 

\[
\ddot{x} + c\dot{x} - \alpha x + \beta x^3 = f \cos(\omega t), \qquad \alpha, \beta > 0. \tag{6}
\]

The unperturbed form of the equation has three equilibrium points: one at the 
origin of the phase space, which is an unstable saddle point, and two others placed 
symmetrically on either side of the origin at $(+(\alpha/\beta)^{1/2}, 0)$ and $(-(\alpha/\beta)^{1/2}, 0)$, 
which are centres. 
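
The classification of the three equilibria can be checked by linearising the unforced equation about each of them. The following sketch is an illustration under stated assumptions: the state is taken as $(x, \dot{x})$, the characteristic equation is $\lambda^2 + c\lambda + k = 0$ with local stiffness $k = -\alpha + 3\beta x_{eq}^2$, and the function names are hypothetical.

```python
# Minimal sketch: equilibria of the unforced two-well Duffing equation (6)
# and their linear classification. Names are illustrative assumptions.
import cmath
import math


def equilibria(alpha, beta):
    """Equilibrium displacements of (6) with f = 0."""
    r = math.sqrt(alpha / beta)
    return [-r, 0.0, r]


def linear_eigenvalues(x_eq, alpha, beta, c=0.0):
    """Roots of lambda^2 + c*lambda + k = 0, with k the local stiffness."""
    k = -alpha + 3.0 * beta * x_eq ** 2
    disc = cmath.sqrt(c * c - 4.0 * k)
    return ((-c + disc) / 2.0, (-c - disc) / 2.0)


def classify(x_eq, alpha, beta):
    """Classify an equilibrium of the undamped, unforced system."""
    lam1, lam2 = linear_eigenvalues(x_eq, alpha, beta, c=0.0)
    if abs(lam1.imag) < 1e-12 and lam1.real * lam2.real < 0.0:
        return "saddle"
    if abs(lam1.real) < 1e-12 and abs(lam1.imag) > 0.0:
        return "centre"
    return "other"


if __name__ == "__main__":
    for x_eq in equilibria(1.0, 1.0):
        print(x_eq, classify(x_eq, 1.0, 1.0))
```

With damping ($c > 0$) the two outer centres become stable foci, which is why the forced, damped beam can settle into motion about either well.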

Ever since Holmes (1979) presented his results on the dynamics of the harmonically 
excited buckled beam and showed, by analog and digital computer studies, the existence 
of chaotic and strange attractor behaviour for certain parameter values, Duffing’s 
equation in this form has become one of the most celebrated models in the study of 
chaos, as it is a simple system that brings out most of the important features of chaotic 
behaviour. Chaos in these systems has been explained in terms of the solution jumping 
erratically from phase plane orbits about one equilibrium point to orbits about the other 
equilibrium point as the magnitude of the harmonic force is increased for a given 
frequency of excitation. Chaos in harmonically excited buckled beams has also been 
demonstrated experimentally by Moon & Holmes (1979) and Moon (1980), where 
magnetic forces were used to buckle a cantilever beam and the support was excited to 
introduce the external harmonic excitation. Moon & Li (1985a) observed a fractal-looking 
boundary which is correlated with the appearance of homoclinic orbits in the Poincaré 
map. Stable and unstable manifolds and the basin boundary for such a system are given 
in figure 6. Moon & Li (1985b) calculated the dimension of the strange attractor and 
observed that higher damping results in an attractor of smaller dimension. 

Dowell & Pezeshki (1986, 1988) have shown, by means of a shell plot, the extreme 
sensitivity of the system to initial conditions as the harmonic force is increased: the 
periodic motion about the left equilibrium point changes to a periodic motion about the 
right equilibrium point even for negligibly small differences in initial conditions, with an 
attendant loss of predictability. A similar explanation has also been provided by Tongue 
(1986). Tang & Dowell (1988a) considered the effects of the higher modes on the chaotic 
motion of a buckled beam under external harmonic excitation and computed the 
threshold force required for chaotic motions. They compared their numerical results with 
experiments on a magnetically buckled beam similar to that of Moon (1980) and, with 
three modes included in their analysis, reported good qualitative agreement between 
numerical and experimental results. Hanagud et al (1989) studied the motion of a simply 
supported buckled beam subjected to base excitation. Periodic and chaotic behaviour 
were obtained as the amplitude of base excitation was varied. A case with an intermediate 
support with clearance was also investigated. 

Figure 6. Stable and unstable manifolds of the Poincaré map superimposed on the basin 
boundary (Moon & Li 1985b). 

The route to chaos in this case is through a cascade of period doubling bifurcations 
as the magnitude of the external force is varied for a given frequency. One important 
feature of this period doubling cascade is that the ratio of the difference between the 
critical values of the parameter at which successive period doubling bifurcations take 
place for one cycle to that for the next cycle approaches a universal limit equal to 4.669201. 
This universal behaviour was shown to be true by Feigenbaum (1978) in the context of 
chaos of iterative maps; the ratio is known as the Feigenbaum number, while the 
period doubling sequence is known as the Feigenbaum sequence. The period doubling 
sequence is one of the most commonly observed pre-chaotic phenomena. Feigenbaum 
analysed the noninvertible one-dimensional map $x_{n+1} = F(x_n)$ depending on a 
parameter $\lambda$. He showed that the family $F$ has an infinite sequence of period doubling 
bifurcations of periodic orbits as the parameter $\lambda$ is varied. When $\lambda$ becomes greater 
than or equal to the accumulation value $\lambda_\infty$ chaos occurs, and the bifurcation values 
$\lambda_n$ satisfy 

\[
\sigma = \lim_{n \to \infty} \sigma_n
       = \lim_{n \to \infty} \frac{\lambda_n - \lambda_{n-1}}{\lambda_{n+1} - \lambda_n}
       = 4.669201. \tag{7}
\]

It should be pointed out that because $\sigma$ is independent of the exact nature of the 
function $F$, this universal constant can also be found for physical systems governed by 
differential equations. This question was discussed by Tousi & Bajaj (1985), who studied 
weakly nonlinear, harmonically forced two-degree-of-freedom systems by both averaging 
and shooting methods (Aprille & Trick 1972). Awrejcewicz (1991) considered a similar 
equation in an attempt to identify the route to chaos in driven mechanical oscillators. 
Using the shooting method he showed four sequences of period doubling bifurcations, 
in which each new solution was stable over an interval smaller than that of the previous 
member of the sequence. At the end chaos was obtained. Further, he addressed the 
compatibility of the results with the Feigenbaum constant ($\sigma$). He also presented two 
sequences of period doubling bifurcations in which one converged to $\sigma$ while the other 
oscillated: $\sigma_n$ = 8.076; 3.305; 18.154. This kind of oscillation in the Feigenbaum 
constant was also observed by Tousi & Bajaj (1985). 
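
The convergence of the ratio in (7) can be illustrated on a simple one-dimensional map. The sketch below is an assumption-laden example rather than any of the cited calculations: it uses the quadratic map $x \to a - x^2$ (not discussed in the text), and it works with the superstable parameter values, found by Newton iteration, instead of the bifurcation values themselves; both sequences are known to share the same limiting ratio. The initial guess of 3.2 for the ratio and all names are illustrative.

```python
# Minimal sketch: estimate the Feigenbaum ratio (7) from the superstable
# parameter values of the map x -> a - x**2 (an illustrative choice of map).


def feigenbaum_ratios(max_level=8, newton_iters=10):
    """Return successive estimates sigma_n of the ratio in (7)."""
    a_prev2, a_prev1 = 0.0, 1.0   # superstable period-1 and period-2 parameters
    ratio = 3.2                   # rough starting guess for the ratio
    ratios = []
    for level in range(2, max_level + 1):
        # Extrapolate the next superstable parameter from the previous two.
        a = a_prev1 + (a_prev1 - a_prev2) / ratio
        for _ in range(newton_iters):
            # Iterate the map 2**level times from the critical point x = 0,
            # tracking dx = d(x)/d(a) alongside, then take a Newton step.
            x, dx = 0.0, 0.0
            for _ in range(2 ** level):
                dx = 1.0 - 2.0 * dx * x
                x = a - x * x
            a -= x / dx
        ratio = (a_prev1 - a_prev2) / (a - a_prev1)
        ratios.append(ratio)
        a_prev2, a_prev1 = a_prev1, a
    return ratios


if __name__ == "__main__":
    for n, sigma_n in enumerate(feigenbaum_ratios(), start=2):
        print(n, sigma_n)
```

The estimates approach 4.669 monotonically for this map, in contrast to the oscillating sequence reported for the driven oscillator above.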

Another physical system in which chaos occurs is the flutter of a buckled 
plate (Holmes 1977; Holmes & Marsden 1978; Dowell 1982). Dowell considered the 
von Kármán nonlinear plate equations under axial compressive force with fluid flow on 
one side and used a Galerkin modal expansion to get a two mode representation of the 
equation of motion, which closely resembles the two-well potential oscillator. He 
showed how the motions in the two modes interact in a subtle way to generate chaos, 
essentially through a Duffing equation mechanism, the motion in one mode providing 
excitation to the other. Brunsden et al (1989) proposed a method for predicting 
the power spectra of chaotic motions by considering them as random superpositions of 
deterministic structures, each being a single excursion around a homoclinic orbit. They 
showed good agreement between their predictions and the power spectra obtained from 
numerical simulations of the buckled beam equation and from experimental results. 
Pezeshki et al (1991) performed a bispectral analysis of a buckled beam subjected to two 
harmonic excitations; the results indicate that varying the relative phase angle of the high 
frequency term can move the system in and out of chaos. 

A criterion for the existence of chaos in two-well potential problems such as the 
buckled beam has been established in the parameter space (frequency and amplitude of 
harmonic excitation) based on the Melnikov function (Guckenheimer & Holmes 1983; 
Moon 1980), and has come to be known as the Melnikov-Holmes criterion 
(Melnikov 1963). The criterion is based on the observation that the intersection of the 
stable and unstable manifolds of the saddle point leads to chaos. Holmes (1984) has 
shown that the intersection of stable and unstable manifolds leads to a horse-shoe map. 
The Melnikov function is a measure of the distance between the stable and unstable 
manifolds in the phase space when the distance between them is small. The Melnikov 
function for (6) yields the threshold forcing amplitude (Holmes 1979), 

\[
f_M = \frac{4 c\, \alpha^{3/2}}{3 \pi \omega \sqrt{2\beta}}
      \cosh\!\left( \frac{\pi \omega}{2 \sqrt{\alpha}} \right), \tag{8}
\]

and when $f$ exceeds $f_M$ the sensitivity to initial conditions increases whereby the system 
exhibits chaotic behaviour. 
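
The threshold is easy to evaluate for given beam parameters. The sketch below assumes the standard closed form obtained from the Melnikov integral along the homoclinic orbit of (6), namely $f_M = 4c\alpha^{3/2}\cosh(\pi\omega/2\sqrt{\alpha}) / (3\pi\omega\sqrt{2\beta})$; the function name and the sample parameter values are illustrative.

```python
# Minimal sketch: Melnikov threshold forcing amplitude for the two-well
# Duffing equation (6), assuming the standard closed-form expression.
import math


def melnikov_threshold(c, alpha, beta, omega):
    """Forcing amplitude above which the saddle's manifolds can intersect."""
    prefactor = 4.0 * c * alpha ** 1.5 / (
        3.0 * math.pi * omega * math.sqrt(2.0 * beta)
    )
    return prefactor * math.cosh(math.pi * omega / (2.0 * math.sqrt(alpha)))


if __name__ == "__main__":
    # Illustrative parameter values only.
    for omega in (0.5, 1.0, 1.5, 2.0):
        print(omega, melnikov_threshold(c=0.25, alpha=1.0, beta=1.0, omega=omega))
```

The threshold is linear in the damping $c$, so lightly damped beams need only a small forcing amplitude before manifold intersections become possible.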

Dowell & Pezeshki (1986, 1988), based on physical arguments and numerical studies, 
concluded that a necessary condition for chaos to occur in such double well 
potential problems is the intersection of the stable limit cycle about the left or right 
equilibrium point with the unstable limit cycle of the saddle point. This is illustrated 
in figure 7. The forcing amplitude corresponding to the intersection of the stable and 
unstable limit cycles and the Melnikov value do not differ much. This observation by 
Dowell has been discussed critically by Szemplinska-Stupnicka (1992). Dowell (1991) 
attempted to give theoretical reasons for the results obtained in the previous 
computer studies. Dowell & Pezeshki (1988) and Dowell (1991) state that “in order 
to have steady state chaos one must have near stable and unstable limit cycles in the 
phase space...” This becomes both a necessary and a sufficient condition for chaos. The 
term “steady state chaos” used here means large amplitude cross-well motion, which can 
be viewed as irregular hopping between oscillations around the two stable equilibria. 




Figure 7. Stable and unstable limit cycles (Dowell & Pezeshki 1988). 

Szemplinska-Stupnicka (1992) argues that the “necessary and sufficient 
conditions for steady state chaos”, that is, the “criteria of chaos”, still remain a point of 
contention in current investigations, pointing out two contradictory assumptions 
considered by Dowell (1991). Rudowski & Szemplinska-Stupnicka (1987), 
Szemplinska-Stupnicka (1988a, 1989) and Szemplinska-Stupnicka & Rudowski (1992) used 
an approximate analytical method (the harmonic balance method) and obtained analytical 
expressions for various bifurcation curves and the regions of chaos in the two 
dimensional plane of excitation force and frequency. Figure 8 illustrates 
the stable and unstable manifolds before, during and after intersection and also the 
corresponding response curves. Period doubling and saddle node bifurcation 
boundaries and the regions of chaos are shown in figure 9. 

Higuchi & Dowell (1991) studied the effect of a mean load $f_0$ in addition to the 
harmonic excitation. Awrejcewicz (1991) studied the effect of a constant term in the 
external excitation using the shooting method. For $c = 0.25$, $f = 0.4$, $\alpha = 1.0$ and $\omega = 10$ 
he obtained the 1/2 subharmonic solutions for the range of mean load 
$0.4657 > f_0 > 0.0918$. For $f_0 = 0.12$ the system exhibited chaos in which the strange 
attractor formed two line attractors which are well separated. As $f_0$ is decreased from 
0.0918 and reaches 0.04657, one of the eigenvalues of the corresponding 
monodromy matrix penetrates the +1 boundary of the unit circle while the other 
remains inside. The shooting method fails for further changes in parameter value. For these 
parameter combinations the oscillator possesses a strange attractor which is the 
closure of the unstable manifolds. This possibility of the violent birth of a strange 
chaotic attractor has been shown by Neimark (Butenin et al 1976). Gottwald et al 
(1992) conducted an experiment with a slider kept on a structure with the surface of a two 
well potential excited by a harmonic force. Using this experimental mimicry, they 
constructed strange attractors for various parameter values. Baren (1994) studied 
Bernoulli-Euler, Rayleigh and Timoshenko models of a buckled beam using 
Melnikov’s method. 

Figure 8. Invariant and response curves (Holmes 1979; Szemplinska-Stupnicka & 
Rudowski 1992). 

Figure 9. Bifurcation boundaries and regions of chaos (Szemplinska-Stupnicka 
& Rudowski 1992). 

(1987) and Ide & Wiggins (1989) investigated the quasiperiodically forced 
buckled beam equation. Using Melnikov analysis, the homoclinic bifurcation sets in 
a five parameter space ($c$, $f_1$, $f_2$, $\omega_1$ and $\omega_2$) were obtained. By proper manipulation of 
the parameters the bifurcation sets were indicated in a two dimensional plane. It was 
also indicated how homoclinic bifurcation sets for an arbitrary number of frequencies 
may be obtained. Parthasarathy (1992) studied the same system with external and 
parametric excitations and obtained the bifurcation sets. Yagasaki (1990, 1991) and 
Yagasaki et al (1990) studied the motion of a buckled beam subjected to external and 
parametric quasiperiodic excitations by applying the averaging method as given by 
Holmes (1980). The existence of homoclinic orbits was proved and the homoclinic 
bifurcation curves were presented. 

Iyengar (1992) investigated a buckled beam equation with random initial conditions. 
In the regimes wherein the solution is periodic, the solutions are insensitive to at least 
small changes in initial conditions. The variance, or second central moment, of the 
ensemble response is derived with the initial conditions of the state variables 
specified as random variables with small variances. In the limit as the variances of the 
initial conditions go to zero, if the response variances also tend to zero with time, the 
solution of the original dynamical system is nonchaotic. Otherwise the solution is 
chaotic. Kunert & Pfeiffer (1991) described the strange attractor of the 
buckled beam equation in terms of the probability distribution of the state space 
variables. To approximate these probability distributions an additional stochastic 
excitation of small intensity was taken into account. This not only reduced the 
problems concerning numerical convergence of the solution but was also more realistic 
because a finite level of noise would be present anyway. With the additional noise 
excitation, the probability density could be calculated directly by solving the 
Fokker-Planck equation, provided the excitation was white noise. 

3.2b Hardening spring (single-well potential): Chaotic motions are more likely to 
occur in systems with a multiple-well potential than in systems with a single-well 
potential. Since 1979, when distinctly new types of steady state solutions called 
“randomly transitional phenomena”, or chaotic motions, were reported by Ueda (1979), 
it has become clear that our knowledge about the properties and behaviour of this class 
of systems is far from complete. While the chaotic motion in systems having three 
positions of equilibrium, studied extensively by theoretical and experimental methods, 
can be intuitively explained by physical arguments, these arguments fail in systems 
having a single-well potential (Szemplinska-Stupnicka 1986). The equation of motion of 
a single-well potential system is given as 

\[
\ddot{x} + c\dot{x} + \alpha x + \beta x^3 = f \cos(\omega t), \qquad \alpha, \beta > 0. \tag{9}
\]

Ueda (1979, 1980) considered the oscillations of a resonance circuit containing a 
saturable inductor. For parameters $c = 0.1$, $\alpha = 0.0$, $\beta = 1.0$, $f = 12.0$ and $\omega = 1.0$, (9) 
exhibits chaos. Szemplinska-Stupnicka (1987) also showed chaotic behaviour in a 
similar system and offered an explanation in terms of the instability of period three motion 
with respect to an aperiodic motion, providing a sharp route to chaos. She also 
considered the effect of the static component of excitation. The period doubling 
sequence was predicted reasonably well by approximate harmonic balance analysis, 
considering the stability of period 1 motion with respect to the 1/2 subharmonic 
motion by means of Floquet’s theory. Novak & Frehlich (1982) studied the hardening 
Duffing system by harmonic balance and stability analysis and obtained a period 
doubling route to chaos in a certain range of the forcing parameter. Sato et al (1983) 
observed global scaling properties for the bifurcations in parameter space in Duffing 
and generalised Duffing equations using numerical simulations. The scaling properties 
were discussed in terms of a one-dimensional map. Similar features can possibly be 
observed for the double-well potential problem in the parameter space diagrams given 
by Szemplinska-Stupnicka & Rudowski (1992). 
Tongue (1986) has also demonstrated the presence of chaos in a system of Ueda’s 
type. He offered the explanation that for large values of the exciting force the system 
behaves in a manner similar to a multi-well potential oscillator. The importance of the step 
length for numerical integration was also highlighted. Fang & Dowell (1987), by 
extensive numerical integration, obtained multiple steady state periodic solutions like 
period 4 and period 6 and dual periodic responses for a certain range of parameters. The 
chaotic motion also possessed a dual solution. They suggested that the 
transition to chaos is through a sequence of dual periodic responses, a period doubling 
sequence, and almost periodic motion with a large period. 

3.2c Softening spring (inverted double-well potential): The equation of motion of an 
oscillator with a softening (symmetric) type of nonlinearity is given below: 

\[
\ddot{x} + c\dot{x} + \alpha x - \beta x^3 = f \cos(\omega t), \qquad \alpha, \beta > 0. \tag{10}
\]

Huberman & Crutchfield (1979) investigated such a system and found that for 
parameters $c = 0.4$, $\alpha = 1.0$, $\beta = 4.0$, $f = 0.115$ and for a small range of $\omega$ in the region 
$0.525 < \omega < 0.53$ the system exhibited chaos. The problem was also studied by other 
authors, notably Steeb et al (1983), Turcher (1982), Raty et al (1984a, 1984b), 
Szemplinska-Stupnicka (1988b) and Van Dooren (1988). It has been observed that 
chaos occurs at frequencies in the region of the principal resonance. The route to 
chaos begins with a symmetry breaking bifurcation of a period 1 motion in the 
neighbourhood of $\omega = 0.53$, leading to two asymmetric period 1 motions that are duals 
of each other, followed by a sequence of period doubling bifurcations of the dual 
periodic motions as the forcing frequency $\omega$ is decreased. Szemplinska-Stupnicka 
(1988b) applied approximate analytical techniques (the harmonic balance method) to 
predict the symmetry breaking bifurcation and the subsequent period doubling 
bifurcations by considering the stability of periodic motions with respect to the other 
periodic motions using Floquet theory. A similar analysis has also been carried out by Van 
Dooren (1988). Nayfeh & Sanchez (1989) analysed a similar system using multiple 
time scales and analog and numerical simulations in a two parameter space consisting of 
the frequency and amplitude of excitation. The basin boundary metamorphoses were 
studied using the cell mapping method. Virgin & Cartee (1991) used multiple time scale and 
harmonic balance methods to obtain response curves and escape boundaries. 
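
The notion of escape in the softening system can be made concrete through the potential energy of (10). As a short worked example, assuming the potential $V(x) = \alpha x^2/2 - \beta x^4/4$ implied by the stiffness terms, the hilltops lie at $x = \pm(\alpha/\beta)^{1/2}$ and the barrier height is $\alpha^2/(4\beta)$. The function names are hypothetical and the default values are the parameters quoted above.

```python
# Minimal sketch: potential well of the softening oscillator (10).
# V(x) is the potential implied by the stiffness terms; names are assumptions.
import math


def potential(x, alpha=1.0, beta=4.0):
    """V(x) = alpha x^2 / 2 - beta x^4 / 4."""
    return 0.5 * alpha * x * x - 0.25 * beta * x ** 4


def hilltops(alpha=1.0, beta=4.0):
    """Unstable equilibria bounding the single central well."""
    r = math.sqrt(alpha / beta)
    return -r, r


def barrier_height(alpha=1.0, beta=4.0):
    """Energy needed to reach a hilltop from the bottom of the well."""
    return potential(hilltops(alpha, beta)[1], alpha, beta) - potential(0.0, alpha, beta)


if __name__ == "__main__":
    print("hilltops:", hilltops())
    print("barrier height:", barrier_height())
```

For $\alpha = 1.0$ and $\beta = 4.0$ the well is shallow, which is consistent with chaos and escape appearing at the small forcing amplitude quoted above.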

3.3 Chaos in pendulum (sinusoidal potential well) 

The forced pendulum is a typical nonlinear system having a sinusoidal potential well. 
The system with linear damping subjected to harmonic excitation is represented 
by the following equation: 

\[
\ddot{x} + c\dot{x} + \sin(x) = f \sin(\omega t). \tag{11}
\]

The forced pendulum is isomorphic to many other familiar nonlinear systems, such as 
Josephson junctions and the phase locked loop configuration of a voltage controlled 
oscillator. It should be noted at the outset that the seemingly simple forced pendulum 
is quite complex because the space of variables is large. 
The interplay between the driving force and the natural modes of the pendulum causes 
chaos. Huberman et al (1980) obtained the response curves for a range of normalised 
frequency and also presented the bifurcation diagram, with the left side boundary 
indicating period doubling phenomena and the right side boundary obtained from fold 
bifurcation values. This particular form of the bifurcation diagram is typical of a softening 
system. D’Humieres et al (1982) summarise the important results of their study as 
follows: (a) Prior to going chaotic the pendulum is found to break its spatial symmetry 
and oscillate with a larger amplitude to one side than the other. (b) The period doubling 
cascade is found to be a generic phenomenon for the forced pendulum. It occurs in the 
oscillating states of the pendulum and also in the rotating regime where the pendulum 
rotation frequency is phaselocked to the driving frequency. (c) Another chaotic state 
commonly observed appears to be related to random transitions between two 
phaselocked states that have become unstable. (d) A third kind of chaotic behaviour arises 
when the driving frequency is much smaller than the low amplitude resonance 
frequency of the pendulum. If the amplitude of the driving force becomes larger than 
the critical value at which the pendulum begins to rotate, the motion is a combination 
of positive and negative rotations in between which the pendulum undergoes damped 
oscillations. A two-dimensional state diagram with driving amplitude and normalized 
frequency is presented and various types of phaselocked regions and intermittent 
regions are indicated. The phaselocked regions and the frequency response curves are 
shown in figure 10. Pederson & Davidson (1981) have given a similar qualitative 
picture using numerical simulations. Although period doubling is found to be fairly 
common in the phaselocked states of the pendulum, it does not always manifest itself in 
complete bifurcation cascades. Intermittent behaviour between two unstable 
phaselocked states is also commonly observed. 

Koch & Leven (1985) investigated 
a parametrically forced pendulum and obtained the boundaries of subharmonic and 
homoclinic bifurcations on the basis of the Melnikov method and second order averaging 
methods. It is shown that as the parameter is varied, repeated resonances of successively 
higher periods occur, culminating in homoclinic orbits. For some selected parameter sets, 
the theoretical predictions are tested by numerical calculations and good agreement is 
found between them. Leven et al (1985) conducted experiments on parametrically, 
harmonically excited pendulums to study periodic and chaotic motions. 
A period doubling route to chaos is identified with increasing driving amplitude and 
decreasing damping force. The coexistence of different periodic solutions, as well as of 
periodic and chaotic solutions, is demonstrated and various transitions between them 
are studied. Transient chaos is also observed in the system. The transition from 
metastable chaos to sustained chaotic behaviour is also investigated. 

Hatwal et al 
(1983) obtained periodic and chaotic motions of a pendulum hinged to the mass of 
a harmonically excited nonlinear single degree of freedom (sdf) system. This system is 
taken as an example of an autoparametric system. They used numerical, analytical and 
experimental methods for this study. Miles (1988) investigated the motion of the 
pendulum with a single harmonic term in the solution and obtained the frequency 
response curves for various values of the amplitude of excitation. The symmetry 
breaking bifurcation boundaries are also presented in the parametric plane of damping 
and amplitude of excitation. The results are compared with those 
obtained by Bryant, who expands the solution in a truncated Fourier series, as 
given in one of the appendices of the same paper. 

3.4 Chaos in Thompson's escape equation 

Thompson (1989) constructed an asymmetric potential well and considered the 
following escape equation 

\[
\ddot{x} + c\dot{x} + x - x^2 = f \sin(\omega t). \tag{12}
\]

Figure 10. Phase locked regions and response curves (D’Humieres et al 1982). 

This model has been used to explain the capsizing of ships, offshore vessels, etc. By 
extensive numerical methods, Thompson (1989) constructed flip and fold bifurcation 
boundaries. Numerical path following techniques were used to trace the bifurcation 
curves, using the PATH package (Kaas-Petersen 1987a) and the AUTO 
software (Doedel 1986). The results are presented in three-dimensional figures 
with axes of frequency and amplitude of excitation and the 
response. Regions of chaos and escape boundaries are also given. The frequency response 
curves are given in figure 11. The boundaries of bifurcations, regions of chaos and the 
regions of escape are given in figure 12. Melnikov analysis is performed and the 
bifurcation line based on it is marked along with the numerical 
results. The route to chaos is through period doubling and also by saddle node 
bifurcations. The system undergoes chaotic oscillations before it gets attracted to 
infinity. Initial condition maps are constructed for various parameter combinations. 
The loss of engineering integrity and basin erosion are discussed. It is noted that as the 
excitation force increases and reaches the Melnikov value, the probability that the 
system gets attracted to infinity increases steadily. Virgin (1988) used the harmonic balance 
method in conjunction with stability analysis to obtain approximate boundaries of flip 
and fold bifurcations. This escape equation has been thoroughly studied by Thompson 
and his co-workers (Thompson et al 1987; Soliman & Thompson 1989; Thompson & 
Soliman 1990; Lansbury & Thompson 1990; Soliman 1993). Very recently a new 
phenomenon of indeterminate bifurcation was identified in this system and it has attracted 
a lot of attention (Thompson & Soliman 1991; Soliman 1993). 

Figure 11. Frequency response curves (Thompson & Soliman 1991). 

Figure 12. Bifurcations and escape boundaries and regions of chaos (Thompson 1989). 

Soliman & Thompson (1992a, 1992b) studied the parametrically excited Thompson’s 
equation and showed the possibility of indeterminate subcritical and transcritical 
bifurcations leading to chaos. Szemplinska-Stupnicka (1993) has analysed Thompson’s 
escape equation using the harmonic balance method and the results compare very well 
with those obtained by Thompson and his co-workers. 
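
Escape from the potential well of (12) can be detected numerically by integrating from a given initial state and checking whether the displacement runs away past the hilltop at $x = 1$. The sketch below is illustrative only: the damping $c = 0.1$, frequency $\omega = 0.85$, the escape threshold $x > 5$, the run length and the function names are assumptions chosen for the example, not values taken from the text above.

```python
# Minimal sketch: escape test for Thompson's equation (12) using fixed-step RK4.
# Parameter values, escape threshold and run length are illustrative assumptions.
import math


def escapes(f, c=0.1, omega=0.85, x0=0.0, v0=0.0,
            n_periods=16, steps_per_period=400, x_escape=5.0):
    """Return True if the trajectory leaves the well within n_periods."""

    def acc(t, x, v):
        # From (12): x'' = -c x' - x + x^2 + f sin(omega t)
        return -c * v - x + x * x + f * math.sin(omega * t)

    dt = 2.0 * math.pi / omega / steps_per_period
    x, v = x0, v0
    for i in range(n_periods * steps_per_period):
        t = i * dt
        k1x, k1v = v, acc(t, x, v)
        k2x = v + 0.5 * dt * k1v
        k2v = acc(t + 0.5 * dt, x + 0.5 * dt * k1x, v + 0.5 * dt * k1v)
        k3x = v + 0.5 * dt * k2v
        k3v = acc(t + 0.5 * dt, x + 0.5 * dt * k2x, v + 0.5 * dt * k2v)
        k4x = v + dt * k3v
        k4v = acc(t + dt, x + dt * k3x, v + dt * k3v)
        x += dt * (k1x + 2.0 * k2x + 2.0 * k3x + k4x) / 6.0
        v += dt * (k1v + 2.0 * k2v + 2.0 * k3v + k4v) / 6.0
        if x > x_escape:
            return True
    return False


if __name__ == "__main__":
    for f in (0.01, 0.05, 0.2, 1.0):
        print(f, escapes(f))
```

Repeating the test over a grid of initial states $(x_0, v_0)$ gives an initial condition map of the kind used to study basin erosion.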

3.5 Chaos in Van der Pol type nonlinearities 

The Van der Pol oscillator is another celebrated nonlinear system like Duffing’s 
oscillator. The equation of motion of the harmonically excited Van der Pol oscillator 
is given in the form 

\[
\ddot{x} + \mu (x^2 - 1)\dot{x} + x = f \sin(\omega t). \tag{13}
\]

Without the external excitation it represents a self-excited autonomous system having 
a limit cycle, and it has been used to model many oscillatory phenomena in which the 
damping is a function of displacement, such as vortex induced vibration of bluff bodies 
(Hartlen & Currie 1970; Iwan & Blevins 1974), and also systems with dry friction 
characteristics (Tondl 1985). Guckenheimer & Holmes (1983) have indicated the possibility 
of mild chaotic behaviour of this system for small $\mu$ in a narrow range of the control 
parameters $f$ and $\omega$. For large values of $\mu$, Cartwright & Littlewood (1945), Levinson 
(1949) and Hayashi (1964) showed the possibility of steady state subharmonic responses 
with two different multiples of the exciting frequency $\omega$. Levinson concluded that the 
transient response could become very complicated before the motion settled to one of the 
steady state periodic motions. Such behaviour, in the parlance of chaotic dynamics, is called 
transient chaos. But it has been observed that the forced Van der Pol oscillator does 
not exhibit steady state chaos. 
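
The self-excited limit cycle of the unforced oscillator is easy to reproduce numerically. The sketch below integrates (13) with $f = 0$ from a small initial displacement and records the peak displacement once the transient has died out; the value $\mu = 0.2$, the step size, the run length and the names are illustrative assumptions. For small $\mu$ the amplitude is known to be close to 2.

```python
# Minimal sketch: limit cycle amplitude of the unforced Van der Pol oscillator
# (13) with f = 0, using fixed-step RK4. Numerical settings are assumptions.


def vdp_amplitude(mu=0.2, dt=0.01, t_total=300.0, t_measure=50.0, x0=0.1, v0=0.0):
    """Return max |x| over the last t_measure time units of the run."""

    def rhs(x, v):
        # x' = v, v' = -mu (x^2 - 1) v - x
        return v, -mu * (x * x - 1.0) * v - x

    steps = int(t_total / dt)
    measure_from = steps - int(t_measure / dt)
    x, v = x0, v0
    amplitude = 0.0
    for i in range(steps):
        k1x, k1v = rhs(x, v)
        k2x, k2v = rhs(x + 0.5 * dt * k1x, v + 0.5 * dt * k1v)
        k3x, k3v = rhs(x + 0.5 * dt * k2x, v + 0.5 * dt * k2v)
        k4x, k4v = rhs(x + dt * k3x, v + dt * k3v)
        x += dt * (k1x + 2.0 * k2x + 2.0 * k3x + k4x) / 6.0
        v += dt * (k1v + 2.0 * k2v + 2.0 * k3v + k4v) / 6.0
        if i >= measure_from:
            amplitude = max(amplitude, abs(x))
    return amplitude


if __name__ == "__main__":
    print("limit cycle amplitude:", vdp_amplitude())
```

Starting from a large initial displacement instead should lead to the same amplitude, since the limit cycle attracts trajectories from both inside and outside.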

A combination of the Van der Pol and Duffing oscillators has also been extensively 
studied in the literature and has been shown to exhibit chaotic behaviour. Ueda & Akamatsu 
(1981) have shown that for $\mu = 0.2$, $f = 17.0$ and $\omega = 4.0$ the system possesses a strange 
attractor. Steeb & Kunick (1987) considered a general form of nonlinear 
stiffness with an $x^n$ term and showed that for $n = 1$ the system does not possess a strange 
attractor, while for $n$ = 3, 5, etc. the system exhibits chaotic behaviour. Chaos was 
confirmed by the positive value of the Lyapunov exponent and the decaying autocorrelation 
function. Steeb et al (1983) observed chaos in a self excited system. Steeb et al (1986) 
also studied a limit cycle system with two incommensurate forcing frequencies 
and confirmed the presence of chaos by computing the Lyapunov 
exponents. Awrejcewicz (1989) and Awrejcewicz & Mrozowski (1989) considered 
a similar problem describing a mechanical oscillator with the excitation term 
proportional to the square of the frequency of excitation. They also studied the effect of dry 
friction through bifurcation and stability analysis and showed chaotic behaviour for 
certain parameter ranges in the neighbourhood of a Hopf bifurcation. Holmes & Rand 
(1980) considered an autonomous Van der Pol-Duffing oscillator, obtained 
bifurcation curves and showed the type of phase plane trajectories in all the regions 
of the two dimensional parameter plot. Xu et al (1985) obtained the initial condition maps 
for a two degree of freedom coupled autonomous oscillator by using the simple cell mapping 
method (Hsu 1987). Kapitaniak & Wojewoda (1990) and Kapitaniak et al (1990) 
investigated a Van der Pol oscillator under a quasiperiodic force represented by 
$f \cos(\omega t) \cos(\Omega t)$. For a fixed value of $\Omega = 2.464$, various types of attractors are found as 
$\omega$ is varied. For $\omega = 1.848$ the Poincaré map shows a periodic attractor, for $\omega = 1.614$ 
a two frequency quasiperiodic attractor, for $\omega = 1.401$ a strange non-chaotic attractor 
and for $\omega = 1.395$ a chaotic strange attractor. The strange 
non-chaotic attractors can be quantified on the basis of the frequency spectrum, which 
usually consists of many more harmonic components. They are also characterised by 
determining the maximum Lyapunov exponent. The word strange refers to the 
geometrical structure of the attractor. An attractor which is not 

- a finite set of points 

- a limit cycle (a closed curve) 

- a smooth (piecewise smooth) surface (for example, torus) 

- a volume bounded by a piecewise smooth closed surface

is called a strange attractor. An attractor is chaotic if at least one of the Lyapunov
exponents of the system is positive. A strange non-chaotic attractor is one which is
geometrically strange, but for which typical orbits have non-positive Lyapunov exponents.

Natsiavas (1991) studied a single degree of freedom oscillator with symmetric
piecewise linear stiffness and piecewise damping of Van der Pol type (negative damping
around the mean position) and observed periodic, quasiperiodic and chaotic solutions.
Jump and beating phenomena and the coexistence of subharmonic and quasiperiodic
solutions were observed. Qin et al (1989) used analog simulation to study the driven Van
der Pol oscillator and obtained quasiperiodic responses, Farey sequences, period doubling
bifurcations and period adding sequences in the chaotic region. Brindley et al (1991) proposed
an analytical method for predicting the boundaries of the region in parameter space within
which a strange non-chaotic attractor exists for the quasiperiodically forced Van der Pol
equation. The boundaries are obtained using conditions for Hopf bifurcation and the
existence of quasiperiodic solutions. Belogortsev (1992) derived an analytical expression
for the double Poincare mapping for a weakly nonlinear quasiperiodic Van der Pol
oscillator under two periodic forcings and obtained some analytical results about the
quasiperiodic response of the oscillator. It has been shown that this mapping plays the
same significant role as the averaged equations in the theory of the periodically forced
Van der Pol oscillator. The bifurcation portrait in a two-dimensional parameter plane
consisting of the amplitude of the exciting force f and the detuning parameter is shown. The
regions of chaos are also indicated.

3.6 Chaos in dry friction-damped systems 

Vibrating systems with Coulomb or dry friction damping can be considered as
piecewise linear systems with discontinuous damping. Shaw (1986) considered the
dynamics of a linear single degree of freedom vibrating system with dry friction
damping. As the system is linear in each regime, Poincare return maps can be effectively
used to perform the bifurcation analysis. Popp & Stelter (1989) have shown, analytically
and experimentally, chaotic behaviour of single and two degree of freedom
systems under harmonic excitation with dry friction damping. Intermittency-I and
period doubling bifurcations are observed. Awrejcewicz & Mrozowski (1989) have also
considered chaotic motions of a simple mechanical system with dry friction terms.
However, in their analysis the basic system was of Duffing type. Grabec (1986)
modelled friction in cutting tools and found self-excited chaos in a four-dimensional
phase space. Feeny & Moon (1989) applied an autocorrelation function based on
symbolic dynamics to a chaotic dry friction oscillator to estimate the largest Lyapunov
exponent. The friction problem is well suited for symbolic dynamics since two states
of motion can be identified: sticking and slipping. This study is done for an experimental
oscillator and for a numerical model.

Narayanan & Jayaraman (1989a, 1991) and Jayaraman (1991) investigated the
response of a single degree of freedom spring-mass system characterised by Duffing type
stiffness and placed on a moving belt. The dynamics of the system is governed by the
following differential equation

$$\ddot{x} - \alpha(\dot{x} - v) + \beta(\dot{x} - v)^3 + \mu\,\mathrm{sgn}(\dot{x} - v) + \gamma_1 x + \gamma_3 x^3 = f \cos \omega t. \qquad (14)$$

This system is analysed for the following system parameters: α = 0.05, β = 0.02, γ_1 = 0.0,
γ_3 = 10000, μ = 0.6 and v = 1.0. The frequency of excitation ω is varied from 2 to 30. In
this range of frequencies many windows of chaos are obtained through extensive 
numerical simulations. The route to chaos is through a cascade of period doubling. The 
approximate periodic solutions are obtained using harmonic balancing and the 
frequency response plots are obtained. The stability of periodic solutions with respect 
to the subharmonics is investigated by using Floquet theory. The presence of chaos is 
confirmed by the positive Lyapunov exponent. Initial condition maps are obtained
through interpolated cell mapping approach as given by Tongue & Gu (1988a). 

3.7 Chaos in hysteretically damped systems 

It is well known that for the most important structural materials, the mechanical 
behaviour under cyclic loading at high amplitudes is inelastic. The restoring force 
versus displacement diagram in general shows hysteretic loops with decreasing slopes 
representing energy dissipation and material damage (Sozen 1974; Wen 1989). This 
behaviour can be adequately represented by Bouc’s model (Barber & Wen 1981). Yang 
et al (1991) demonstrated the possibility of chaotic vibrations in hysteretically damped
systems using numerical methods. Chaos is confirmed by the presence of a positive
Lyapunov exponent. Phase plane plots, Poincare maps, probability density functions
and power spectral density plots are also presented. Sato
et al (1989) used the point mapping method to study a similar system and observed
the jump phenomenon and chaotic behaviour.

3.8 Chaos in maps 

In addition to differential flows, a large body of literature is available for discrete maps.
The one-dimensional quadratic map x_{i+1} = a - x_i^2 also exhibits a variety of
bifurcation phenomena encountered in nonlinear systems. Period doubling as well as
crisis-type bifurcations are pointed out in this map (Grebogi et al 1982, 1983). For
a < -0.25 no fixed point of the map exists. At a = -0.25 a tangent bifurcation occurs
at which a stable and an unstable fixed point are created. It is well known that as a is
increased past -0.25, the stable fixed point undergoes period doubling which culminates
in chaos at a = 1.40095.
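
The statements above about the quadratic map can be checked numerically. The sketch below is illustrative only: it assumes the map has the form x -> a - x^2, and the helper names, initial condition and iteration counts are hypothetical choices. It finds the fixed points from x^2 + x - a = 0, evaluates their multipliers, and estimates the Lyapunov exponent as the orbit average of log|f'(x)|.

```python
import math

def fixed_points(a):
    """Fixed points of x -> a - x**2, i.e. real roots of x**2 + x - a = 0."""
    disc = 1.0 + 4.0 * a
    if disc < 0.0:
        return []  # a < -0.25: no real fixed point exists
    s = math.sqrt(disc)
    return [(-1.0 - s) / 2.0, (-1.0 + s) / 2.0]

def multiplier(x):
    """Derivative of the map at x; a fixed point is stable if |multiplier| < 1."""
    return -2.0 * x

def lyapunov(a, x0=0.1, n_transient=1000, n_iter=20000):
    """Orbit average of log|f'(x)| after discarding a transient."""
    x = x0
    for _ in range(n_transient):
        x = a - x * x
    total = 0.0
    for _ in range(n_iter):
        # guard against log(0) if the orbit lands exactly on x = 0
        total += math.log(max(abs(2.0 * x), 1e-300))
        x = a - x * x
    return total / n_iter
```

For example, at a = 0.75 the upper fixed point is x = 0.5 with multiplier -1, which is where the first period doubling sets in; a negative value of `lyapunov(a)` indicates a periodic attractor and a positive value indicates chaos.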


4. Chaos in piecewise linear/impacting systems 

Chaotic vibrations have been obtained in many mechanical systems having piecewise
linear characteristics, such as impact oscillators and systems with clearances
and backlash. Holmes (1982) has studied the vertical motion of a ball bouncing on
a massive sinusoidally vibrating table. The impact motion is analysed using a simple
difference equation satisfying a simple impact rule involving the coefficient of restitution.
Flip and fold bifurcations have been found to occur and the values of the bifurcation
parameters have been obtained. For large velocities and coefficient of restitution (r)
close to unity it has been established that chaotic motions occur. The stable and
unstable manifolds have been generated and it has been observed that [...]
prior to the occurrence of chaotic motion. The strange attractor is [...]
on the unstable manifolds of the saddle point. The horseshoes for [...]
curves are also discussed. The tangling of stable and unstable manifolds [...]
presence of horseshoe map is given in figure 13. [...]
experiments on the motion of a bouncing ball and [...]
(1983a) have studied the dynamics of a bilinear oscillator [...]
using modern dynamical systems theory (as the [...]
Figure 13. Stable and unstable manifolds (Holmes 1982).


maps are used). The flip bifurcation curves for periodic orbits of the impact
oscillator have been drawn using centre manifold calculations.
Shaw & Holmes (1983b) analysed a linear oscillator subjected to a harmonic
excitation hitting against a rigid stop with very low or zero coefficient of restitution (r).
When the value of r is zero, the mass returns with zero velocity after hitting the stop. This
facilitates study of the dynamics of the system using a one-dimensional map. The
physical model with zero coefficient of restitution and the frequency response curves for
single impact orbits are shown in figure 14. Period doubling bifurcation diagrams for
periods 1, 2, 3, 4 and 5 are given with the frequency of excitation ω as the bifurcation
parameter. Finite dissipation cases are also treated by considering the 2D map as
a singular perturbation of the 1D map.
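
The kind of difference equation used for the bouncing ball can be sketched as below. This is a hedged illustration: it assumes the dimensionless form phi_{j+1} = phi_j + v_j, v_{j+1} = alpha v_j - gamma cos(phi_j + v_j) that is commonly associated with this problem, where alpha plays the role of the coefficient of restitution and gamma is a forcing amplitude. The function names and the parameter values in the usage note are hypothetical and not taken from the cited work.

```python
import math

def bounce(phi, v, alpha, gamma):
    """One impact of the assumed map (phi: impact phase, v: departure velocity)."""
    phi_next = (phi + v) % (2.0 * math.pi)
    v_next = alpha * v - gamma * math.cos(phi + v)
    return phi_next, v_next

def period_one_fixed_point(n, alpha, gamma):
    """Fixed point whose flight lasts n table periods, or None if it does not exist."""
    c = 2.0 * math.pi * n * (alpha - 1.0) / gamma
    if abs(c) > 1.0:
        return None  # forcing too weak to sustain this periodic bounce
    return math.acos(c), 2.0 * math.pi * n

def iterate(phi, v, alpha, gamma, n_impacts):
    """Return the sequence of (phi, v) pairs over n_impacts impacts."""
    orbit = [(phi, v)]
    for _ in range(n_impacts):
        phi, v = bounce(phi, v, alpha, gamma)
        orbit.append((phi, v))
    return orbit
```

As a usage example, `period_one_fixed_point(1, 0.8, 5.0)` returns a phase and velocity that `bounce` maps back onto themselves. Note that this simplified map can produce negative velocities, which is a known limitation of the approximation rather than a physical result.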
Figure 14. The physical model and response curves of an impact oscillator with
large dissipation (Shaw & Holmes 1983b). 
Thompson et al (1983) analysed a bilinear oscillator representing the motion of an
offshore mooring tower and found that the system undergoes a series of flip bifurcations
before attaining chaos. Using Poincare return maps, Shaw (1985) analysed the
periodic motions of a linear sdf oscillator with two rigid stops placed symmetrically
about the equilibrium position under harmonic excitation. Frequency response curves
for various amplitudes of excitation (β = 1, 2, 3) and coefficients of restitution (r = 0.5, 0.7, 0.9),
for various periodic orbits (n = 1, 3, 5) in the range of exciting frequencies ω = 0 to 10,
have been presented. The frequency response curves exhibit a hardening type nonlinearity
for all orbits studied. For periodic orbits with n = 1, 3 and 5, saddle node
bifurcation curves have been drawn in the β-ω plane. Using digital simulation, homoclinic
tangling of invariant curves is observed and chaotic motions are encountered. The
chaotic motions were confirmed by the presence of a strange attractor and sensitivity to
the initial conditions.

Kotera & Yamanashi (1986) investigated the influence of the coefficient of restitution
and the coefficient of damping on the chaotic motion of an impact oscillator. Chow &
Shaw (1986) and Shaw & Rand (1989) analysed a second order linear differential
equation with negative linear stiffness representing the small motion of an inverted
pendulum impacting against rigid stops on both sides. Using Poincare return maps,
single-sided and symmetric double-sided impact motions were studied. The saddle node
and flip bifurcation curves have been given in the parameter planes β-α and β-ω. For both
types of motion Melnikov calculations have been performed to get the parameter
ranges above which one can expect intersection of invariant curves. Shaw & Shaw
(1989) performed a similar study on the motion of the above system mounted on
a linear sdf system. Bifurcation diagrams and explicit expressions for the invariant
curves are obtained. Moore & Shaw (1990) conducted experiments on normal and
inverted pendulums and compared the results with those obtained analytically. It was
observed that the inverted pendulum exhibits ten different steady state responses for
the same driving amplitude and frequency. Isomaki et al (1988) have obtained domains
of attraction of various coexisting solutions of an impact oscillator. Mahfouz & Badrakhan
(1990) used numerical methods to investigate different piecewise linear models
with setup springs and clearances. Bifurcation diagrams and the various regions of
steady state responses including chaos are presented.

Moon & Li (1990) conducted experiments on a space truss structure and observed
chaotic behaviour. In connection with the study of space truss structures, Li et al (1990)
modelled two truss members with a smooth pin and play as a zero stiffness oscillator and
analysed the system by employing a simple impact rule. For small harmonic forcing
amplitude the symmetric and asymmetric periodic solutions are obtained using
Poincare return maps. Saddle node and pitchfork bifurcations are found for both types
of solution, while flip bifurcations are found for the asymmetric periodic solutions.
Bifurcation diagrams with driving amplitude as the bifurcation parameter are given.
Using digital simulation strange attractors are obtained. Interestingly, this
work was motivated by the question of whether the source of chaos was the large
number of elements in the experimental truss or whether chaos would result from
forcing a small structure held together by pin joints. Heiman et al (1987,
1988) studied the periodic and chaotic motions of a secondary mass moving in an
inclined slot in a harmonically moving primary mass using the Poincare return
mapping. Stability boundaries for various types of motions are obtained for r = 0.1, 0.5
and 0.9. Period doubling bifurcations are studied using the return maps. Whiston
(1987) analysed the steady state vibroimpacting responses of harmonically excited
linear oscillators by using a modern dynamical systems approach allied with numerical
simulation. The homoclinic tangling of invariant curves has been presented with very
high resolution. The results are discussed in an engineering context. Whiston (1992)
discussed the non-differentiable nature of vibroimpact dynamics, which can lead to a
breakdown of the global stable manifold theorem applicable to smooth dynamical
systems. An extensive study is made of the various aspects of the stable and unstable
manifolds, as they are very important for understanding the structure of the strange
attractor and the creation of horseshoe maps which lead to chaos. Chaos is also
observed in impact oscillators designed for the purpose of vibration control (Sharif-Bakhtiar
& Shaw 1988; Shaw & Tung 1988; Karyeaclis & Caughey 1989). Natsiavas
& Babcock (1988) modelled the motion of an unanchored fluid tank as a piecewise
linear oscillator and studied it under harmonic excitation. Chaotic responses
are obtained by numerical simulation for a range of parameter values.

Sato et al (1991) investigated the motion of a gear pair, considering it as a piecewise
linear oscillator with time dependent tooth stiffness and backlash, using the point mapping
method. Using a stability analysis, the saddle node bifurcation and flip bifurcation sets
are obtained for a range of frequency and forcing amplitude parameters. Kahraman
& Singh (1990a, 1990b) studied the impact dynamics of a gear pair, considering the
system as a piecewise linear one, using digital simulation and concluded that for certain
combinations of parameters the system exhibits chaos. The dynamics of a single degree of
freedom system with clearance and continuous nonlinearities were studied by Kahraman
& Singh (1992). Kozol & Brach (1991) analysed the vertical motion of a ball
impacting against a rocking table and found that for a certain frequency range the ball
motion becomes chaotic. Grazing incidence of an impact oscillator can also cause
chaotic motion (Nordmark 1991; Ivanov 1993). Peterka & Vacik (1992) have studied
a simple impact oscillator and presented the regions in the parameter plane in which
various types of periodic motions occur. The transition from periodic to chaotic
motion in an impact oscillator is discussed. Raghothama (1993) investigated the
dynamics of a spur gear pair with clearance type nonlinearity by using both numerical
and numeric-analytical methods and obtained various types of bifurcations which lead to
chaos. Periodic solutions are obtained by the incremental harmonic balancing method,
whereas chaotic and quasiperiodic solutions are obtained by numerical simulations.
A three degree of freedom model representing a gear-bearing system is also considered.
A cam mechanism with clearance induced piecewise nonlinearity is also investigated
and the possibility of chaotic behaviour is established. Vaishali (1994) investigated a
roller bearing system with clearance and Hertzian type contact nonlinearity and
constructed a parametric space diagram indicating the regions of various types of steady
state response. Period doubling and quasiperiodic routes to chaos are identified using
numerical simulation.


5. Flow-induced vibration and chaos 

5.1 External flow 

In flow-induced vibration problems such as aeroelastic flutter, vortex-induced and
galloping oscillations, and vibrations due to fluid elastic instability, the aero/fluid dynamic
forces inducing the vibrations are almost invariably non-conservative and nonlinear
functions of the structural motion, leading to a complex and wide variety of dynamical
characteristics in coupled fluid-structure interaction problems. In addition to the
nonlinear nature of fluid forces, if the structure is inherently nonlinear and also excited
by external forces, the dynamics becomes still more complex and it is natural to expect
chaotic behaviour in such systems. Holmes & Marsden (1978) and Dowell (1982) have
shown the existence of chaos in the flutter of nonlinear plates subjected to in-plane
compressive loads and fluid flow on one side. Holmes (1977) and Holmes & Marsden
(1978) have also considered in great detail, by use of bifurcation theory and the centre
manifold theorem, the bifurcation behaviour and stability of motions corresponding to
the transition to divergence and flutter of nonlinear plates. Huang et al (1990) have studied
the dynamic response of a two degree of freedom model of offshore structures in a steady
current and regular wave environment. The hydrodynamic forces on the structure are
modelled by modified Morison equations and an additional nonlinear symmetric
hardening spring is included in the model. A period doubling route to chaos is identified
in this system. Liaw (1988) has observed a subharmonic route to chaos in the dynamical
behaviour of articulated towers subjected to hydrodynamic forces. The sensitivity to
the initial conditions is also discussed.

Chaotic motions of an autonomous two-dimensional aerofoil with cubic pitching
stiffness in incompressible flow are investigated by Zhao & Yang (1990). Phase plane
diagrams are constructed confirming chaotic behaviour and a period doubling route is
discerned. Using a two-step approximate analysis, in which the limit cycle amplitude of the
flutter motion from one of the equations is treated as an excitation to the other
oscillator, the route to chaos for the aerofoil system is presented. Simiu & Cook (1991)
have considered the chaotic motions of a harmonically forced single degree of freedom
square prism in a fluid flow and of a two degree of freedom autonomous system of two
elastically coupled square prisms in fluid flow. They have shown chaotic motions in a range of system
parameters through a quasiperiodic route. Simiu & Cook (1992) presented a number of
empirical fluid-elastic models to study the dynamical behaviour of bluff bodies.

Narayanan & Jayaraman (1993) and Jayaraman & Narayanan (1990c) considered 
a nonlinear oscillator with Duffing type of nonlinearity of the hardening type subjected 
to harmonic excitation and fluid forces. The physical model and the corresponding 
piecewise linear stiffness function are shown in figures 15 and 16. The flow-induced 
force is described by a polynomial of relative velocity between the structure and fluid 
and given by Blevins (1977). By numerical simulations the various periodic and chaotic 
solutions are obtained. Using approximate analysis (harmonic balancing in conjunction
with Floquet analysis) the period 1 and period 2 solutions are obtained (Narayanan
& Jayaraman 1993).

Figure 15. A square prism in cross flow.

A period doubling route to chaos is identified. Chaos is confirmed by
the presence of a positive Lyapunov exponent. Initial condition maps for periodic and
their dual periodic orbits are obtained by interpolated cell mapping (Jayaraman
& Narayanan 1990a). Sekar & Narayanan (1994) used the fast Galerkin method to
obtain some of the initial periodic orbits. The fast Galerkin method in conjunction with
stability analysis is used to investigate the same problem and the various periodic orbits
and their bifurcation points are predicted very accurately. Narayanan & Sekar (1994)
investigated a harmonically excited sdf oscillator subjected to fluid flow using the fast
Galerkin and numerical methods. A bilinear stiffness function is assumed (figure 16).
Period doubling, mode locking phenomenon and crises are found to occur in this 
system. Bifurcation boundaries and bifurcation diagrams with flow velocity as the 
bifurcation parameter are presented. Chaos is confirmed by the existence of positive 
Lyapunov exponent. A general theory for the analytical investigation of tube arrays in
cross flow is given by Chen (1987). Païdoussis (1987) discussed flow-induced instabilities
of cylindrical structures. Païdoussis & Li (1992) investigated the chaotic dynamics
of heat-exchanger tubes impacting on loose supports using delay differential equations.
Both ends of the tubes are assumed to be clamped while the intermediate support is
assumed to have a certain amount of clearance. Symmetric and asymmetric (x = 0.5
and 0.45) cases are studied. The model representing the physical system is shown in
figure 17. Using numerical simulation, the critical velocity of fluid flow beyond which
irregular impacting motion occurs is determined. The intermediate support is modelled
as a cubic stiffness and also as a trilinear stiffness. The impacting motion is studied by
Poincare return maps. Chaos is confirmed by positive Lyapunov exponents. Period
doubling and Hopf bifurcations are found to occur. Païdoussis et al (1993) have studied
the response of heat exchanger tubes under the excitation of an experimentally
determined nonlinear fluid force using a two degree of freedom analytical model for a
rotated triangular array of cylinders. Complex tube dynamics are predicted due to the
nonlinearities introduced by impact/sliding at the tube supports.

5.2 Internal flow 

Figure 16. Piecewise linear stiffness function.

Figure 17. A constrained pipe conveying fluid.

Numerous studies have been conducted on the dynamics of flexible tubes conveying
fluid. Chaotic motions in nonlinear pipes conveying fluid have been reported in the
recent literature. The bifurcation behaviour and the stability of motion of nonlinear
pipes conveying fluid are treated in the works of Bajaj et al (1980), Bajaj & Sethna (1982,
1984) and Thompson & Lunn (1981). The bifurcations to subharmonic and combination
harmonic types of resonances of nonlinear pipes conveying fluid with harmonic flow
velocity fluctuations are given by Sri Namachchivaya (1989) and Sri Namachchivaya &
Tein (1989). The fluid-conveying cantilevered pipe has become a classical problem for
the study of autonomous nonconservative dynamical systems because of the
simplicity with which it can be modelled analytically and the relative ease with which
reliable experiments may be conducted (Païdoussis & Li 1993).

Tang & Dowell (1988b) were perhaps the first to consider chaotic vibrations in a
cantilevered pipe conveying fluid in the presence of magnetically generated potential wells.
They used a pipe with an inset steel strip and two equispaced permanent magnets on
either side, exerting strong nonlinear forces on the pipe and buckling it into one of the two
potential wells on either side of the pipe, similar to the buckled beam experiment of Moon.
When the flow velocity is increased sufficiently above the critical value for flutter about the
buckled state, the pipe develops chaotic oscillations. The autonomous system without
external force was considered theoretically, while the nonautonomous system with an
external harmonic excitation was treated theoretically and experimentally. Once again
chaos was obtained for sufficiently high amplitudes of harmonic excitation and the
threshold value of the force was strongly dependent on the flow velocity. They also considered
the influence of damping and the number of degrees of freedom used in their model and
found that the character of the dynamics was sensitive to both. Païdoussis & Moon
(1988a, b) considered the motion of a linear cantilevered pipe conveying fluid with nonlinear
boundary conditions in the form of motion limiting constraints similar to those used
in impacting systems. They constructed phase plane diagrams and Poincare maps, and calculated
the autocorrelation and power spectral density functions and Lyapunov exponents,
confirming the chaotic behaviour. Markrides & Edelstein (1992) studied the oscillations
of a cantilevered tube conveying fluid in the presence of motion limiting constraints using
the finite element method. A quasiperiodic route to chaos was observed for increased flow
velocity and for the stiffness parameter associated with the motion limiting stops.

A detailed account of investigations on pipes conveying fluid is given in a recent
review paper by Païdoussis & Li (1993). Païdoussis et al (1992) have observed chaos in a
cantilevered pipe conveying fluid with motion limiting stops when the flow velocity is
sufficiently high. The dimensions of the chaotic attractors are calculated using the delay
embedding technique. Païdoussis et al (1991) have analysed a constrained pipe conveying
fluid using numerical simulations and compared the simulation with the experimental
results.
Jayaraman & Narayanan (1990b), Jayaraman (1991) and Jayaraman & Narayanan
(1993) investigated pipes conveying fluid, considering them as sdf and 2-dof (degree of
freedom) systems with parametric excitations, and identified chaotic behaviour. Fundamental
and parametric resonances were treated by multiple time-scale methods.
Transient chaos is also observed. Stability regions for trivial and nontrivial boundaries
and initial condition maps are obtained. A period doubling route to chaos is identified in
the system. Dual responses for periodic and chaotic motions are also presented.

6. Bifurcations and routes to chaos 

For a nonlinear dynamical system possessing chaotic behaviour, the nature of the transition
from periodic motion to chaos, as a control parameter is varied, is of importance
both in confirming its deterministic status and in anticipating chaotic
behaviour when similar prechaotic behaviour is observed. The stability of periodic solutions
can be determined by using Floquet theory (Arnold 1983). The eigenvalues of the
monodromy matrix can be obtained and, by plotting these eigenvalues relative to the unit circle
in the complex plane, one can obtain the stability information of the periodic orbit (Hsu 1974). If all the
eigenvalues (Floquet multipliers) lie inside the unit circle, the solution is
stable. The eigenvalues can leave the unit circle in three ways, which are shown in figure 18
(Arnold 1983).
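
The stability test described above can be sketched for a single degree of freedom system, whose monodromy matrix is 2 x 2. This is an illustrative sketch only: the function names and the tolerance are hypothetical, and the monodromy matrix is assumed to have been computed beforehand, for example by integrating the variational equations over one period.

```python
import cmath

def floquet_multipliers(m):
    """Eigenvalues of a 2x2 monodromy matrix given as [[a, b], [c, d]]."""
    (a, b), (c, d) = m
    tr, det = a + d, a * d - b * c
    root = cmath.sqrt(tr * tr - 4.0 * det)
    return (tr + root) / 2.0, (tr - root) / 2.0

def classify(m, tol=1e-9):
    """Report how the largest multiplier lies relative to the unit circle."""
    lam = max(floquet_multipliers(m), key=abs)
    if abs(lam) <= 1.0 + tol:
        return "stable"
    if abs(lam.imag) > tol:
        return "complex pair outside unit circle (secondary Hopf / Neimark)"
    if lam.real < 0:
        return "multiplier beyond -1 (flip / period doubling)"
    return "multiplier beyond +1 (saddle node or symmetry breaking)"
```

The three unstable labels correspond to the three ways in which a multiplier can leave the unit circle. In practice one tracks the multipliers as the control parameter is varied and locates the parameter value at which the modulus first reaches one.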

6.1 Types of bifurcations (flip and fold) 

When one of the eigenvalues leaves the unit circle through -1, the system
undergoes a period doubling bifurcation, otherwise called a flip bifurcation (associated with
intermittency type III), in which the original periodic orbit becomes unstable and a new periodic orbit
with twice the original period is generated. Successive such bifurcations result in the
familiar period doubling route to chaos, which has a quantitative universal constant
called the Feigenbaum constant (Feigenbaum 1978).

Figure 18. Nature of bifurcations and leaving of Floquet multipliers from unit
circle (Arnold 1983).

If one of the eigenvalues penetrates the +1 boundary of the unit circle, it indicates the
birth or annihilation of a periodic orbit. A saddle node bifurcation indicates that
a saddle and a node merge and the frequencies become incommensurate, leading to the birth
of a quasiperiodic solution which is generally characterised by a closed curve in the
Poincare map. Before this bifurcation the response contains combination harmonic
frequencies, indicating the mode locking phenomenon, and after the bifurcation the
frequencies become incommensurate, indicating the drifting of phase points leading
to a quasiperiodic response. It also indicates a jump phenomenon in which the orbit
jumps from a resonant to a nonresonant branch (or vice versa) of the response
curve at a point where a vertical tangency is reached, indicating an exchange of stability.
Sometimes a periodic orbit gets destroyed and a sudden explosion of the size of
the attractor occurs (intermittency-I). This kind of sudden growth of the size of the
attractor, which normally results in chaos, is called a crisis (Grebogi et al 1982, 1983a).
Kleczka et al (1989) and Bajaj (1991) discussed interior and boundary crises.

If a pair of complex conjugate eigenvalues leaves the unit circle, the bifurcation is known as a
Hopf bifurcation, whereby a fixed point bifurcates to a limit cycle and further change in
the control parameter results in a quasiperiodic orbit (associated with intermittency type II). This kind of
bifurcation is called a Neimark bifurcation (named after the Russian mathematician) or
secondary Hopf bifurcation.

A symmetry breaking bifurcation, which always precedes the period doubling
bifurcation, is indicated when one of the eigenvalues touches the +1 boundary of the
unit circle from inside and travels backwards. Symmetric solutions characterised by
odd harmonic components become asymmetric through the introduction of even
harmonic components after the symmetry breaking bifurcation. Thompson (1992) classified
generic co-dimension-one attractor bifurcations in dissipative systems and presented
them in table form.


6.2 Ruelle-Takens route to chaos - T³ torus bifurcation

An important route to chaos in mechanical systems is through quasiperiodic motions
and their subsequent instabilities. The precursor to such a route is the existence of a
Hopf bifurcation, such as occurs in flutter problems when a static equilibrium position
becomes unstable and bifurcates into a periodic limit cycle motion as a control
parameter is varied. Further Hopf bifurcations and the consequent simultaneous existence
of coupled limit cycles with incommensurate frequencies lead to quasiperiodic motions.
When the quasiperiodic motion becomes unstable, chaos occurs in a finite sequence of
bifurcations.

The theory of the quasiperiodic route to chaos with a finite sequence of Hopf bifurcations was
first proposed by Ruelle & Takens (1971) and later elaborated by Newhouse et al (1978) as
a model for turbulence. It challenged for the first time the mechanism proposed by Landau
(Landau & Lifshitz 1971), which required an infinite sequence of Hopf bifurcations for the
generation of turbulence. In fact they showed that quasiperiodic motion with three
incommensurate frequencies, resulting from three successive Hopf bifurcations,
suffices for the generation of chaos. This route is also called the torus 3 or T³
bifurcation. The route to chaos through quasiperiodic motion with three incommensurate 



558 


P Sekar and S Narayanan 


frequencies has been used to explain the behaviour of Rayleigh-Bernard convection 
(Gollub & Benson 1979). 

6.3 T^2 torus bifurcation 

Though the quasiperiodic route to chaos through torus T^3 bifurcations with a strange 
attractor is the more familiar one, it is not the only quasiperiodic route to chaos. Curry & 
Yorke (1977) and Bergé et al (1984) have discussed another quasiperiodic route to chaos, 
where chaos occurs directly from a quasiperiodic regime with only two incommensurate 
frequencies without the appearance of a third frequency. In this route, before the eventual 
departure of the trajectories from the torus T^2 and its destruction leading to chaos, 
frequency synchronisation (mode locking) takes place when the incommensurate 
frequencies in the T^2 motion become commensurate. 

The quasiperiodic route to chaos by either T^3 or T^2 torus bifurcations has been 
discussed in the literature by Curry (1978), Gollub & Benson (1980), Grebogi et al (1983b), 
Ostlund et al (1983), Fein et al (1985), Gwinn & Westervelt (1986), Sano & Sawada (1983) 
and Stavans et al (1985). The quasiperiodic route to chaos has also been discovered in the 
chaotic motion of a nonlinear spring subjected to harmonic motion (Johnson & Bajaj 1989; 
Molteno & Tufillaro 1990), in the coupled Van der Pol-Duffing oscillator with Coulomb 
damping under harmonic excitation (Awrejcewicz 1989; Awrejcewicz & Mrozowski 1989), in the 
self-excited forced autonomous square prisms considered as a two-degree of freedom 
system (Simiu & Cook 1991), and in a two-degree of freedom nonlinear oscillator with cubic 
nonlinearities in the stiffness terms subjected to harmonic excitation (Cheng 1991). 
Gear meshing problems with profile error excitation, which represent mechanical systems, 
also exhibit chaotic behaviour through the quasiperiodic route to chaos (Kahraman & Singh 
1990b; Raghothama 1993). The quasiperiodic route to chaos has also been identified in roller 
bearing systems with Hertzian type contact nonlinearity (Mevel & Guyader 1993; Vaishali 1994). 

6.4 Intermittent route to chaos 

Another route to chaos, in which long periods of periodic motion are interspersed with 
sudden bursts of chaotic motion, is called the intermittent route to chaos. As the control 
parameter is varied the chaotic bursts become more frequent and longer. Evidence of such 
intermittent chaos has been provided by Manneville & Pomeau (1980) in the 
Rayleigh-Bénard convection model for turbulence. Simiu & Cook (1991) have also observed 
intermittent chaos in their analysis of chaos of coupled square prisms. 

6.5 Transient chaos 

Sometimes chaotic motions appear for some parameter changes but eventually settle to 
a periodic or quasiperiodic attractor after a short time. This phenomenon is called transient 
chaos and has been discussed in the works of Grebogi et al (1983a). Such transient chaos is 
possible when a weak and unstable chaotic attractor coexists with a stable periodic 
attractor. While motion may begin on the chaotic attractor, the trajectory, as it wanders, 
may get locked onto the periodic attractor. Both intermittent and transient forms of chaos 
perhaps exist in a large class of nonlinear dynamical systems, but have not been reported 
widely in the literature because of their transient nature. 

7. Control of chaos 

There is a lot of evidence for the presence of chaos in nonlinear dynamical systems. On 
occasions, chaos is a beneficial feature as it enhances mixing in chemical reactions and 
provides a vigorous mechanism for transporting heat and/or mass. However, in many 
situations, chaos may be an undesirable phenomenon which may lead to irregular 
operations and cause fatigue failure in mechanical systems, temperature oscillations in 
thermal systems and increased drag in flow systems. In such situations the chaotic attractor 
may be controlled. The chaotic attractor typically has embedded within it an infinite 
number of unstable periodic orbits (Holmes 1979; Takens 1980; Ueda 1985; Auerbach et al 
1987; Lathrop & Kostelich 1989; Pawelzik 1991; Schwartz & Triandaf 1992). 

7.1 OGY method 

Making use of the above observation, Ott et al (1990) demonstrated how one can convert 
a chaotic attractor to a periodic attractor by stabilising any one of the large number of 
unstable periodic orbits that are typically embedded in the chaotic attractor. This method 
utilizes delay coordinate embedding and so is applicable to experimental situations in 
which a priori analytical knowledge of the system dynamics is not available. The approach 
is as follows: some of the unstable low-period orbits that are embedded in the chaotic 
attractor are located. These orbits are examined and one which gives improved 
performance is selected. Finally, a small time dependent parameter perturbation is made so 
as to stabilize the selected unstable periodic orbit. The Hénon map is taken as an example 
to demonstrate the control procedure. 
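
The control step can be illustrated on the Hénon map by a minimal sketch, which is not 
the implementation of Ott et al (1990). It assumes the map form x' = a - x^2 + b y, y' = x, 
the nominal values a = 1.4, b = 0.3, full knowledge of the map (no delay embedding), and 
a square capture neighbourhood of half-width 0.03 around the period-1 orbit; the names 
henon, ogy_run, X_STAR and LAM_U are illustrative. The perturbation dp of the parameter a 
is chosen so that the next iterate has no component along the unstable direction of the 
linearised map, which for this map reduces to dp = -(lam_u dx + b dy). 

```python
import math

A0, B = 1.4, 0.3  # nominal Henon parameters (assumed values)

def henon(x, y, a):
    """One iterate of the Henon map in the form x' = a - x^2 + B*y, y' = x."""
    return a - x * x + B * y, x

# Period-1 orbit (fixed point) x* = y* and its unstable eigenvalue.
X_STAR = (-(1.0 - B) + math.sqrt((1.0 - B) ** 2 + 4.0 * A0)) / 2.0
LAM_U = -X_STAR - math.sqrt(X_STAR ** 2 + B)   # root of lam^2 + 2 x* lam - B = 0

def ogy_run(n_steps=20000, radius=0.03):
    """Iterate the map, switching the control on only near the fixed point."""
    x, y = 0.0, 0.0
    max_dp = 0.0
    for _ in range(n_steps):
        dx, dy = x - X_STAR, y - X_STAR
        dp = 0.0
        if max(abs(dx), abs(dy)) < radius:
            # Remove the component of the next iterate along the unstable
            # direction: f_u . (J d + (1, 0) dp) = 0 with f_u = (1, B / LAM_U).
            dp = -(LAM_U * dx + B * dy)
        max_dp = max(max_dp, abs(dp))
        x, y = henon(x, y, A0 + dp)
    return x, y, max_dp
```

Calling ogy_run() returns the final state and the largest perturbation used. The orbit 
wanders chaotically until it enters the capture neighbourhood, after which only small 
perturbations of a hold it on the previously unstable period-1 orbit. 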

Ditto et al (1990) applied the Ott, Grebogi and Yorke (OGY) method to a periodically 
driven physical system (a magnetoelastic ribbon), converting its chaotic motion into 
period-1 and period-2 orbits and thereby highlighting the power of the method. Singer et al 
(1991) demonstrated, using both experimental and theoretical results, how low energy 
feedback control signals can be successfully used to suppress chaotic flow in a thermal 
convection loop. Using the OGY method, Hunt (1991) converted chaotic orbits of the diode 
resonator into stable orbits with periods up to 23 drive cycles long. In addition to 
stabilizing already existing low-period orbits, the method allows small alterations in the 
attractor, thus permitting previously nonexistent periodic orbits to be stabilized. It is an 
analog technique and therefore can be very fast, making it applicable to a wide variety of 
systems. 

Mehta & Henderson (1991) implemented the OGY method and showed how a chaotic 
system is able to generate a desired aperiodic orbit by making only small temporal 
perturbations to an available set of system parameters. In their method the chaotic system 
is modified by generating aperiodic orbits within some bounded region, or perhaps by 
generating even different chaotic trajectories. Tel (1991) extended the OGY method to 
control transient chaos and presented the novel features of the control process: the time 
needed to achieve control is constant, independent of the maximum perturbation, while the 
number of trajectories controlled depends on the maximum perturbation and follows a power 
law. Dressler & Nitsche (1992) and Nitsche & Dressler (1992) applied the OGY method to 
control systems whose attractor is constructed by delay coordinates. They have brought out 
the important fact that whenever delay coordinates are used, the experimental surface of 
section map depends not only on the actual parameter but also on the preceding one. In 
order to account for this dependence two modifications are introduced which lead to a better 
performance of the control. To compare their control abilities these algorithms are applied 
to simulations of the Duffing oscillator. Romeiras et al (1992) extended the OGY method, 
allowing for a more general choice of the feedback matrix and implementation in higher 
dimensional systems. Kapitaniak (1993) devised a way of analytically controlling chaos 
using the OGY method and the quantitative behaviour of the Feigenbaum period 
doubling sequence. Shinbrot (1993) discussed the merits of the OGY method and reviewed 
various other methods of controlling chaos. 


7.2 Other methods 

Other methods of controlling chaos are due to Hübler (1989), Jackson (1990), Braiman & 
Goldhirsch (1991), Bandyopadhyay et al (1992) and Chen & Dong (1992, 1993). Control of 
chaos has also been achieved by appending a linear spring mass system to the original 
chaotic system (Narayanan & Jayaraman 1989b). However, the combined system also 
exhibits chaos in a certain range of parameters. Blazejczyk et al (1993) presented 
a review on control of chaos. 


8. Computation of periodic solutions 

The analysis of nonlinear systems includes (a) location of periodic solutions, (b) stability 
of periodic solutions, (c) local analysis: that is the study of bifurcation of periodic 
solutions as the system parameters are varied, and (d) global analysis: determination 
of the domains of attraction of all periodic motions and strange attractors. This section 
presents the various methods of obtaining periodic solutions. 

8.1 Frequency domain method 

In this method, the solution is expressed as a truncated Fourier series. Since systems 
with piecewise linear or nonlinear stiffness or damping characteristics are in general 
strongly nonlinear, the conventional solution procedures such as harmonic balance, 
multiple time scales, averaging methods etc. cannot be successfully used to determine 
the steady state solutions. However, strongly nonlinear systems may be analyzed by 
numerical-analytic methods such as the Incremental Harmonic Balance method (IHB), 
the Fast Galerkin method (FG), the Alternating Frequency/Time domain method (AFT) etc. 

Cesari (1963) obtained steady state solutions of nonlinear dynamical equations by 
using a trigonometric polynomial expansion, which is equivalent to a truncated 
Fourier series. Urabe (1965) gives the mathematical justification for the trigonometric 
polynomial expansion, and Urabe & Reiter (1966) show how the Fourier coefficients 
may be computed at each iteration of the Newton-Raphson procedure 
with a discrete Fourier transform, once the nonlinear functions are evaluated in the time 
domain. Mickens (1988) obtained the bounds on the periodic coefficients of nonlinear 
differential equations. Van Dooren (1973) used Urabe's approach to obtain the 
combination harmonic of differential type of a damped two degree of freedom system 
with quadratic and cubic nonlinearities. Samoilenko & Ronto (1979) developed 
a matrix method in which the matrices relating the time and frequency domains are 
formed. These matrices can be considered as a discrete Fourier transform in matrix 
form. Tamura et al (1981) developed a matrix method to obtain higher order 
approximations for steady state solutions using a complex Fourier series expansion. The 
elements of the corresponding Jacobian matrix are obtained by taking advantage of its 
special structure. Kondou et al (1986) developed a general computer-oriented formulation 
of the harmonic balance method. The stability of periodic solutions is also considered. 
The fast Galerkin method, which uses the FFT to implement Urabe's method, was proposed 
by Ling & Wu (1987). Cameron & Griffin (1989) developed the alternating 
frequency/time domain method by implementing the FG method in an efficient 
manner. The harmonic components of the nonlinear terms are obtained by using the FFT 
while those of the linear terms are obtained by direct substitution of the equations 
obtained from the assumed solution. They have also reduced the number of nonlinear 
algebraic equations obtained by harmonic balancing to half by expressing the cosine and 
sine terms in complex form. Ling (1990) discusses the merits of the FG method. 
Buonomo (1992) has developed a trigonometric collocation method to obtain periodic 
solutions of nonlinear oscillators. The nonlinear algebraic equations in terms of the 
undetermined coefficients of the solutions are solved by the Newton-Raphson method. 
The explicit expressions for elements of the corresponding Jacobian matrix are derived. 
Cardona et al (1994) developed a multiharmonic method in conjunction with the FFT 
algorithm and the Newton-Raphson procedure which is similar to the one developed by 
Tamura et al (1981). In addition to the solution methodology, various other issues like 
initial approximate solutions, convergence, errors due to truncation of higher 
harmonics etc. are also addressed by Tamura et al (1981). 
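
The common idea behind the FFT-based variants above can be sketched as follows. This is 
a minimal illustration in the spirit of the alternating frequency/time approach, not the 
implementation of any of the cited authors. It assumes NumPy, a Duffing oscillator 
x'' + c x' + k x + beta x^3 = F cos(omega t) with illustrative parameter values, H = 5 
retained harmonics, N = 64 time samples per period and a finite-difference Jacobian in 
place of an analytical one; the names aft_residual, solve_hb and fundamental_amplitude 
are hypothetical. 

```python
import numpy as np

def aft_residual(z, H, N, omega, c, k, beta, F):
    """Harmonic balance residual; the cubic term is evaluated in the time domain."""
    coef = np.zeros(N // 2 + 1, dtype=complex)      # x(t) = sum_k coef_k e^{i k w t}
    coef[0] = z[0]
    coef[1:H + 1] = z[1::2] + 1j * z[2::2]
    x = np.fft.irfft(coef * N, n=N)                 # samples over one period
    nl = np.fft.rfft(beta * x ** 3) / N             # harmonics of the nonlinear force
    kk = np.arange(H + 1)
    lin = (k - (kk * omega) ** 2 + 1j * kk * omega * c) * coef[:H + 1]
    R = lin + nl[:H + 1]
    R[1] -= F / 2.0                                 # F cos(wt) = (F/2) e^{iwt} + c.c.
    out = np.empty(2 * H + 1)
    out[0] = R[0].real
    out[1::2] = R[1:].real
    out[2::2] = R[1:].imag
    return out

def solve_hb(H=5, N=64, omega=1.2, c=0.2, k=1.0, beta=0.1, F=0.3, tol=1e-10):
    """Newton-Raphson iteration on the Fourier coefficients, starting from zero."""
    args = (H, N, omega, c, k, beta, F)
    z = np.zeros(2 * H + 1)
    h = 1e-7                                        # finite-difference step
    for _ in range(50):
        r = aft_residual(z, *args)
        if np.linalg.norm(r) < tol:
            break
        J = np.empty((z.size, z.size))
        for j in range(z.size):
            zp = z.copy()
            zp[j] += h
            J[:, j] = (aft_residual(zp, *args) - r) / h
        z = z - np.linalg.solve(J, r)
    return z, np.linalg.norm(aft_residual(z, *args))

def fundamental_amplitude(z):
    """Amplitude of the first harmonic, 2 |coef_1|."""
    return 2.0 * abs(complex(z[1], z[2]))
```

With beta = 0 the scheme reproduces the linear steady state amplitude 
F / |k - omega^2 + i omega c|. With the hardening cubic term and an excitation frequency 
above the linear natural frequency, the computed fundamental amplitude is somewhat larger. 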

Maezawa (1961) and Maezawa & Furukawa (1973) obtained harmonic and superharmonic 
responses of an sdf oscillator with bilinear and trilinear stiffness subjected to 
external harmonic excitation using the Fourier series expansion. Maezawa et al (1980) 
obtained the steady state solutions of an sdf system with bilinear restoring force, 
subjected to a general periodic excitation by expanding the solution in a Fourier series. 
In their analysis the duration of contact of the mass with the spring of higher stiffness is 
assumed and the corresponding steady state solution is obtained. Stability of periodic 
solutions is investigated using Hill’s determinant approach. 

The IHB method is capable of accommodating multiple harmonic components and 
thereby handles strong nonlinearities without much difficulty. It is ideally suited for 
parametric studies since it can step from a known state of vibration to a neighbouring 
state which corresponds to an incremental change in one of the governing parameters 
of the system (Lau et al 1981, 1982). Lau et al (1983) developed a procedure to obtain 
the combination harmonic solutions with little modification of the IHB method. Pierre & 
Dowell (1985) studied the dynamic instability of plates with geometrical nonlinearities 
using the IHB method. A few special algorithms have been given for tracing the stability 
boundaries, especially for the linear viscous damping case. Pierre et al (1985) obtained 
frequency response curves for sdf and 2 dof dry friction nonlinear systems by 
modifying the original IHB method. As the dry friction elements introduce a signum 
function in the equation of motion, the evaluation of derivatives and the Galerkin 
integrals (in which the Dirac delta function appears) causes considerable difficulties. The 
integration is reduced to a summation and is evaluated by finding the zeros of 
the function. The effect of the number of harmonics on the accuracy of the solution is 
discussed. Ferri (1986) shows that the FG method and the IHB method are theoretically 
the same, since the Galerkin approximation and incrementation are employed in 
both the methods. In the IHB method, the incrementation process is followed by the 
Galerkin approximation while the reverse is true in the case of the FG method. This 
observation had earlier been made by Neuman & Sen (1972). Wei & Pierre (1989) used 
a Galerkin-Newton-Raphson procedure to solve dry friction damped nearly 
periodic structures. Wong et al (1991) extended the IHB method to bilinear stiffness 
systems and obtained periodic, subharmonic and superharmonic responses and 
compared their results with those obtained by Maezawa et al (1980). This procedure 
appears to be more involved as compared to the FG method since the integral 
evaluation requires a separate routine to find the zeros of a function involving the 
Heaviside step function. It appears that the concept developed and used by Pierre et al 
(1985) to evaluate the integrals arising from damping nonlinearity is applied by Wong et al 
(1991) for the case of stiffness nonlinearity. It has also been extended to a general 
piecewise linear stiffness function with multiple restraining springs (Lau & 
Zhang 1992). 

Choi & Noah (1988) employed a multi-harmonic balancing procedure to solve an 
unsymmetrical piecewise linear sdf system. Kim & Noah (1991) proposed a modified 
harmonic balance/Fourier transform approach to determine the steady state periodic 
solutions and applied the same to an unsymmetrical piecewise linear single degree of 
freedom system subjected to harmonic excitation. This procedure of obtaining the 
approximate periodic solutions can be considered as an efficient implementation of the 
method proposed by Urabe & Reiter (1966). But the estimation of error involved in the 
approximate solutions is not considered in their analysis. The stability of the solutions 
is investigated by constructing the monodromy matrix of the periodic system. Bouc & 
Defilippi (1987) applied Urabe’s method for multidimensional random vibration 
problems for stationary solutions. Leung & Fung (1989a) constructed bifurcation 
curves and the regions of chaos for a Duffing’s oscillator with zero linear stiffness by 
using the IHB method and compared them with those obtained by Ueda (1985). 
Parametric diagrams indicating the regions of periodic and chaotic solutions are 
constructed for a Duffing oscillator with positive and negative linear stiffnesses. Chaos 
is confirmed when all periodic solutions become unstable. The elements of the 
corresponding Jacobian matrix (or the tangent matrix) are obtained in a special way 
which facilitates the parameter unfolding. Cheung et al (1990) presented the 
formulation of the IHB method for multi-degree of freedom systems with cubic nonlinearity. 
An incremental arc length approach combined with a cubic extrapolation technique is 
adopted to trace the response curves automatically. 

Lau & Yuen (1991) used the IHB method to obtain limit cycle solutions of single degree 
of freedom and 2 degrees of freedom nonlinear autonomous self-excited systems. In 
these cases a large number of harmonics (some tens) have to be assumed while expressing 
the periodic solutions because the periodicity is not known exactly. Choi & Lou (1991) 
developed a new iterative procedure for obtaining the steady state periodic solutions of 
a piecewise nonlinear single degree of freedom oscillator. This method iteratively utilizes 
the FFT procedure and harmonic balancing to change the nonlinear algebraic equations 
obtained through harmonic balancing to a sequence of linear algebraic equations, 
thereby getting exact linearisation. During each iteration the constant coefficient of the 
solution is adjusted in such a way as to make the constant coefficient of the nonlinear 
stiffness function zero. The stability boundaries are determined by constructing 
Hill's determinant for the linearised system. The frequency domain method has been 
used to obtain the steady state vibration of beams, arches etc. in conjunction with the 
finite element method (Leung & Fung 1989, 1990a, 1990b; Lewandowski 1992). Leung 
(1989) used the same procedure to construct the amplitude-frequency relation 
(backbone curve) of the nonlinear natural vibration of a beam. Oravsky (1990) has developed 
a matrix method to evaluate the multiharmonic components of periodic solutions by 
expressing the set of equations of motion in first order form. The Floquet theory is used 
to obtain the stability of periodic solutions. Kanarachos & Spentzas (1992) developed 
a method based on the Galerkin approximation of nonlinear differential equations by 
expressing the solution in a truncated Fourier series form. The resulting nonlinear 
algebraic equations are solved by Powell's minimisation method. Atadan & Huseyin 
(1984), Huseyin & Lin (1991) and Huseyin & Wang (1991) developed an intrinsic harmonic 
balancing method which utilises the advantages of both the multiple time scale and 
harmonic balance methods. The power of the method in both the determination of 
periodic solutions and bifurcation analysis is demonstrated in a series of 
papers for both autonomous and non-autonomous systems with parametric and external 
excitations. However, non-analytical nonlinearities cannot be treated in the present form. 

The IHB method introduces considerable difficulties because the evaluation of the 
integrals arising out of the Galerkin process is problem dependent. To avoid these 
difficulties Narayanan & Sekar (1993, 1994) developed a fast incremental harmonic 
balance method in which the integrals are identified as Fourier coefficients and are 
calculated using the FFT. The corresponding elements of the Jacobian matrix (tangent 
matrix) required by the Newton-Raphson method are also obtained using the FFT. Hence, 
this method is suitable for problems having both analytical and nonanalytical 
nonlinearities. The method is also demonstrated for systems containing deviating 
arguments. The response of systems with multiple harmonic excitations, which could be 
incommensurate, is also dealt with. 

8.2 Time domain method 

This method requires a numerical integration procedure. The differential equations are 
reduced to first order form and the solution may be obtained by Runge-Kutta, 
Adams-Moulton or similar methods (Parker & Chua 1989). Tongue (1987a) and Reinhall 
et al (1989) have discussed the implications of the choice of integration step length 
for the solution of nonlinear differential equations. 

8.2a Brute force method/point mapping method: In this method, the differential 
equations governing the dynamics of the system are integrated using any one of the 
numerical integration procedures. Some of these methods are the Runge-Kutta method, 
Adams-Bashforth method, Adams-Moulton method and Gear method. The state is 
mapped successively over one period of excitation, using as initial conditions the state 
obtained in the previous mapping step. The periodicity is checked once the solution is 
attracted to some attractor (Flashner & Hsu 1983). 
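
A minimal sketch of the brute force procedure is given below. It is an illustration under 
stated assumptions rather than the scheme of Flashner & Hsu (1983): the system is a 
harmonically forced Duffing oscillator with illustrative parameter values, the point 
mapping is the map over one forcing period computed by a classical fourth order 
Runge-Kutta scheme, and the period is detected by comparing successive mapped points 
after a transient. The names duffing_rhs, point_map and detect_period are hypothetical. 

```python
import math

def duffing_rhs(t, x, v, c=0.2, k=1.0, beta=0.1, F=0.3, omega=1.2):
    """x'' + c x' + k x + beta x^3 = F cos(omega t) in first order form."""
    return v, -c * v - k * x - beta * x ** 3 + F * math.cos(omega * t)

def rk4_step(t, x, v, h):
    k1x, k1v = duffing_rhs(t, x, v)
    k2x, k2v = duffing_rhs(t + h / 2, x + h / 2 * k1x, v + h / 2 * k1v)
    k3x, k3v = duffing_rhs(t + h / 2, x + h / 2 * k2x, v + h / 2 * k2v)
    k4x, k4v = duffing_rhs(t + h, x + h * k3x, v + h * k3v)
    return (x + h / 6 * (k1x + 2 * k2x + 2 * k3x + k4x),
            v + h / 6 * (k1v + 2 * k2v + 2 * k3v + k4v))

def point_map(x, v, omega=1.2, steps=200):
    """Map the state over one forcing period T = 2 pi / omega."""
    h = 2 * math.pi / omega / steps
    for i in range(steps):
        x, v = rk4_step(i * h, x, v, h)
    return x, v

def detect_period(x0, v0, n_transient=150, max_period=8, tol=1e-8):
    """Discard a transient, then find the smallest p with P^p(z) = z (within tol)."""
    x, v = x0, v0
    for _ in range(n_transient):
        x, v = point_map(x, v)
    ref = (x, v)
    for p in range(1, max_period + 1):
        x, v = point_map(x, v)
        if math.hypot(x - ref[0], v - ref[1]) < tol:
            return p
    return None   # no periodicity found up to max_period
```

For the weakly nonlinear parameter values assumed here detect_period(0.0, 0.0) returns 1, 
a period-1 response. A return value of None would indicate a longer period, a 
quasiperiodic motion or chaos. 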

8.2b Shooting method: In this method the differential equations are integrated with 
the specified initial conditions, along with the linearised equations with the unit matrix 
as the initial condition, over one assumed period of the solution. The corrections (shoots) 
are obtained from the solution of the linearised equations and are added to the current 
initial conditions to get the new initial conditions. This procedure is continued till the 
solution converges to a periodic orbit. Unstable solutions can also be obtained. Aprille & 
Trick (1972) initially proposed such a method and it has been applied to many problems 
(Chua & Lin 1975). Modifications of the shooting method may be found in Schwartz 
(1983), Meijaard (1991) and Foale & Thompson (1991). Tsuda et al (1992) used the 
shooting method for equations with deviating argument. 
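
The shooting iteration described above can be sketched for a single degree of freedom 
system as follows. This is a minimal illustration, not the algorithm of Aprille & Trick 
(1972) in detail; the forced Duffing oscillator, its parameter values, the Runge-Kutta 
step count and the function names are assumptions. The state and the 2 x 2 variational 
matrix M are integrated together over one period, and the correction d to the initial 
condition solves (I - M) d = phi(z0) - z0. 

```python
import cmath
import math

C, K, BETA, F, OMEGA = 0.2, 1.0, 0.1, 0.3, 1.2   # assumed parameter values

def rhs(t, s):
    """State (x, v) together with the variational matrix M' = A(t) M."""
    x, v, m11, m12, m21, m22 = s
    a = -(K + 3.0 * BETA * x * x)                 # d(acceleration)/dx
    return [v,
            -C * v - K * x - BETA * x ** 3 + F * math.cos(OMEGA * t),
            m21, m22,
            a * m11 - C * m21, a * m12 - C * m22]

def flow(x0, v0, steps=400):
    """Integrate over one period with RK4, starting from M = I."""
    h = 2.0 * math.pi / OMEGA / steps
    s = [x0, v0, 1.0, 0.0, 0.0, 1.0]
    for i in range(steps):
        t = i * h
        k1 = rhs(t, s)
        k2 = rhs(t + h / 2, [a + h / 2 * b for a, b in zip(s, k1)])
        k3 = rhs(t + h / 2, [a + h / 2 * b for a, b in zip(s, k2)])
        k4 = rhs(t + h, [a + h * b for a, b in zip(s, k3)])
        s = [a + h / 6 * (p + 2 * q + 2 * r + w)
             for a, p, q, r, w in zip(s, k1, k2, k3, k4)]
    return s

def shoot(x0=0.0, v0=0.0, tol=1e-12, max_iter=20):
    """Newton iteration on the initial condition of the periodic orbit."""
    for _ in range(max_iter):
        xT, vT, m11, m12, m21, m22 = flow(x0, v0)
        rx, rv = xT - x0, vT - v0
        if math.hypot(rx, rv) < tol:
            break
        a11, a12, a21, a22 = 1.0 - m11, -m12, -m21, 1.0 - m22
        det = a11 * a22 - a12 * a21
        x0 += (a22 * rx - a12 * rv) / det          # solve (I - M) d = r
        v0 += (a11 * rv - a21 * rx) / det
    return x0, v0, (m11, m12, m21, m22)

def floquet_multipliers(M):
    """Eigenvalues of the 2 x 2 monodromy matrix."""
    m11, m12, m21, m22 = M
    tr, det = m11 + m22, m11 * m22 - m12 * m21
    disc = cmath.sqrt(tr * tr - 4.0 * det)
    return (tr + disc) / 2.0, (tr - disc) / 2.0
```

The matrix M at convergence is the monodromy matrix, so its eigenvalues give the stability 
of the orbit at no extra cost. Because the iteration does not rely on the orbit being 
attracting, the same procedure can in principle converge to unstable periodic solutions 
when started sufficiently close to them. 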

Kaas-Petersen (1987b) presented an algorithm to get periodic and biperiodic 
solutions using the Poincaré map. Stability analysis and continuation of solutions 
with a change in the control parameter are also incorporated. Kleczka et al (1990) and 
Kleczka et al (1991) demonstrate a methodology for the analysis of local bifurcations with 
codimension one (expression for Poincaré map with one system parameter) of nonlinear 
dynamical systems using a combination of analytical and numerical methods. Bifurcation 
analysis is performed with the use of center manifold theory and symbolic manipulation of 
variables. Any of the general purpose computer algebra programs like MACSYMA, 
MATHEMATICA, MAPLE or REDUCE can be used for the symbolic computation. 
Bursal & Tongue (1992) developed a hybrid symbolic-numerical method for 
integrating ordinary differential equations based on Taylor's series concept. 

9. Domains of attraction 

Unlike linear systems, nonlinear systems can exhibit multiple solutions for the same set 
of system parameters. The type of solution to which the system eventually gets 
attracted depends on the initial conditions. The set of initial conditions which leads the 
system to a particular attractor is called the domain of attraction. There are instances in 
which periodic and chaotic solutions coexist. Hence, delineation of all the domains of 
attraction is very important to understand the system behaviour. In this section we 
review the various methods of obtaining the domains of attraction of all steady state 
solutions for a given set of system parameters of a dynamical system. The first approach 
is the direct way of getting the domains of attraction by sweeping the whole region of 
interest of the phase space by numerical simulation. Analog computers are also used to 
determine the domains of attraction (Bajkowski & Szemplinska-Stupnicka 1975; 
Tondl 1985). 

9.1 Invariant curves method 

The second approach involves backward mapping of all the points of the boundary of 
a sufficiently small region around a stable periodic attractor. The region is constructed 
such that the forward mapping of all the points in the boundary is contained within the 
region. By repeatedly applying the backward mapping it is possible to get larger and 
larger domains of attraction (Levinson 1944; Hayashi et al 1969; Hsu 1977; Hsu et al 
1977). The point mapping method or shooting method may also be used to obtain the 
periodic solutions. Flashner & Guttalu (1988) used a combined point 
mapping/stability analysis procedure to locate the periodic orbits and also to determine 
the domains of attraction. 

The third approach is based on the concept that the boundary of the domain of 
attraction is nothing but the stable manifold of the saddle point for particular system 
parameters. If the stable manifold (sometimes referred to as the ω branch, W^s branch or 
inset) and the unstable manifold (sometimes referred to as the α branch, W^u branch or 
outset) of the saddle point intersect, the domain of attraction becomes fractal. 
Once the periodic solution is obtained its stability analysis can be performed. If the 
eigenvalues of the stability matrix are less than unity in modulus then the solution is 
a stable one. If the absolute value of one of the eigenvalues is more than one while that 
of the other is less than one then the point is a repeller (saddle). In certain 
circumstances (if the initial condition lies on the unstable manifold itself) the saddle 
point may be located by backward integration. It may be mentioned here that the eigenvector 
directions coincide with the stable and unstable manifolds as they approach the saddle 
point. A very small straight line segment can be taken near the saddle point along the 
eigenvector direction such that the two ends of the segment are an arbitrarily selected 
initial condition and its mapped image. For the stable manifold we 
must use backward mapping while forward mapping should be used for the unstable 
manifold. A large set of points can be taken along the segment. For simplicity these 
points can be spaced linearly (Troger 1979; Thompson et al 1987; Foale & 
Thompson 1991). This procedure is called the ladder of starts. Ling 
(1986) selected an arbitrary length (normally less than 0.01) for the line segment and 
arranged the points such that the distances between neighbouring points form 
a geometric series. These points can be mapped repeatedly to get the invariant curves in 
the entire region of interest. The initial line segment should be short enough (there is no 
general procedure to select the starting point as this depends on the problem of interest) 
to avoid a deviation from the real invariant manifold, so it is sometimes not sufficient to 
map it only once. In such situations a spline extrapolation may be used to arrange the 
mapped points to lie on the manifolds. Alexander (1989) developed a computer 
oriented automatic procedure to produce the invariant manifolds. A modified Powell 
hybrid procedure is used to locate the saddle point. Higher order saddles are also dealt 
with, and pseudocode is given. Li & Moon (1990) use the method of the ladder of 
starts to draw the invariant manifolds of a three-well potential problem. First the saddle 
point of the unperturbed system is located. To locate the saddle point of the 
perturbed system, the perturbed system is linearised about the unperturbed saddle 
point and the resulting linearised system is integrated to obtain the deviation from the 
original saddle point. 
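
The ladder of starts can be sketched for one branch of an unstable manifold as follows. 
This is a minimal illustration with linearly spaced points, not the code of any of the 
cited authors. The system assumed is the unforced, damped double-well Duffing oscillator 
x'' + C x' - x + x^3 = 0, whose saddle is at the origin; the mapping is the flow over 
a fixed time T computed by a fourth order Runge-Kutta scheme; and the values of C, the 
offset eps and the number of points are illustrative. 

```python
import math

C = 0.5  # damping coefficient (assumed value)

def rhs(x, v):
    """Unforced double-well Duffing oscillator in first order form."""
    return v, -C * v + x - x ** 3

def time_map(x, v, T=1.0, steps=50):
    """Flow over a fixed time T (RK4), used here as the point mapping."""
    h = T / steps
    for _ in range(steps):
        k1x, k1v = rhs(x, v)
        k2x, k2v = rhs(x + h / 2 * k1x, v + h / 2 * k1v)
        k3x, k3v = rhs(x + h / 2 * k2x, v + h / 2 * k2v)
        k4x, k4v = rhs(x + h * k3x, v + h * k3v)
        x += h / 6 * (k1x + 2 * k2x + 2 * k3x + k4x)
        v += h / 6 * (k1v + 2 * k2v + 2 * k3v + k4v)
    return x, v

def unstable_branch(eps=1e-3, n_starts=20, n_maps=60):
    """Trace the right-hand branch of the unstable manifold of the saddle (0, 0)."""
    mu = (-C + math.sqrt(C * C + 4.0)) / 2.0   # unstable eigenvalue of [[0, 1], [1, -C]]
    q0 = (eps, eps * mu)                       # point on the eigenvector (1, mu)
    q1 = time_map(*q0)                         # its image closes the first segment
    ladder = [(q0[0] + s / n_starts * (q1[0] - q0[0]),
               q0[1] + s / n_starts * (q1[1] - q0[1])) for s in range(n_starts)]
    curve = list(ladder)
    for _ in range(n_maps):                    # forward mapping for the unstable manifold
        ladder = [time_map(x, v) for x, v in ladder]
        curve.extend(ladder)
    return curve, ladder
```

The list curve approximates the manifold branch, which in this damped example spirals 
into the stable equilibrium at (1, 0). For the stable manifold the same construction 
would be applied with the backward mapping and the stable eigenvector. 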

9.2 Cell mapping methods 

Nonlinear dynamical systems can exhibit hysteresis (multiple solutions) for 
certain combinations of parameters. Hence, it is essential to obtain the various 
attractors and the corresponding domains of attraction. Cell mapping methods can be 
used for this purpose. An excellent account of the cell mapping procedures is presented 
in the book by Hsu (1987). 

9.2a Simple cell mapping (SCM): In cell mapping the concept of complete time 
trajectories is discarded and the state space is considered as a collection of cells. Each 
cell is identified by its cell vector. In SCM all the phase space within any given cell is 
considered to map to the same target cell (Hsu 1980). Once the mapping is obtained the 
cells may be mapped repeatedly till they are attracted to a particular attractor. An 
unravelling algorithm is given in Hsu & Guttalu (1980). As the state space is discretised 
into a finite number of cells one may get some spurious attractors depending upon the 
size of the cells. This difficulty is solved to some extent in the generalised cell mapping 
method. Kreuzer (1985) analysed chaotic systems using the cell mapping approach. Xu 
et al (1985) obtained domains of attraction of 2-degree of freedom self-excited nonlinear 
systems, which required an enormous amount of computing power. This problem is solved 
in a different way to obtain the domains of attraction by Levitas et al (1992). In their 
approach the n-dimensional initial condition space is reduced to n cross-section spaces of 
dimension (n - 1) and these are further discretised into a large number 
of cells. The mapping of each cell is obtained by noting the terminal cell of the trajectory 
starting at the centre of each cell after the trajectory has made a specified number of 
intersections with the (n - 1) dimensional cross-sectional spaces. Using SCM, domains of 
attraction are obtained and compared with those obtained by Xu et al (1985). Hénon's 
method may be used for getting exact return maps (Hénon 1982). Pezeshki & Dowell 
(1987) obtained initial condition maps for the buckled beam by numerical integration. 
Lam et al (1982) identify the spurious attractors and overcome this problem by 
integrating the original set of differential equations with the attractors obtained by 
SCM as initial conditions. They have proposed a parallel processing approach for 
SCM. This procedure appears to be more efficient than reducing the cell size. 
Nordmark (1993) has developed a computer-oriented interactive program to obtain 
the domains of attraction in which he reduces the cell size only where it is really required. 
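
A minimal sketch of simple cell mapping with a simplified unravelling step is given 
below. It is not the algorithm of Hsu & Guttalu (1980), which also records periodicity 
and step numbers. The assumptions are: the unforced, damped double-well Duffing 
oscillator x'' + C x' - x + x^3 = 0, a 41 x 41 grid on [-2.05, 2.05]^2, a mapping time 
T = 8 integrated by a fourth order Runge-Kutta scheme, and a single sink cell for 
trajectories leaving the region. All names are illustrative. 

```python
import math

C = 0.5  # damping coefficient (assumed value)

def flow(x, v, T=8.0, steps=160):
    """RK4 integration of x'' + C x' - x + x^3 = 0 over the mapping time T."""
    h = T / steps
    f = lambda x, v: (v, -C * v + x - x ** 3)
    for _ in range(steps):
        k1x, k1v = f(x, v)
        k2x, k2v = f(x + h / 2 * k1x, v + h / 2 * k1v)
        k3x, k3v = f(x + h / 2 * k2x, v + h / 2 * k2v)
        k4x, k4v = f(x + h * k3x, v + h * k3v)
        x += h / 6 * (k1x + 2 * k2x + 2 * k3x + k4x)
        v += h / 6 * (k1v + 2 * k2v + 2 * k3v + k4v)
    return x, v

def build_scm(n=41, lo=-2.05, hi=2.05):
    """Image cell of every cell centre; index n*n is the sink cell."""
    h = (hi - lo) / n
    cmap = []
    for j in range(n):                      # velocity index
        for i in range(n):                  # displacement index
            x, v = flow(lo + (i + 0.5) * h, lo + (j + 0.5) * h)
            ii, jj = math.floor((x - lo) / h), math.floor((v - lo) / h)
            inside = 0 <= ii < n and 0 <= jj < n
            cmap.append(jj * n + ii if inside else n * n)
    return cmap

def unravel(cmap):
    """Assign a group number to each cell and collect the periodic cells."""
    n_cells = len(cmap)
    group = [None] * n_cells + [0]          # the sink cell forms group 0
    periodic, n_groups = set(), 0
    for start in range(n_cells):
        path, on_path, z = [], set(), start
        while group[z] is None and z not in on_path:
            path.append(z)
            on_path.add(z)
            z = cmap[z]
        if group[z] is None:                # the chain closed on itself: new cycle
            n_groups += 1
            g = n_groups
            periodic.update(path[path.index(z):])
        else:                               # the chain ran into a processed cell
            g = group[z]
        for cell in path:
            group[cell] = g
    return group, periodic, n_groups
```

The cells sharing the group number of the cell containing (1, 0) approximate the domain 
of attraction of that equilibrium. The cell containing the saddle at the origin also 
appears as a periodic cell, which illustrates why periodic groups found by SCM have to be 
checked against the original differential equations. 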

9.2b Generalized cell mapping (GCM): In GCM a given cell has a finite number 
of target cells which are found by integrating a finite number of initial conditions 
within the initial cell and observing in which target cells they terminate. Based on the 
distribution of the points that map to a given target cell, a probabilistic mapping is 
constructed which allows one to study the steady state attractors of the system and their 
domains of attraction via Markov chain theory (Hsu 1982; Hsu et al 1982). The 
GCM has been used to calculate the maximum Lyapunov exponent (Kim & Hsu 1986). 
Bestle & Kreuzer (1986) developed an efficient algorithm for solving systems with 
a large number of cells using GCM. 
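
The construction of the transition probabilities can be illustrated on a one-dimensional 
example. This is a minimal sketch with assumed ingredients: a hypothetical bistable map 
f(x) = x + 0.4 x (1 - x^2) with attracting fixed points at x = -1 and x = +1, 31 cells on 
[-1.5, 1.5] and 10 uniformly spaced sample points per cell. The function names are 
illustrative. 

```python
def f(x):
    """Hypothetical bistable map with stable fixed points at x = -1 and x = +1."""
    return x + 0.4 * x * (1.0 - x * x)

def gcm_matrix(f, lo=-1.5, hi=1.5, n_cells=31, n_samples=10):
    """Transition matrix P[j][i] = probability of going from cell i to cell j."""
    h = (hi - lo) / n_cells
    P = [[0.0] * n_cells for _ in range(n_cells)]
    for i in range(n_cells):
        for k in range(n_samples):
            x = lo + (i + (k + 0.5) / n_samples) * h   # sample point inside cell i
            j = int((f(x) - lo) / h)
            j = min(max(j, 0), n_cells - 1)            # clip to the cell space
            P[j][i] += 1.0 / n_samples
    return P

def evolve(P, p, n_steps):
    """Propagate a cell probability vector through the Markov chain."""
    n = len(p)
    for _ in range(n_steps):
        p = [sum(P[j][i] * p[i] for i in range(n)) for j in range(n)]
    return p
```

Starting with all the probability in the cell that contains the unstable fixed point 
x = 0 and evolving the chain, the probability is eventually shared equally between the two 
absorbing cells that contain x = -1 and x = +1. This reflects the fact that the starting 
cell straddles the boundary between the two domains of attraction. 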

9.2c Interpolated cell mapping (ICM): In SCM and GCM only a finite number of 
terminal trajectories are possible. To relax this constraint, Tongue (1987b) and Tongue & 
Gu (1988a) developed an interpolated cell mapping approach in which the exact 
location of the trajectory is obtained using linear interpolation of the neighbouring cell 
centres. This method is found to be very efficient in the determination of the domains of 
attraction. Tongue & Gu (1988b) presented the theoretical basis for the validity of the 
interpolated cell mapping approach. Jayaraman & Narayanan (1990a) have used this 
method to obtain the domains of attraction of the Duffing oscillator with Coulomb 
friction and of a square prism kept in a fluid flow. Tongue & Gu (1988c) presented 
a second order interpolated cell mapping procedure which improved the results of the 
linear interpolated cell mapping procedure. Tongue & Smith (1989) have demonstrated 
how Lyapunov exponents may be evaluated using the interpolated cell mapping 
procedure. 

9.2d Fuzzy cell mapping (FCM): Sun & Hsu (1990) analysed a first order nonlinear
system by assuming that a parameter of the system is described by a fuzzy set, in the
sense of Zadeh (1965).

The other methods of obtaining the domains of attraction are due to Varosi et al
(1987), Schichtel & Beckmann (1991), and Eschenazi et al (1989). Varosi et al (1987)
discretise the entire phase space by a large number of simplices of appropriate size and,
by using barycentric coordinates, obtain a piecewise linear approximation for the Poincare
maps. Eschenazi et al (1989) use relative rotation rates of the periodic orbits
with respect to other orbits, thereby locating the repellers, and trace the invariant
manifolds and the basin boundary. The idea of relative rotation rates of periodic orbits
is also used to study the bifurcational precedences (McRobie 1992). Schichtel & Beckmann
(1991) use Poincare return maps to locate the repellers and obtain the domain of
attraction by constructing invariant manifolds.

10. Characterisation of chaos 

Chaotic behaviour can be confirmed in many different ways, depending upon the problem
and situation. Both qualitative and quantitative methods are available to characterise
it. Qualitative indicators of chaos include unclosed phase plane plots, the presence of a
strange attractor in the Poincare map, a continuous power spectrum, an irregular time
history, the intersection of stable and unstable manifolds, a fractal basin boundary and
a Feigenbaum sequence of bifurcations. Quantitatively, chaos is mostly confirmed by the
presence of a positive Lyapunov exponent and a fractional dimension of the strange attractor.

10.1 Lyapunov exponents 

The computation of Lyapunov exponents is very similar to the Floquet theory for the linear
stability of a periodic motion, and has the same inspiration as the linear analysis of
motion in the vicinity of a trajectory. While the Floquet theory is concerned with
the motion after one period of a closed orbit, the computation of Lyapunov
exponents carries no such restriction. Lyapunov exponents are the average exponential
rates of divergence or convergence of neighbouring trajectories in phase space.
The presence of at least one positive Lyapunov exponent indicates chaos, with the
magnitude of the exponent reflecting the time scale on which the system dynamics becomes
unpredictable. Negative Lyapunov exponents signify that neighbouring trajectories stay
close, and hence periodic motion (Shaw 1981; Eckmann & Ruelle 1985).

Shimada & Nagashima (1979) developed a numerical procedure to calculate k-dimensional
Lyapunov exponents using Oseledec's multiplicative ergodic theorem
(Oseledec 1968). The nonlinear equations are integrated with the specified initial
conditions along with the linearised (perturbed) equations, with the unit matrix as the
initial condition for the latter. To avoid the overflow error which arises mainly due to
the stretching and folding of phase space volume, Shimada & Nagashima (1979) applied the
Gram-Schmidt reorthonormalisation (GSR) procedure to the vectors of the linearised flow of the
system. Kim & Hsu (1986) obtained the largest Lyapunov exponent by generalised cell
mapping. Wolf et al (1985) presented a computer code to generate Lyapunov exponents
using the procedure given by Shimada and Nagashima. They also presented
another method, and the corresponding code, for evaluating the Lyapunov exponents
from a time history obtained from experimental measurements. The sum of the
Lyapunov exponents is the time averaged divergence of the phase velocity; hence any
dissipative dynamical system will have at least one negative exponent, and the sum of all the
exponents is negative, which indicates the boundedness of the solution. In a three
dimensional continuous dissipative dynamical system, the only possible spectra, and
the attractors they describe, are as follows: (+, 0, -), a strange attractor; (0, 0, -), a
two-torus; (0, -, -), a limit cycle; and (-, -, -), a fixed point (Wolf et al 1985). The
Lyapunov dimension $d_L$ is calculated as follows (Farmer et al 1983; Frederickson et al 1983),

$$ d_L = j + \frac{\sum_{i=1}^{j} \lambda_i}{|\lambda_{j+1}|}, $$

where $\lambda_1$ is the largest Lyapunov exponent, the exponents are arranged in decreasing
order, and $j$ is defined by

$$ \sum_{i=1}^{j} \lambda_i > 0, \qquad \sum_{i=1}^{j+1} \lambda_i < 0. $$

The relation between Lyapunov exponents and the material coordinates are discussed 
by Berger & Rokni (1987) and Rokni & Berger (1991). 
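
The reorthonormalisation procedure described in this section can be sketched on a small example. The sketch below is a discrete-time analogue, not the flow algorithm of Shimada & Nagashima: it assumes the Henon map (with the commonly used parameters a = 1.4, b = 0.3) in place of integrated differential equations, propagates two tangent vectors with the Jacobian, and applies Gram-Schmidt at every iteration. The function names are hypothetical. The Lyapunov dimension is evaluated with the non-strict condition on the partial sums, which is an assumption made so that borderline spectra are handled.

```python
import math


def henon_lyapunov_spectrum(a=1.4, b=0.3, n_steps=20000, n_transient=1000):
    """Both Lyapunov exponents of the Henon map via Gram-Schmidt."""
    x, y = 0.0, 0.0
    for _ in range(n_transient):          # let the orbit settle on the attractor
        x, y = 1.0 - a * x * x + y, b * x
    u, v = (1.0, 0.0), (0.0, 1.0)         # orthonormal tangent vectors
    log_sum_1 = log_sum_2 = 0.0
    for _ in range(n_steps):
        # Jacobian of the map at the current point.
        j11, j12, j21, j22 = -2.0 * a * x, 1.0, b, 0.0
        u = (j11 * u[0] + j12 * u[1], j21 * u[0] + j22 * u[1])
        v = (j11 * v[0] + j12 * v[1], j21 * v[0] + j22 * v[1])
        # Gram-Schmidt: normalise u, remove its component from v, normalise v.
        norm_u = math.hypot(u[0], u[1])
        u = (u[0] / norm_u, u[1] / norm_u)
        proj = v[0] * u[0] + v[1] * u[1]
        v = (v[0] - proj * u[0], v[1] - proj * u[1])
        norm_v = math.hypot(v[0], v[1])
        v = (v[0] / norm_v, v[1] / norm_v)
        log_sum_1 += math.log(norm_u)
        log_sum_2 += math.log(norm_v)
        x, y = 1.0 - a * x * x + y, b * x
    return [log_sum_1 / n_steps, log_sum_2 / n_steps]


def lyapunov_dimension(exponents):
    """Lyapunov dimension from a list of exponents (any order)."""
    lam = sorted(exponents, reverse=True)
    partial = 0.0
    for j, value in enumerate(lam):
        if partial + value < 0.0:
            return j + partial / abs(value)
        partial += value
    return float(len(lam))                # partial sums never become negative


spectrum = henon_lyapunov_spectrum()
dimension = lyapunov_dimension(spectrum)
```

Because the determinant of the Jacobian is constant for this map, the two exponents must add up to ln(b), which gives a simple consistency check on the computed values.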


10.2 Dimension 

The definitions and the calculations of various dimensions are discussed in Moon
(1992). These include the correlation, capacity, information
and Lyapunov dimensions. The correlation dimension was developed by Grassberger &
Procaccia (1983a, 1983b) and it has been the most widely applied dimension measure,
largely because of the ease with which it can be computed. Efficient box counting
methods are also available to calculate the correlation dimension (Liebovitch & Toth
1989). Attractors can be reconstructed using delay coordinates (Packard et al 1980).
Takens (1980) has given a sound mathematical foundation for the delay embedding
procedure. Broomhead & King (1986) have given a detailed procedure to select the
optimum delay time and the embedding dimension.
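
The two ingredients mentioned above, delay reconstruction and the correlation sum, can be sketched as follows. This is a minimal illustration with a direct pair count, not one of the efficient algorithms cited; the function names are hypothetical. The correlation dimension is then estimated as the slope of log C(r) against log r over the range of small r where this slope is roughly constant.

```python
import math


def delay_embed(series, dim, tau):
    """Delay vectors (x[i], x[i+tau], ..., x[i+(dim-1)*tau]) from a scalar series."""
    n_vectors = len(series) - (dim - 1) * tau
    return [tuple(series[i + k * tau] for k in range(dim))
            for i in range(n_vectors)]


def correlation_sum(points, r):
    """Fraction of distinct pairs of points closer than r (Euclidean distance)."""
    n = len(points)
    close = 0
    for i in range(n):
        for j in range(i + 1, n):
            if math.dist(points[i], points[j]) < r:
                close += 1
    return 2.0 * close / (n * (n - 1))


# Usage: embed a short series in three dimensions with a delay of two samples.
vectors = delay_embed([0, 1, 2, 3, 4, 5], dim=3, tau=2)
```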


11. Conclusions 

Different mechanical systems which can exhibit chaotic behaviour are reviewed. 
Discussion focusses on typical mechanical systems which are generally described by 
piecewise linear differential equations like impacting or clearance systems, systems 
consisting of dry friction type damping elements, pipes conveying pulsating fluid, and 
systems which are exposed to fluid flow. Work on typical nonlinear models like 
a two-well potential system which is applicable to two stable state systems like buckled 
beam is reviewed. Models that are relevant for ship dynamics, which are characterised 
by potential well with escape, are given. Other models like pendulum, Duffing and 
Van der Pol type systems are also included. The different types of bifurcations and 
the routes to chaos which normally occur in various systems are presented. A short
review of the literature available on the various ways of controlling chaos is also
given.

The development of various methods of obtaining the periodic orbits and their
relevance for the bifurcation analysis are extensively reviewed, especially for the frequency
domain method. Cell mapping methods for initial condition maps and the construction
of invariant curves are also discussed. Methods of characterising chaos, such as
Lyapunov exponents and the calculation of the dimension of attractors, are also included.

Some aspects of chaotic and stochastic dynamics for
structural systems

N SRI NAMACHCHIVAYA, M M DOYLE and N MALHOTRA

Nonlinear Systems Group, Department of Aeronautical and Astronautical 
Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 
61801, USA 

Abstract. In this paper, the bifurcation behaviour of an externally excited
four-dimensional nonlinear system is examined. Throughout this paper,
a two-degree-of-freedom shallow arch structure under either a periodic or
a stochastic excitation will be considered. For the case when the excitation
is periodic, the local and global behaviour is examined in the presence of
principal subharmonic resonance and 1:2 internal resonance. The method of
averaging is used to obtain the first order approximation of the response of
the system under resonant conditions. A standard Melnikov type perturbation
method is used to show analytically that the system may exhibit
chaotic dynamics in the sense of Smale horseshoe for the 1:2 internal
resonance case in the absence of dissipation. In the case of stochastic
excitation, the stability of the stationary solution is examined by determining
the maximal Lyapunov exponent and moment Lyapunov exponent in
terms of system parameters. An asymptotic method is used to obtain
explicit expressions for various exponents in the presence of weak dissipation
and noise intensity. These quantities provide almost-sure stability
boundaries in parameter space. When the system parameters lie outside
these boundaries, it is essential to understand the nonlinear behaviour. The
method of stochastic averaging is applied to obtain a set of approximate Ito
equations which are then examined to describe the local bifurcation
behaviour.

Keywords. Nonlinear dynamics; Melnikov analysis; Lyapunov exponent; 
global stochastic bifurcations; arch structures. 


1. Introduction

The study of nonlinear dynamics is of great interest to engineers due to the wide 
range of applicability of nonlinear phenomena, such as existence of limit cycles, 
hysteretic effects, period-doubling bifurcations and complex dynamics, in mechanical 
and structural systems. This paper deals with the study of such nonlinear effects in 
structural problems which arise mainly due to geometric nonlinearities. The local
bifurcation analysis of nonlinear structural/mechanical systems is well understood
when the input to the system is deterministic. However, the global dynamics of 
these problems, including possible chaotic behaviour, is the subject of much of 
the current research. There is a great deal of literature on the computational 
results that have appeared in the study of chaotic dynamics of both structural and 
mechanical systems. However, these methods do not provide a great insight into 
the nature of the complicated dynamics that is present in these nonlinear systems. 
The perturbative Melnikov method presented in this paper detects such complicated 
dynamics due to global bifurcations. A detailed discussion of the higher dimensional 
Melnikov analysis is given by Wiggins (1988). In the first part of this paper, both 
the local and global behaviour of deterministic systems are considered. As an 
application, the dynamics of a shallow arch subjected to time-dependent loading are 
examined. 

In addition to deterministic excitations, practical engineering systems generally 
experience random loading conditions due to environmental disturbances, inherent 
noise, unmodelled dynamics, etc. Various methods are available for evaluating 
the stability and local bifurcation behaviour of these stochastic systems. However, 
global techniques for analysing such nonlinear random dynamical systems are 
not so well developed. There is a great deal of research in progress in the mathematical 
aspects of global analysis and the limited results available are not yet applicable 
to engineering problems. Thus, only stability and local stochastic bifurcation 
results are presented here. Most of the analysis in this paper is based on the general 
results of the authors on Lyapunov exponents, moment Lyapunov exponents and 
stochastic bifurcations. These results are applied to a certain class of structural 
problems. 

In §2, a brief discussion of the physical problem, a shallow arch subjected to
time-dependent loading, is presented along with various assumptions used in obtaining
a set of Lagrangian equations of motion. Section 3 contains results on global
bifurcations in the presence of 1:2 internal and subharmonic resonance. The stability of the
stationary solutions under the influence of random excitation is studied in §4, in which
results on Lyapunov exponents and moment Lyapunov exponents are presented.
Finally, §4.2 contains results on stochastic bifurcations.


2. Equations of motion 

Shallow arch structures subjected to various load conditions have been investigated by
many authors in the past. Dynamic behaviour of arch structures under impulsive
loading was examined by Mettler (1962), Lock (1965), Humphrey (1966) and Hsu
(1969). Recently Sri Namachchivaya & Doyle (1988) and Tien et al (1994) studied
the bifurcation behaviour and chaotic dynamics of a shallow arch subjected to small
amplitude periodic excitation. In this study the dynamic response of a shallow arch is
investigated. The shallow arch is assumed to be subjected to a lateral loading $p(x, t)$, as
shown in figure 1.

The equation of motion governing the lateral deflection $w(x, t)$ of this arch can be
derived using the energy method, and is given as:

$$ m w_{tt} + \beta w_t + EI\,w_{xxxx} - EA\,h(w,t)\,(w_{xx} + \bar{w}_{xx}) = p(x,t), \tag{1} $$

Figure 1. Geometry of the shallow arch structure subjected to lateral loading $p(x, t)$.

where 


h(w,t) = (1//) 



V 

u(l, t) + 

(W x + w xx w xx )dx 

- 

0 


$$ \bar{w}(x) = -q_0 \sin(\pi x/l), $$

$$ p(x,t) = (p_0 + f(t)) \sin(\pi x/l). $$

Here, $q_0$ is the initial rise parameter, $p_0$ is the static loading and $f(t)$ is the time-dependent
part of the lateral loading. The spatial function $\bar{w}(x)$ represents the initial deflection of
the unloaded arch. The arch is subjected to the following boundary conditions:

$$ w = 0, \quad w_{xx} = 0 \quad \text{at } x = 0, l. $$

The Galerkin method is used to reduce the equation of motion to a set of ordinary
differential equations by selecting appropriate shape functions. The transverse motion
$w(x, t)$ of the arch is approximated by the following expression:

$$ w(x,t) = q_1(t)\sin(\pi x/l) + q_2(t)\sin(2\pi x/l). \tag{2} $$

The non-dimensional equations of motion describing the evolution of the amplitudes
of the two fundamental modes can be written in $(Q_1 = q_1 - q_0,\ Q_2 = q_2)$ coordinates as

$$
\begin{aligned}
\ddot{Q}_1 + \beta_1\dot{Q}_1 + Q_1 + Q_1(Q_1^2 - q_0^2 + 4Q_2^2) + (q_0 - \lambda_0) &= f(t), \\
\ddot{Q}_2 + \beta_2\dot{Q}_2 + 16Q_2 + 4Q_2(Q_1^2 - q_0^2 + 4Q_2^2) &= 0,
\end{aligned} \tag{3}
$$

where $\beta_1$ and $\beta_2$ are the viscous damping parameters for the first and second mode,
respectively, and $\lambda_0$ represents the non-dimensional loading parameter. The details
of this derivation and the definitions of non-dimensional variables and parameters
can be found in Tien (1992).

This study deals with the dynamic response, stability and bifurcation behaviour of
a system described by (3). Here $f(t)$ is either a time periodic function or a white noise
process. The first part of the paper deals with the global behaviour of the periodically
forced shallow arch system in the presence of 1:2 internal and principal subharmonic
resonance.

3. Deterministic Analysis

Resonances have long been known to cause complex or chaotic behaviour in both
dissipative and Hamiltonian dynamical systems. Since these resonances provide
near-integrable structure in the governing equations, one can use certain analytical
methods to study their global behaviour. To this end, in this section, consider (3) in the
presence of parametric subharmonic and internal resonances. Here it is assumed that
the external forcing is periodic, i.e. $f(t) = \mu\cos\nu t$, where $\mu$ and $\nu$ represent the amplitude
and frequency of the harmonic excitation, respectively. In the absence of any external
excitation (i.e., $f(t) = 0$), the system (3) permits single mode constant solutions
$(Q_1(t) = Q_{10},\ Q_2(t) = 0)$. Tien et al (1994) considered small perturbations in the
neighbourhood of $(Q_{10}, 0)$ and obtained nonlinear variational equations corresponding to
the forced equations of motion (3). In this investigation, a more realistic approach is
taken. First, the single mode time-dependent solutions for the forced case are obtained
using a power series expansion. Next, the nonlinear variational equations governing
the small perturbations are obtained near the time-dependent single mode solutions.
The external forcing appears as a parametric term in these perturbation equations. By
using the method of averaging (Bogoliubov & Mitropolsky 1961), the first order
approximation of the response of these perturbation equations is obtained in the
presence of 1:2 internal resonance and the principal subharmonic resonance.

3.1 Scaling and averaged equations 

In order to investigate the dynamics of the arch for 1:2 internal resonance, the damping
terms are ignored and the external forcing amplitude is assumed to be small
compared to the other terms, i.e.,

$$ \mu = \varepsilon\gamma, \qquad \beta_1 = \beta_2 = 0, \tag{4} $$

where $\varepsilon$ is a small parameter $(0 < \varepsilon \ll 1)$. As mentioned previously, the equations of
motion (3) permit $(Q_1(t) = \eta(t),\ Q_2(t) = 0)$ as one of the solutions. The following
expansion of $\eta(t)$ is assumed:

$$ \eta(t) = \eta_0 + \varepsilon\eta_1 + \varepsilon^2\eta_2 + O(\varepsilon^3). \tag{5} $$

In order to obtain variational equations about this solution, small perturbations $x_1$
and $x_2$ in the neighbourhood of the single mode solutions $(\eta(t), 0)$ are considered, i.e.

$$ Q_1 = \eta(t) + \varepsilon x_1, \qquad Q_2 = 0 + \varepsilon x_2. \tag{6} $$

The nonlinear variational equations governing the perturbations can be written as:

$$
\begin{aligned}
&\ddot{x}_1 + \omega_1^2 x_1 + \varepsilon[6\eta_0\eta_1 x_1 + \eta_0(3x_1^2 + 4x_2^2)] \\
&\quad + \varepsilon^2[3x_1(\eta_1^2 + 2\eta_0\eta_2) + \eta_1(3x_1^2 + 4x_2^2) + x_1^3 + 4x_1x_2^2] + O(\varepsilon^3) = 0, \\
&\ddot{x}_2 + \omega_2^2 x_2 + \varepsilon[8\eta_0\eta_1 x_2 + 8\eta_0 x_1 x_2] \\
&\quad + \varepsilon^2[4x_2(\eta_1^2 + 2\eta_0\eta_2) + 8\eta_1 x_1x_2 + 4(x_1^2x_2 + 4x_2^3)] + O(\varepsilon^3) = 0,
\end{aligned} \tag{7}
$$

where $\omega_1^2 = 3\eta_0^2 + (1 - q_0^2)$ and $\omega_2^2 = 4[\eta_0^2 + (4 - q_0^2)]$. Here $\eta_0$ is governed by a third
order algebraic equation and $\eta_1$, $\eta_2$ etc. can be explicitly obtained in terms of known
parameters and time. This yields

$$
\begin{aligned}
&\eta_0^3 + (1 - q_0^2)\eta_0 + (q_0 - \lambda_0) = 0, \\
&\eta_1 = b_1\cos(\nu t), \\
&\eta_2 = b_2 + b_3\cos(\nu t) + b_4\cos(2\nu t),
\end{aligned} \tag{8}
$$

where $b_1 = \gamma/(\omega_1^2 - \nu^2)$, and $b_2$, $b_3$ and $b_4$ can be similarly expressed in terms of
known quantities. The Hamiltonian corresponding to (7) can be expressed as

$$
\begin{aligned}
H(x_p, x_q) &= H_0 + \varepsilon H_1 + O(\varepsilon^2) \\
&= \tfrac{1}{2}[x_{p1}^2 + x_{p2}^2 + \omega_1^2x_{q1}^2 + \omega_2^2x_{q2}^2] \\
&\quad + \varepsilon[\eta_0(x_{q1}^3 + 4x_{q1}x_{q2}^2) + \eta_0\eta_1(3x_{q1}^2 + 4x_{q2}^2)] + O(\varepsilon^2),
\end{aligned} \tag{9}
$$

where $x_{q1} = x_1$, $x_{q2} = x_2$, $x_{p1} = \dot{x}_1$ and $x_{p2} = \dot{x}_2$. The steady state behaviour and
bifurcation analysis of the system is best described when the following canonical

Xqi = (2a,.) 1 / 2 cos(oy 4- (0,-M), x pi = - m i (2a I ) 1 / 2 sin(m i r + (0,/tw,), 

Vi = 1,2. (10) 

The following detuning parameters are introduced in order to examine the dynamics of 
the system in the presence of principal subharmonic resonance and 1:2 internal 
resonance. 


coJ = |(v 2 -£(r 1 ), co| = T&(v 2 — eo- 2 ). 


( 11 ) 


where $\sigma_1$ is the measure of deviation of the forcing frequency from twice the natural
frequency of the first mode, and $(\sigma_2 - \sigma_1)$ represents the deviation from the 1:2 internal
resonance. At this stage, the method of averaging (Bogoliubov & Mitropolsky 1961) is
applied to simplify the system. The averaged equations in the canonical variables can
be expressed as


a, = s 


6 4 

-u 1 h 1 ^ o sin( 20 1 ) + -( 2 a l ) 1 / 2 a 2 > 7 o sin( 0 1 — 20 2 ) 


+ 0(b 2 ), 


= e 


1 3 

— +-a 1 h 1 i/ 0 cos (2 0 J 


+ -(2a I ) 1/2 a 2 >/ o cos(0 1 -2 0 2 ) 


+ 0(s 2 ), 


~~~^a i ) ll2 a 2 rt 0 sin(6 l — 20 2 ) 


+ 0(e 7 


a 2 d 2 = e 


^cr 2 a 2 + 5(2a 1 ) 1 / 2 a 2 ^ o cos(0 1 - 2 6 2 ) 


+ 0 (£ 2 ), 


( 12 ) 


where 0 t = ^/cu,. In order to analyse the trivial equilibrium solutions, the averaged 
equations can be transformed to the rectangular coordinates. One finds that the trivial 
solution $(a_1 = 0, a_2 = 0)$ may lose its stability only through a simple bifurcation and
may give rise to one or two coupled mode nontrivial equilibrium branches, or to one single
mode and one coupled mode solution branch. For a detailed account of the stability and
bifurcation analysis, one is referred to Malhotra & Sri Namachchivaya (1995).


3.2 Analytical investigation of chaotic dynamics 


In order to examine the possibility of chaotic dynamics in the system, the higher
dimensional Melnikov analysis is employed. The details of the framework of this
method can be found in Wiggins (1988). Originally this method is due to Melnikov
(1963), who considered a two-dimensional non-autonomous system with a completely
integrable unperturbed part and small ($O(\varepsilon)$) time periodic terms. The non-resonant
higher dimensional Melnikov method assumes the homoclinic structure of the
unperturbed system along with arbitrary perturbations. Generically, certain invariant sets
may persist under perturbations. Thus, the knowledge of the global dynamics of the
unperturbed integrable system can be used to develop a measure of the distance
between the stable and unstable manifolds of that invariant set in the perturbed system.
This measure is called the Melnikov function, which provides crucial information on the
possibility of the presence of chaotic behaviour in certain classes of nonlinear systems.


3.2a Reduction to the standard form: Consider the following canonical transformation
in order to reduce the averaged system of equations to the form appropriate for
applying the higher dimensional Melnikov theory as described in Wiggins (1988):


9l = 



q 2 = 2—, p l = a l co l , p 2 =l(a 2 + 4a l )a) 2 . ( 13 ) 

a)-, i 


This transformation leads to the following set of equations in $(p_1, q_1, p_2, q_2)$
coordinates:


Pi = £ 


^3^(Pi) 1/2 (P2-Pi) s i n 9! +^% 1 siri2(q 1 + q 2 ) 


+ 0(e 2 ), 


<7i <7 2 16^ 0 (p 2 -3p 1 ) 

4v v v 3 ' 2 (p^i 2 


cos Oh) 


+ 

p 2 = e[ 


— ^ - °x os2(<7 1 +q 2 )j + 0(e 2 ), 

~^-p 1 sm2(q l + q 2 )~^ + 0 ( e 2 ), 


q 2 = 8 



32^o 

v 3/2 


(Pi) 1 ^ 2 cos q | 


+ 0(e 2 ). 


(1 


In these equations, the effect of excitation is present through the terms involving the
coefficient $b_1$, since $b_1 = \gamma/(\omega_1^2 - \nu^2)$. Thus, (14) attains the desired form if the time is
rescaled and the forcing terms are assumed to be small compared to the nonlinear
terms. For this purpose, the variables and parameters are rescaled in the following
manner:

$$ \gamma = \varepsilon\gamma', \quad t = t'/\varepsilon, \quad p_1 = p_1'/A^2, \quad p_2 = p_2'/A^2, \tag{15} $$

where $A = 16\eta_0\nu^{-3/2}$ and $t'$ denotes the slow time. For convenience, the prime (') is
dropped from the rescaled variables and parameters. The rescaled equations up to $O(\varepsilon)$
can now be written as:

$$
\begin{aligned}
\dot{p}_1 &= 2p_1^{1/2}(p_2 - p_1)\sin q_1 + \varepsilon[2\mu p_1\sin 2(q_1 + q_2)], \\
\dot{q}_1 &= \mu_1 + \mu_2 + p_1^{-1/2}(p_2 - 3p_1)\cos q_1 + \varepsilon[\mu\cos 2(q_1 + q_2)], \\
\dot{p}_2 &= \varepsilon[2\mu p_1\sin 2(q_1 + q_2)], \\
\dot{q}_2 &= -\mu_2 + 2p_1^{1/2}\cos q_1,
\end{aligned} \tag{16}
$$

where

*1 ^ 2 3 

= # ' 2 = 7 ’ 

In (16), the dot indicates differentiation with respect to $t'$. The objective is to study the
global dynamics associated with this system. As a first step, the structure of the
unperturbed system (obtained by setting $\varepsilon = 0$) is examined. This can be expressed as:

$$
\begin{aligned}
\dot{p}_1 &= 2p_1^{1/2}(p_2 - p_1)\sin q_1, \\
\dot{q}_1 &= \mu_1 + \mu_2 + p_1^{-1/2}(p_2 - 3p_1)\cos q_1, \\
\dot{p}_2 &= 0, \\
\dot{q}_2 &= -\mu_2 + 2p_1^{1/2}\cos q_1.
\end{aligned} \tag{17}
$$

It is clear that the first two equations are completely independent of $q_2$, and $p_2 = p_{20} =$
constant. Therefore, it is sufficient to study the phase flow in $(p_1, q_1)$ coordinates only.
The canonical transformations (13) and the periodicity of $q_1$ suggest that the range of
interest in $(p_1, q_1)$ coordinates is $0 < p_1 \le p_{20}$ and $0 \le q_1 \le 2\pi$. The Hamiltonian
corresponding to the two-dimensional $(p_1, q_1)$ vector field is given as:

$$ H(p_1, q_1) = (\mu_1 + \mu_2)p_1 + 2p_1^{1/2}(p_2 - p_1)\cos q_1. \tag{18} $$

It is obvious that the orbits in $(p_1, q_1)$ phase space depend only on the three parameters
$(p_{20}, \mu_1, \mu_2)$. The fixed points of the reduced vector field, in the range of interest, are
given as follows:

$$ (p_{20}, \hat{q}_1), \quad (p_{20}, 2\pi - \hat{q}_1), \quad (p_1^+, 0), \quad (p_1^+, 2\pi), \quad (p_1^-, \pi), $$

where

$$
\begin{aligned}
\hat{q}_1 &= \cos^{-1}\left[\frac{\sigma}{2(p_{20})^{1/2}}\right], \\
p_1^{\pm} &= (1/18)[\sigma^2 + 6p_{20} \pm \sigma(\sigma^2 + 12p_{20})^{1/2}], \\
\sigma &= (\mu_1 + \mu_2).
\end{aligned}
$$

The linear stability analysis indicates that the phase flow has qualitatively different
behaviour in region III as compared with regions I and II (figure 2). These three regions
are defined in the following manner:

I: $\{(\sigma, p_{20}) \mid \sigma \ge 0,\ (\sigma^2 - 4p_{20}) < 0\}$,

II: $\{(\sigma, p_{20}) \mid \sigma < 0,\ (\sigma^2 - 4p_{20}) < 0\}$,

III: $\{(\sigma, p_{20}) \mid (\sigma^2 - 4p_{20}) > 0\}$.

Figure 2. Various global bifurcation sets (I, II, III) in ($p_{20}$-$\sigma$) parameter space.
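
The fixed points and the three regions can be checked numerically. The following is a minimal sketch, assuming the reduced vector field (17) with $p_2 = p_{20}$ and the expressions for $\hat{q}_1$ and $p_1^{\pm}$ given above; the function names and the sample parameter values are hypothetical and chosen only for illustration.

```python
import math


def vector_field(p1, q1, p20, sigma):
    """Right-hand side of the reduced (p1, q1) system (17) with p2 = p20."""
    root = math.sqrt(p1)
    dp1 = 2.0 * root * (p20 - p1) * math.sin(q1)
    dq1 = sigma + (p20 - 3.0 * p1) * math.cos(q1) / root
    return dp1, dq1


def region(p20, sigma):
    """Classify a parameter pair into region I, II or III."""
    discriminant = sigma * sigma - 4.0 * p20
    if discriminant < 0.0:
        return "I" if sigma >= 0.0 else "II"
    if discriminant > 0.0:
        return "III"
    return "boundary"


def fixed_points(p20, sigma):
    """Saddle candidates (only in regions I and II) and centre candidates."""
    saddles = []
    if sigma * sigma < 4.0 * p20:
        q_hat = math.acos(sigma / (2.0 * math.sqrt(p20)))
        saddles = [(p20, q_hat), (p20, 2.0 * math.pi - q_hat)]
    root = math.sqrt(sigma * sigma + 12.0 * p20)
    p_plus = (sigma * sigma + 6.0 * p20 + sigma * root) / 18.0
    p_minus = (sigma * sigma + 6.0 * p20 - sigma * root) / 18.0
    centres = [(p_plus, 0.0), (p_plus, 2.0 * math.pi), (p_minus, math.pi)]
    return saddles, centres


# Usage: every returned point should make the vector field vanish.
saddles, centres = fixed_points(p20=1.0, sigma=-1.0)
residuals = [vector_field(p, q, 1.0, -1.0) for p, q in saddles + centres]
```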


One can easily check that the equilibrium points $(p_{20}, \hat{q}_1)$ and $(p_{20}, 2\pi - \hat{q}_1)$ are saddle
points in regions I and II, and the equilibrium points $(p_1^+, 0)$, $(p_1^+, 2\pi)$ and $(p_1^-, \pi)$ exist as
centre points anywhere in $(\sigma, p_{20})$ parameter space. One also notices that there are no
saddle points in region III; thus there do not exist any homoclinic/heteroclinic
connections in region III. The phase portraits corresponding to the global bifurcation
sets I and II are shown in figure 3. These phase portraits indicate the existence of
a heteroclinic cycle (A-A') connecting the two saddle points $(p_{20}, \hat{q}_1)$ and $(p_{20}, -\hat{q}_1)$.

The condition for the existence of this heteroclinic cycle is given as:



( 20 ) 


Furthermore, it will be shown numerically that chaotic behaviour is not possible for
this system when the system parameters lie outside regions I and II.


3.2b The Melnikov function: The next step is to determine whether the heteroclinic
cycle would break under small perturbations, and if it does break, whether the stable
and unstable manifolds of these heteroclinic connections (A and A') intersect
transversely. This information is obtained by calculating the Melnikov function M (Wiggins
1988) defined by

$$ M = \int_{-\infty}^{\infty}\left[\frac{\partial H}{\partial p_1}g^{p_1} + \frac{\partial H}{\partial q_1}g^{q_1} + \frac{\partial H}{\partial p_2}g^{p_2}\right]dt - \left.\frac{\partial H}{\partial p_2}\right|_{(p_{20},\hat{q}_1)}\int_{-\infty}^{\infty} g^{p_2}\,dt, \tag{21} $$

where the integrand has been evaluated at any arbitrary point on the heteroclinic cycle.
In the above expression, $g^{p_1}$, $g^{q_1}$, $g^{p_2}$ and $g^{q_2}$ are the $O(\varepsilon)$ terms in the four-dimensional
perturbed system given by (16),

$$
\begin{aligned}
g^{p_1} &= 2\mu p_1\sin 2(q_1 + q_2), \\
g^{q_1} &= \mu\cos 2(q_1 + q_2), \\
g^{p_2} &= 2\mu p_1\sin 2(q_1 + q_2), \\
g^{q_2} &= 0,
\end{aligned} \tag{22}
$$

and the Hamiltonian (H) corresponding to the four-dimensional unperturbed phase
space is given as:

$$ H(p_1, q_1, p_2, q_2) = \sigma p_1 + 2p_1^{1/2}(p_2 - p_1)\cos q_1 - \mu_2 p_2. \tag{23} $$

On substituting (22) and (23) in (21), the Melnikov function for this system can be
expressed as follows:

$$
\begin{aligned}
M = \int_{-\infty}^{\infty}\big[&\{\sigma + p_1^{-1/2}(p_2 - 3p_1)\cos q_1\}\{2\mu p_1\sin 2(q_1 + q_2)\} \\
&- \{2p_1^{1/2}(p_2 - p_1)\sin q_1\}\{\mu\cos 2(q_1 + q_2)\} \\
&+ \{-\mu_2 + 2p_1^{1/2}\cos q_1\}\{2\mu p_1\sin 2(q_1 + q_2)\}\big]\,dt \\
&- \mu_1\int_{-\infty}^{\infty}[2\mu p_1\sin 2(q_1 + q_2)]\,dt.
\end{aligned} \tag{24}
$$

Figure 3. (a) Unperturbed phase flow in $(p_1, q_1)$ phase space in region I (i.e. $\sigma > 0$).
(b) Unperturbed phase flow in $(p_1, q_1)$ phase space in region II (i.e. $\sigma < 0$).

3.2c Time parametrized expressions for heteroclinic orbits: In order to evaluate the
Melnikov function, the explicit expressions for the heteroclinic orbits are required. The
heteroclinic orbit A' is the horizontal line connecting the two saddle points as shown in
figure 3. This orbit is described by the following set of equations:

$$
\begin{aligned}
p_1(t) &= p_{20}, \\
\dot{q}_1(t) &= \sigma - 2(p_{20})^{1/2}\cos q_1(t), \\
p_2(t) &= p_{20}, \\
\dot{q}_2(t) &= -\mu_2 + 2(p_{20})^{1/2}\cos q_1(t).
\end{aligned} \tag{25}
$$

To evaluate the Melnikov function for the heteroclinic orbit A', one does not require
the explicit expressions for $q_1(t)$ and $q_2(t)$. On substituting $p_1(t) = p_{20}$ in (24) and further
simplifying the integrand, the Melnikov function assumes the following form:

$$
\begin{aligned}
M^{A'} = \int_{-\infty}^{\infty}\big[&\{\sigma - 2(p_{20})^{1/2}\cos q_1\}\{2\mu p_{20}\sin 2(q_1 + q_2)\} \\
&+ \{-\mu_1 - \mu_2 + 2(p_{20})^{1/2}\cos q_1\}\{2\mu p_{20}\sin 2(q_1 + q_2)\}\big]\,dt.
\end{aligned} \tag{26}
$$

Using $\sigma = (\mu_1 + \mu_2)$ in (26), it is trivial to show that $M^{A'} = 0$, which means that the $O(\varepsilon)$ distance
between the stable and unstable manifolds of A' is always zero and, hence, it can be
concluded that the heteroclinic orbit A' does not break under an $O(\varepsilon)$ perturbation.
The parametrized expression describing the heteroclinic orbit A can be obtained from

$$ H(p_1, q_1) = H(p_{20}, \hat{q}_1), $$

which implies

$$ p_1(t) = \frac{\sigma^2}{4\cos^2 q_1(t)}. \tag{27} $$

Substituting this in the unperturbed equations for $q_1$ and $q_2$, explicit expressions for
$q_1(t)$ and $q_2(t)$ can be obtained. Hence, the heteroclinic orbit A can be described by

$$
\begin{aligned}
p_1(t) &= \frac{\sigma^2}{4} + \left(p_{20} - \frac{\sigma^2}{4}\right)\tanh^2\left[\left(p_{20} - \frac{\sigma^2}{4}\right)^{1/2} t\right], \\
q_1(t) &= \tan^{-1}\left\{\frac{2}{\sigma}\left(p_{20} - \frac{\sigma^2}{4}\right)^{1/2}\tanh\left[\left(p_{20} - \frac{\sigma^2}{4}\right)^{1/2} t\right]\right\} + q_{10}, \\
p_2(t) &= p_{20}, \\
q_2(t) &= \mu_1 t + q_{20},
\end{aligned} \tag{28}
$$

where $q_{10}$ is the initial value of $q_1(t)$. For region I $(0 \le \sigma < 2(p_{20})^{1/2})$, $q_{10} = 0$ and for
region II $(-2(p_{20})^{1/2} < \sigma < 0)$, $q_{10} = \pi$; $q_{20}$ is the initial value of $q_2(t)$ and is used as
another parameter. Using these expressions in (24), and after some algebraic
manipulation, the Melnikov function for the heteroclinic orbit A can be obtained as

$$
\begin{aligned}
M^{A} = {}& 2\mu\sin 2q_{20}\int_{-\infty}^{\infty}(p_1(t))^{1/2}\{p_{20} - p_1(t)\}\cos(q_1(t) + 2\mu_1 t)\,dt \\
&+ 2\mu\cos 2q_{20}\int_{-\infty}^{\infty}(p_1(t))^{1/2}\{p_{20} - p_1(t)\}\sin(q_1(t) + 2\mu_1 t)\,dt.
\end{aligned} \tag{29}
$$

One observes that the integrand associated with the second integral in this expression is
an odd function of $t$, and thus the contribution due to this part is zero. The first integral
can be computed easily using (27) and (28) to give

$$ M^{A} = C(\mu_1, \mu_2, \mu, p_{20})\sin 2q_{20}, \tag{30} $$

where

$$ C(\mu_1, \mu_2, \mu, p_{20}) = -2\mu_1(\mu_1 - \mu_2)\,\mu\pi\,\mathrm{csch}\left(\frac{\pi\mu_1}{(p_{20} - \sigma^2/4)^{1/2}}\right). \tag{31} $$

It is worth mentioning that C depends on the system parameters $\mu_1$, $\mu_2$, $\mu$ and $p_{20}$, and
is, in general, non-zero. Hence, under the circumstances when C is non-zero, the
Melnikov function will have a simple zero with respect to $q_{20}$ at $q_{20} = n\pi/2$. In
this case, the heteroclinic orbit A may break under small perturbation with transversal
intersection and the system may exhibit chaotic dynamics in the sense of the Smale
horseshoe.
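
The first integral in the Melnikov function for the orbit A can be checked by direct quadrature. The sketch below is an independent numerical check rather than part of the original analysis. It assumes that, with $k = (p_{20} - \sigma^2/4)^{1/2}$, the orbit A is $p_1 = \sigma^2/4 + k^2\tanh^2(kt)$, $q_1 = \tan^{-1}[(2k/\sigma)\tanh(kt)] + q_{10}$, as follows from the relation $p_1 = \sigma^2/(4\cos^2 q_1)$ and the unperturbed equation for $q_1$, and it compares the quadrature with the closed form $\pi\mu_1(\mu_2 - \mu_1)\,\mathrm{csch}(\pi\mu_1/k)$, which is $C/(2\mu)$ under the same assumption. Function names and parameter values are hypothetical, and $\sigma \ne 0$ is required.

```python
import math


def orbit_a(t, p20, sigma):
    """Assumed time parametrisation of the heteroclinic orbit A (sigma != 0)."""
    k = math.sqrt(p20 - sigma * sigma / 4.0)
    th = math.tanh(k * t)
    p1 = sigma * sigma / 4.0 + k * k * th * th
    q10 = 0.0 if sigma > 0.0 else math.pi      # region I or region II
    q1 = math.atan(2.0 * k * th / sigma) + q10
    return p1, q1


def melnikov_integral(p20, mu1, mu2, t_max=40.0, n=8000):
    """Composite Simpson estimate of the first integral (n must be even)."""
    sigma = mu1 + mu2

    def integrand(t):
        p1, q1 = orbit_a(t, p20, sigma)
        return math.sqrt(p1) * (p20 - p1) * math.cos(q1 + 2.0 * mu1 * t)

    h = 2.0 * t_max / n
    total = integrand(-t_max) + integrand(t_max)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * integrand(-t_max + i * h)
    return total * h / 3.0


def closed_form(p20, mu1, mu2):
    """pi*mu1*(mu2 - mu1)*csch(pi*mu1/k), i.e. C/(2*mu) under the assumptions."""
    sigma = mu1 + mu2
    k = math.sqrt(p20 - sigma * sigma / 4.0)
    return math.pi * mu1 * (mu2 - mu1) / math.sinh(math.pi * mu1 / k)


numerical = melnikov_integral(p20=1.0, mu1=0.3, mu2=0.2)
analytical = closed_form(p20=1.0, mu1=0.3, mu2=0.2)
```

The integrand decays like $\mathrm{sech}^2(kt)$, so truncating the infinite range at a moderate `t_max` is harmless for parameters well inside regions I and II.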

3.2d Numerical results: In order to examine the effect of small perturbations on the
dynamics of the unperturbed system in more detail, one often resorts to numerical
simulations. As noted previously, in the absence of any perturbations, the system
dynamics can be studied completely in $(p_1, q_1)$ phase space (figure 4a). The introduction
of small perturbation couples the dynamics of all the variables $(p_1, q_1, p_2, q_2)$ and in
four-dimensional phase space the dynamics can be very complicated. Although the analysis
considered earlier provides some insight into the global dynamics of the perturbed system,
it does not generally yield the complete picture. However, the trajectories in the
four-dimensional phase space can be numerically computed on a two-torus. Using the fact
that $q_2 \in T^1$ is a phase variable with period $\pi$, three-dimensional Poincare maps can be
constructed in $(p_1, q_1, p_2)$ phase space, with $q_2 = q_{20} \bmod \pi$, where $q_{20}$ can be set to zero
without loss of generality. Furthermore, the momentum $p_2$ varies very slowly for small
perturbations. Thus, the Poincare map can be easily represented in $(p_1, q_1)$ space.

Figure 4. (a) The phase trajectories for the unperturbed system ($\varepsilon = 0$) in $(p_1, q_1)$ plane with
$\sigma = -1.0$, $\mu_2 = 0.5$ and $p_2 = 1.0$. (b) The Poincare map for the perturbed system in $(p_1, q_1)$
plane with $\sigma = -1.0$, $\mu_2 = 0.5$ and $\varepsilon = 0.01$. (c) Same as part (b) except $\varepsilon = 0.1$.
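
The construction of such a Poincare map can be sketched as follows. This is a minimal illustration, not the code used for figure 4: it assumes the rescaled system (16) with the product $\varepsilon\mu$ treated as a single parameter `eps_mu`, a fixed-step fourth-order Runge-Kutta integrator, and linear interpolation between steps to locate the crossings $q_2 = 0 \bmod \pi$. All function names, the step size and the initial condition are hypothetical.

```python
import math


def rhs(state, sigma, mu2, eps_mu):
    """Right-hand side of the rescaled system (16); eps_mu stands for eps*mu."""
    p1, q1, p2, q2 = state
    root = math.sqrt(p1)
    phase = 2.0 * (q1 + q2)
    dp1 = 2.0 * root * (p2 - p1) * math.sin(q1) + eps_mu * 2.0 * p1 * math.sin(phase)
    dq1 = sigma + (p2 - 3.0 * p1) * math.cos(q1) / root + eps_mu * math.cos(phase)
    dp2 = eps_mu * 2.0 * p1 * math.sin(phase)
    dq2 = -mu2 + 2.0 * root * math.cos(q1)
    return (dp1, dq1, dp2, dq2)


def rk4_step(state, h, sigma, mu2, eps_mu):
    """One classical fourth-order Runge-Kutta step."""
    def shifted(k, c):
        return tuple(s + c * ki for s, ki in zip(state, k))
    k1 = rhs(state, sigma, mu2, eps_mu)
    k2 = rhs(shifted(k1, h / 2.0), sigma, mu2, eps_mu)
    k3 = rhs(shifted(k2, h / 2.0), sigma, mu2, eps_mu)
    k4 = rhs(shifted(k3, h), sigma, mu2, eps_mu)
    return tuple(s + h * (a + 2.0 * b + 2.0 * c + d) / 6.0
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))


def poincare_points(state, sigma, mu2, eps_mu, h=0.01, n_steps=5000):
    """Collect (p1, q1) whenever q2 crosses an integer multiple of pi."""
    points = []
    for _ in range(n_steps):
        new = rk4_step(state, h, sigma, mu2, eps_mu)
        n_old = math.floor(state[3] / math.pi)
        n_new = math.floor(new[3] / math.pi)
        if n_new != n_old:
            level = max(n_old, n_new) * math.pi
            frac = (level - state[3]) / (new[3] - state[3])
            points.append((state[0] + frac * (new[0] - state[0]),
                           state[1] + frac * (new[1] - state[1])))
        state = new
    return points


def hamiltonian(p1, q1, p20, sigma):
    """Unperturbed two-dimensional Hamiltonian (18) with p2 = p20."""
    return sigma * p1 + 2.0 * math.sqrt(p1) * (p20 - p1) * math.cos(q1)


# Usage: sigma = -1.0 and mu2 = 0.5 as in figure 4, weak perturbation.
section = poincare_points((0.4, math.pi, 1.0, 0.0), -1.0, 0.5, eps_mu=0.01)
```

With `eps_mu = 0` the section points must lie on a level curve of the Hamiltonian (18), which provides a convenient check of the integrator before the perturbation is switched on.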

Figure 4b shows a Poincare map in (p x , qj space for a — — 1-0, p 2 = O'5, and e/i = 0-01. 
In this diagram, one finds that most of the periodic orbits of the unperturbed system 
survive as quasi-periodic orbits with non-commensurable frequencies on a 2-torus, 
while the initial conditions taken very close to the unperturbed heteroclinic orbit lead 
to chaotic dynamics in the perturbed system. As the perturbation strength (ep) is increased 
to 0T, the extent of the chaotic attractor increases and various small secondary regions 
appear in a symmetric fashion about the q 1 = 0 axis (figure 4c). These secondary regions are 
small islands of quasi-periodic motion embedded in the layer of stochastic motion. The 
global dynamics of this system were studied for various values of the system parameters. It 
was observed that for a given perturbation strength the extent of the chaotic layer is rather 
small for non-zero values of the detuning parameter $\sigma$ as compared to the case where $\sigma = 0$.
The number, location and appearance of the secondary regions are also sensitive to the 
values of the detuning parameters. The influence of small perturbations arising from 
dissipation effects was also examined. It was found that even a very small damping causes 
the secondary regions to disappear, and the quasi-periodic orbits and chaotic attractor 
shrink towards the trivial solution. For additional details on the nature of trajectories in 
the perturbed system, one is referred to Malhotra & Sri Namachchivaya (1995). 

To summarise, in this section the conditions on system parameters for which the 
shallow arch system can exhibit chaotic dynamics under small harmonic excitations 
are explored. It is shown that in the absence of any dissipation and excitation, the 
averaged system corresponding to the nonlinear variational equations can have 
heteroclinic connections for certain values of system parameters. The higher-dimensional
Melnikov’s method is used to show that one of the heteroclinic orbits may break
under small time periodic excitations. Under these conditions, the averaged system
may have Smale horseshoe type chaos whenever the system parameters $\sigma$ and $p_{20}$ lie
inside the domains I and II. Using a standard theorem, these results can be extended to
the original system. It must be noted that the existence of a simple zero of the Melnikov 
function only provides the necessary condition for chaos to occur. The applicability of 
this method is restricted to this class of problems in the sense that one cannot examine 
the effect of perturbations arising from the dissipation effects. Numerical simulations 
were also performed to confirm the predictions made in the analysis and to further 
explore the effect of small perturbations on the periodic and heteroclinic orbits. These 
simulations reveal very exciting details of the underlying structure of the quasi-periodic 
orbits and chaotic regions in the perturbed system. 


4. Stochastic analysis 

This section examines the response, stability and bifurcations of nonlinear structural 
systems under stochastic excitations. There has been considerable research in this area
both from mathematical and engineering points of view. In this section, a review of 
recent results pertaining to two-degree-of-freedom structural systems and some new 
results on moment Lyapunov exponents are presented. This section is divided into 
three parts dealing with Lyapunov exponents, moment Lyapunov exponents and 
stochastic bifurcations. The first two sections address the stability aspects and the final 
section deals with the nonlinear response of stochastic systems. 

The stability of a linear stochastic system can be defined in several ways. The weakest 
definition is that of stability in distribution. A more conservative estimate of the 
stability boundary is described by stability in probability. Thus, if a system is stable in 
probability, it is also stable in distribution. The other commonly used definitions of 
stochastic stability are stability in the rth mean and almost-sure stability, or stability 
with probability one. If a dynamical system excited by noise is stable according to either 
of these definitions, it is stable in distribution and in probability, as well. It is well 
known that, for a linear system with stochastic parametric excitation, the Lyapunov 
exponents are analogous to the real part of the eigenvalue. Stability in the almost-sure 
sense is determined by the sign of the maximal Lyapunov exponent. 

However, rth mean and almost-sure stability do not imply each other, i.e. a system 
can be almost surely stable while its 2nd moments grow exponentially. Kozin & 
Sugimoto (1977), with extensions by Arnold (1991), established a characterization 
between moment stability and almost-sure stability for linear Ito stochastic differential 
equations when the process is ergodic on the entire surface of the n-sphere. It was shown 
that the region of sample stability is the limit of the regions of rth moment stability for
r approaching zero. In this section, both Lyapunov exponents and moment Lyapunov 
exponents obtained for the two-degree-of-freedom arch problem using asymptotic 
methods will be presented. 
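
As a minimal illustration of the gap between almost-sure and moment stability, one may consider a scalar Ito equation $dx = a\,x\,dt + s\,x\,dW$. This textbook example is an assumption introduced here for illustration and is not the arch system; the names `a`, `s` and the helper functions are hypothetical. Its closed-form solution gives $\lambda = a - s^2/2$ and $g(p) = p(a - s^2/2) + p^2 s^2/2$, so the sample paths can decay while the second moment grows.

```python
def lyapunov_exponent(a, s):
    """Almost-sure growth rate of x(t) for dx = a x dt + s x dW (Ito)."""
    return a - 0.5 * s ** 2


def moment_lyapunov_exponent(p, a, s):
    """Growth rate of E|x(t)|^p for the same scalar equation."""
    return p * (a - 0.5 * s ** 2) + 0.5 * p ** 2 * s ** 2


def stability_index(a, s):
    """Nonzero root p* of g(p) = 0; moments with 0 < p < p* decay when lambda < 0."""
    return -2.0 * lyapunov_exponent(a, s) / s ** 2


a, s = 0.2, 1.0  # illustrative values only, not taken from the paper
lam = lyapunov_exponent(a, s)             # negative: almost-sure stability
g2 = moment_lyapunov_exponent(2.0, a, s)  # positive: mean-square instability
p_star = stability_index(a, s)
print(lam, g2, p_star)
```

In this sketch the slope of $g(p)$ at $p = 0$ equals the Lyapunov exponent, which mirrors the statement that sample stability is the limit of rth moment stability as r approaches zero.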

When the steady state motion of the nonlinear system is unstable, either in the mean 
square or almost sure sense, the nonlinearities act as limiters of the response and give 
rise to a new steady state motion. In order to examine such bounded motions, the 
response of the system beyond the bifurcation point must be examined. Section 4.2
examines such behaviour. 

4.1 Asymptotic linear results 

The systems under consideration consist of multiplicative two-degree-of-freedom 
systems. Such systems are encountered in the study of mechanical systems subjected to 
fluctuating loading or imposed displacements as well as in the investigation of the 
stability of steady state motions of nonlinear dynamical systems. Consider, for example, 

$$\ddot{y}_1 + 2\zeta\omega_1\dot{y}_1 + \omega_1^2 y_1 + f(y_1, y_2) = w(t),$$

$$\ddot{y}_2 + 2\zeta\omega_2\dot{y}_2 + \omega_2^2 y_2 + y_2\, g(y_1, y_2) = 0, \qquad (32)$$

where $f(0, 0) = 0$, $\partial f/\partial y_2\,(y_1, 0) \neq 0$, and $w(t)$ is a white noise process. Then the stability
of the solution $(y_1 = \xi(t), y_2 = 0)$ is governed by a set of variational equations defined by

? + 2Ca> 1 { + a)^+/(^0)«0. 

The maximal Lyapunov exponent for a general four-dimensional system with two critical 
modes parametrically perturbed by a small intensity real noise process is approximated 
by Sri Namachchivaya & Van Roessel (1993) and Doyle & Sri Namachchivaya (1994)
using a perturbation approach. In these studies, the natural frequencies are assumed to 
be non-commensurable and the infinitesimal generator associated with the noise 
process is assumed to have an isolated simple zero eigenvalue. 

4.1a Maximal Lyapunov exponent: The focus of this section is to approximate the 
maximal Lyapunov exponents for a two-dimensional system perturbed by a small 
intensity multiplicative real noise process. Consider linear oscillatory systems described
by equations of motion of the form

$$\ddot{q}_i + \omega_i^2 q_i + \varepsilon^2\, 2\zeta_i\omega_i\dot{q}_i + \varepsilon\sum_{j=1}^{2} k_{ij}\, q_j\, f(\xi(t)) = 0, \qquad i = 1, 2,$$

where the $q_i$'s are generalized coordinates, $\omega_i$ is the $i$th natural frequency, and $\varepsilon^2\zeta_i$
represents a small viscous damping coefficient. The term $\xi(t)$ is a small intensity real
noise process. In this section, an asymptotic expansion for the maximal Lyapunov 
exponent under the real noise assumption is calculated, and readers are referred to 
Pinsky & Wihstutz (1991), Sri Namachchivaya & Van Roessel (1993) and Doyle & Sri 
Namachchivaya (1994) for general calculations of these quantities. 

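The method outlined in the last two references above applies to systems of the form

$$\dot{x} = Ax + \varepsilon f(\xi(t)) Bx, \qquad x \in \mathbb{R}^4. \qquad (33)$$

Appropriate scaling of the matrix $A$ yields

$$A = A_0 - \varepsilon^2 A_1,$$

where

$$A_0 = \begin{bmatrix} 0 & \omega_1 & 0 & 0 \\ -\omega_1 & 0 & 0 & 0 \\ 0 & 0 & 0 & \omega_2 \\ 0 & 0 & -\omega_2 & 0 \end{bmatrix} \quad\text{and}\quad A_1 = \begin{bmatrix} \delta_1 & 0 & 0 & 0 \\ 0 & \delta_1 & 0 & 0 \\ 0 & 0 & \delta_2 & 0 \\ 0 & 0 & 0 & \delta_2 \end{bmatrix}.$$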

The matrix $B$ is described by $B = [b_{ij}]$. For convenience, the following notation is
employed:

$$H^{\pm}_{ij} = b_{2i-1,2j} \pm b_{2i,2j-1} \quad\text{and}\quad J^{\pm}_{ij} = b_{2i,2j} \pm b_{2i-1,2j-1}.$$

The quantities $\delta_1$ and $\delta_2$ are damping parameters. In this analysis, it is assumed that the
frequencies $\omega_1$ and $\omega_2$ are non-commensurable. The results presented in this paper are
obtained under very general conditions on the noise term. The term $\xi(t)$ is a small
intensity real noise process defined on a smooth connected one-dimensional Riemannian
manifold $M$ (with or without boundary). The smooth function $f$ is
assumed to have zero mean.

Before proceeding, a brief description of some of the results of Oseledec’s Multiplicative
Ergodic Theorem (Oseledec 1968), as related to four-dimensional systems, is
necessary. Consider the linear stochastic system in (33) under the assumption that $\xi(t)$
is ergodic. According to the Multiplicative Ergodic Theorem, the Lyapunov exponent
of the solution of (33), $x(t; x_0)$, for the initial condition $x_0$ ($x_0 \neq 0$) is

$$\lambda(x_0) = \lim_{t\to\infty}\frac{1}{t}\log\|x(t; x_0)\|, \qquad (34)$$

where $\lambda(x_0)$ takes on one of $r$ fixed or non-random values $\lambda_1 < \cdots < \lambda_r$. Which $\lambda_i$ is
realized depends on the initial condition $x_0$. The multiplicities of the Lyapunov
exponents sum to the dimension of the system, $n$ (in this case, $n = 4$). Associated with
each $\lambda_i$ there exists a random linear invariant subspace $E_i$, known as an Oseledec space,
such that $E_1 \oplus E_2 \oplus \cdots \oplus E_r = \mathbb{R}^n$, with

$$\lim_{t\to\pm\infty}\frac{1}{t}\log\|x(t; x_0)\| = \lambda_i \quad\text{iff}\quad x(t; x_0) \in E_i.$$

The dimension of each Oseledec space $E_i$ is given by the multiplicity of the associated
Lyapunov exponent, $\lambda_i$.
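
To make the limit in the definition of the Lyapunov exponent concrete, the following sketch evaluates the finite-time quantity $(1/t)\log\|x(t; x_0)\|$ for a single damped mode without noise, i.e. $\dot{x}_1 = -d\,x_1 + \omega x_2$, $\dot{x}_2 = -\omega x_1 - d\,x_2$. This deterministic special case and the names `finite_time_exponent`, `damping` and `omega` are assumptions made for illustration; the closed-form solution is a rotation scaled by $e^{-dt}$, so the exponent should approach $-d$.

```python
import math


def finite_time_exponent(x0, damping, omega, t):
    """(1/t) log ||x(t; x0)|| for x1' = -d x1 + w x2, x2' = -w x1 - d x2."""
    c, s = math.cos(omega * t), math.sin(omega * t)
    decay = math.exp(-damping * t)
    # Closed-form solution: exponential decay times a rotation of x0.
    x1 = decay * (c * x0[0] + s * x0[1])
    x2 = decay * (-s * x0[0] + c * x0[1])
    return math.log(math.hypot(x1, x2)) / t


# The initial condition only contributes log||x0|| / t, which vanishes as t grows.
for t in (10.0, 100.0, 2000.0):
    print(t, finite_time_exponent((3.0, 4.0), 0.05, 1.3, t))
```

With noise present the same quantity would have to be estimated along simulated sample paths; the deterministic case is shown only because its limit is known exactly.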

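Let $G$ denote the infinitesimal generator of $\xi(t)$, i.e.

$$G(\xi) = \mu_i(\xi)\frac{\partial}{\partial\xi_i} + \frac{1}{2}\sigma_{ij}(\xi)\frac{\partial^2}{\partial\xi_i\,\partial\xi_j},$$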


and, as in Sri Namachchivaya & Van Roessel (1993), assume that G has an isolated 
simple zero eigenvalue. This implies that u = constant is the only solution of Gu = 0. 
Consequently, the adjoint operator G* must also have an isolated simple zero 
eigenvalue. The results obtained from this analysis are applicable when the manifold 
M is of arbitrary dimension. 

By the usual transformation, i.e.

$$x_{2i-1} = r_i\cos\phi_i, \qquad x_{2i} = -r_i\sin\phi_i \qquad\text{and}\qquad \rho_i = \ln(r_i),$$

amplitude and phase equations of the form

P, = S[? t m (35) 

^ = [<o, + e 2 r,W)] +£[Me ( "- p '>,«]/««), P6) 


are obtained. In this form, the amplitude and phase equations are coupled by the
presence of terms of the form $e^{(\rho_j - \rho_i)}$ for $i \neq j$. Since $e^{(\rho_j - \rho_i)}$ is always positive, one can
introduce a one-to-one mapping $e^{(\rho_2 - \rho_1)} = \tan\theta$, $\theta \in [0, \pi/2]$. Thus, applying the following
transformations to the original system,

$$x_1 = e^{\rho}\cos\phi_1\cos\theta, \qquad x_2 = -e^{\rho}\sin\phi_1\cos\theta,$$

$$x_3 = e^{\rho}\cos\phi_2\sin\theta, \qquad x_4 = -e^{\rho}\sin\phi_2\sin\theta,$$

yields the following set of equations for the amplitude $\rho$, phase variables $(\phi_1, \phi_2, \theta)$ and
noise process $\xi$:

$$\dot{\phi}_i = \omega_i + \varepsilon f(\xi)\, h_i(\phi_1, \phi_2, \theta) + \varepsilon^2\bar{h}_i(\phi_1, \phi_2, \theta),$$

$$\dot{\theta} = \varepsilon f(\xi)\, q_2(\phi_1, \phi_2, \theta) + \varepsilon^2\bar{q}_2(\theta),$$

$$d\xi = \mu(\xi)\,dt + \sigma(\xi)\circ dW_t,$$

where $q_i$, $\bar{q}_i$ and $h_i$ are given in terms of $H^{\pm}_{ij}$, $J^{\pm}_{ij}$ and $\delta_i$ ($i, j = 1, 2$). Given the structure of
the $A_1$ matrix, it can be shown that $\bar{h}_i = 0$ for $i = 1, 2$.

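The processes $(\phi_1, \phi_2, \theta, \xi)$ are independent of the amplitude $\rho$ and form a diffusive
Markov process with associated generator

$$L^{\varepsilon} = L^0 + \varepsilon L^1 + \varepsilon^2 L^2,$$

where, in general,

$$L^0 = G(\xi) + \sum_{i=1}^{2}\omega_i\frac{\partial}{\partial\phi_i}, \qquad L^1 = f(\xi)\left[q_2\frac{\partial}{\partial\theta} + \sum_{i=1}^{2} h_i\frac{\partial}{\partial\phi_i}\right], \qquad L^2 = \bar{q}_2\frac{\partial}{\partial\theta} + \sum_{i=1}^{2}\bar{h}_i\frac{\partial}{\partial\phi_i}.$$

For this particular matrix, $L^2 = \bar{q}_2\,(\partial/\partial\theta)$; we also define the function $Q^{\varepsilon}(\phi_1, \phi_2, \theta, \xi)$
such that

$$\dot{\rho} = Q^{\varepsilon} = Q^0 + \varepsilon Q^1 + \varepsilon^2 Q^2.$$

In the present analysis, one can write $Q^{\varepsilon}$ in terms of $f(\xi)$, $q_1$ and $\bar{q}_1$ as follows:

$$Q^{\varepsilon}(\phi_1, \phi_2, \theta, \xi) = \varepsilon f(\xi)\, q_1(\phi_1, \phi_2, \theta) + \varepsilon^2\bar{q}_1(\theta).$$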

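Then, according to Oseledec’s multiplicative ergodic theorem (Oseledec 1968), assuming
the operator $L^{\varepsilon}$ to be ergodic, the maximal Lyapunov exponent is given by

$$\lambda^{\varepsilon} = \langle Q^{\varepsilon}, p^{\varepsilon}\rangle = \int Q^{\varepsilon} p^{\varepsilon}\, d\phi_1\, d\phi_2\, d\xi\, d\theta,$$

where $p^{\varepsilon}$ is the unique ergodic invariant measure associated with the generator $L^{\varepsilon}$, i.e. $p^{\varepsilon}$
solves the Fokker-Planck equation given by

$$L^{\varepsilon *} p^{\varepsilon} = 0,$$

provided $L^{\varepsilon}$ is hypoelliptic.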

A formal expansion of the invariant measure is constructed, i.e.

$$p^{\varepsilon} = p^0 + \varepsilon p^1 + \cdots + \varepsilon^N p^N + \cdots.$$

Substituting this expansion and the expansion for $L^{\varepsilon}$ into the Fokker-Planck equation
yields the following sequence of Poisson equations to be solved for $p^0, p^1, p^2, \ldots$:

$$L^{0*} p^0 = 0,$$

$$L^{0*} p^1 = -L^{1*} p^0,$$

$$L^{0*} p^2 = -L^{1*} p^1 - L^{2*} p^0.$$

This yields the following asymptotic expression for the maximal Lyapunov exponent
(see Doyle & Sri Namachchivaya (1994) for proof):

$$\lambda^{\varepsilon} = \langle Q^0, p^0\rangle + \varepsilon\left[\langle Q^1, p^0\rangle + \langle Q^0, p^1\rangle\right] + \varepsilon^2\left[\langle Q^2, p^0\rangle + \langle Q^1, p^1\rangle + \langle Q^0, p^2\rangle\right] + \cdots.$$

Since $Q^0 = 0$, the expression for the maximal Lyapunov exponent reduces to

$$\lambda^{\varepsilon} = \varepsilon\langle Q^1, p^0\rangle + \varepsilon^2\left[\langle Q^2, p^0\rangle + \langle Q^1, p^1\rangle\right] + \cdots,$$

where $p^0$ and $p^1$ satisfy the Poisson equations above with the periodic boundary
conditions

$$p^0(\phi_1, \phi_2, \theta, \xi) = p^0(\phi_1 + 2\pi, \phi_2, \theta, \xi) = p^0(\phi_1, \phi_2 + 2\pi, \theta, \xi),$$

$$p^1(\phi_1, \phi_2, \theta, \xi) = p^1(\phi_1 + 2\pi, \phi_2, \theta, \xi) = p^1(\phi_1, \phi_2 + 2\pi, \theta, \xi).$$

Solving the $O(1)$ Poisson equation with appropriate boundary conditions and considering
the non-commensurability condition on the natural frequencies yields

$$p^0(\theta, \xi) = \nu(\xi) F(\theta)/4\pi^2,$$

where $\nu(\xi)$ is the invariant measure satisfying $G^{*}\nu = 0$. Note that for arbitrary $F(\theta)$, the
inner product $\langle Q^1, p^0\rangle = 0$ due to the periodic boundary conditions on $\phi_1$ and $\phi_2$ and
the zero mean assumption on $f(\xi)$. The maximal Lyapunov exponent, up to $O(\varepsilon^2)$,
reduces to

$$\lambda^{\varepsilon} = \varepsilon^2\left[\langle Q^2, p^0\rangle + \langle Q^1, p^1\rangle\right].$$

Using the solvability condition for the $O(\varepsilon)$ Poisson equation, one can show that
a unique solution for $p^1$ exists. The $O(\varepsilon)$ Poisson equation itself yields an expression for
$p^1$ in terms of $p^0$. The term $F(\theta)$ in the expression for $p^0$ can be determined with the aid
of the $O(\varepsilon^2)$ solvability condition. Evaluating the solvability condition yields the
following ordinary differential equation for $F(\theta)$:

$$-\frac{d}{d\theta}\left[\Phi(\theta) F(\theta)\right] + \frac{1}{2}\frac{d^2}{d\theta^2}\left[\Psi^2(\theta) F(\theta)\right] = 0, \qquad (37)$$

where

$$\Psi^2(\theta) = A\cos^2 2\theta + B\cos 2\theta + C,$$

$$\Phi(\theta) = -\tfrac{1}{2}(\lambda_1 - \lambda_2)\sin 2\theta + \Psi^2(\theta)\cot 2\theta.$$

This is indeed the diffusion equation in $\theta$. This describes the stochastic coupling
between $\rho_1$ and $\rho_2$ in (35).

Throughout the remainder of this paper, the following notation will be used: 

u = i(tf 2 ViVtfiV 2 + i)r( 0-), 
y 2 = U H 2X J r 2 - Hf 2 J 21 )T(Q + ), 

V = M J n ~^ 22 ) 2 5(0) + i(^i +^ 2 )~|(a 12 +a 2 i)-f / u. 

The sine and cosine spectrums are defined, respectively, as

$$\Gamma(\omega) = 2\int_0^{\infty} R(\tau)\sin\omega\tau\, d\tau \quad\text{and}\quad S(\omega) = 2\int_0^{\infty} R(\tau)\cos\omega\tau\, d\tau,$$

where $R(\tau)$ is the autocorrelation of $f(\xi)$, i.e.


*(T) = 


/* 

mm, t) dt, 

J M 


and $\Omega^{\pm} = \omega_1 \pm \omega_2$. Employing this notation yields


-y, 

B— 2 ( a i2 a 2 1) = ~ •> 

C = y+i(a 12 + <x 21 ) = y+|a + , 

A; = - <5j + and X { = A £ + y,-, 

where $\lambda_1$ and $\lambda_2$ are the Lyapunov exponents for the case when the modes in (33) are
decoupled.

Equation (37) can be rewritten as

$$\frac{d}{d\theta}\left\{-\Phi(\theta) F(\theta) + \frac{1}{2}\frac{d}{d\theta}\left[\Psi^2(\theta) F(\theta)\right]\right\} = 0. \qquad (38)$$

Let $\Lambda_{st}(\theta)$ be the term in the brackets. Then, $\Lambda_{st}(\theta)$ must be constant with respect to $\theta$, i.e.
$\Lambda_{st}(\theta) = \Lambda_{st}$. By examining (38), one can see that there may be singularities in the open
interval $(0, \pi/2)$. All possible singular cases must be considered when attempting to
solve this expression for F(6). The location of the singular points and the behaviour of 
the diffusion process in the presence of these singularities is studied by Doyle & Sri 
Namachchivaya (1994). The white noise case was studied separately by Sri Namachchivaya 
et al (1994). The results of the white noise case can also be obtained as the limiting case 
of the above real noise results. 


4.1b Moment Lyapunov exponents for white and real noise cases: Although sample 
solutions may be stable with probability one, the mean square response of the system 
for the same parameter values may grow exponentially. A concise formulation of the 
relation between almost-sure sample stability and the pth mean stability is presented by
Arnold (1991). More extensive coverage for real and white noise cases was given by 
Arnold & Wihstutz (1986). In this section, asymptotic expansions for the moment 
Lyapunov exponent, the exponential rate of growth or decay of the pth mean, under the 
white and real noise assumptions are calculated and the readers are referred to Doyle 
et al (1995) for general calculations of these quantities. 

For the structural problem under consideration, the oscillators of (32) are decoupled
yielding two single-degree-of-freedom systems. Thus, in determining the moment
stability of the four-dimensional system, one may calculate the moment Lyapunov
exponents for each oscillator individually. In the present analysis, the moment
Lyapunov exponents for a general two-dimensional linear system excited by white or
real noise are approximated. These results will be applied to the shallow arch problem
in §4.1c.

By definition the moment Lyapunov exponent is given by

$$g(p) = \lim_{t\to\infty}\frac{1}{t}\log E\|x(t, x_0)\|^p, \qquad p \in \mathbb{R}, \quad x_0 \in \mathbb{R}^2 - \{0\}, \qquad (39)$$

where $g(p)$ is a smooth convex function with $g(0) = 0$ and $g'(0) = \lambda$ and the sign of $g(p)$
defines the exponential stability ($g(p) < 0$) or instability ($g(p) > 0$) of the $p$th moment.

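In order to calculate the moment Lyapunov exponent in the white noise case,
consider the first-order equations after an appropriate transformation, i.e.,

$$dx = Ax\,dt + \sum_{i}\sigma_i B^i x\circ dW_i(t), \qquad x \in \mathbb{R}^2, \qquad (40)$$

where

$$A = \begin{bmatrix} -\delta & -\omega \\ \omega & -\delta \end{bmatrix}, \qquad B^i = [b^i_{jk}],$$

and $W_i$ are independent unit variance Wiener processes. Projecting the first order
equations onto a circle, i.e.,

$$\rho = \ln\|x\|, \qquad \theta = \tan^{-1}(x_2/x_1),$$

$$d\rho = q_0(\theta)\,dt + \sum_{i}\sigma_i q_i(\theta)\circ dW_i(t), \qquad (41)$$

$$d\theta = h_0(\theta)\,dt + \sum_{i}\sigma_i h_i(\theta)\circ dW_i(t), \qquad (42)$$

one can define the moment Lyapunov exponent $g(p)$ as the principal eigenvalue of the
problem on $S^1$ (see, for example, Arnold 1991),

$$L(p)\phi(p) = g(p)\phi(p), \qquad \forall p \in \mathbb{R}, \qquad (43)$$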


where 


Ifp) = $£ + pM + p£ + 


56 ~ h °dd + l * 1 


-V 

ee 


n d i, d 

& ~ q °lid + 

*-£**4 (45) 

In general, the solution of $g(p)$ to all orders of $p$ is difficult to calculate. However,
using the convex and analytic properties of $g(p)$ one can write

$$g(p) = g(0) + p\,g'(0) + (p^2/2)\,g''(0) + O(p^3), \qquad (46)$$

and the associated eigenvalue problem reduces to (Arnold & Wihstutz 1986)

Lip = A —X Lg — g"( 0) - 2{% 4- 2. - A)tp - St , (47) 

where 


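Explicit calculation of various coefficients yields

$$h_0 = \omega, \qquad q_0 = -\delta,$$

$$h_i = \tfrac{1}{2}(b^i_{21} - b^i_{12}) + \tfrac{1}{2}(b^i_{22} - b^i_{11})\sin(2\theta) + \tfrac{1}{2}(b^i_{21} + b^i_{12})\cos(2\theta),$$

$$q_i = \tfrac{1}{2}(b^i_{11} + b^i_{22}) + \tfrac{1}{2}(b^i_{12} + b^i_{21})\sin(2\theta) + \tfrac{1}{2}(b^i_{11} - b^i_{22})\cos(2\theta).$$

Assuming that $\sigma_i$ is of the order $\varepsilon$, i.e., $\sigma_i \to \varepsilon\sigma_i$, one can obtain the asymptotic expansions
of the maximal Lyapunov exponent and $g''(0)$ as

$$\lambda^{\varepsilon} = -\delta + \frac{\varepsilon^2}{8}\sum_{i}\sigma_i^2\left[(b^i_{22} - b^i_{11})^2 + (b^i_{21} + b^i_{12})^2\right],$$

$$g''(0) = \frac{\varepsilon^2}{8}\sum_{i}\sigma_i^2\left[2(b^i_{11} + b^i_{22})^2 + (b^i_{11} - b^i_{22})^2 + (b^i_{12} + b^i_{21})^2\right].$$

Thus, the quadratic expansion of $g(p)$ is

$$g^{\varepsilon}(p) = -p\,\delta + \frac{\varepsilon^2}{8}\sum_{i}\sigma_i^2\left[(b^i_{22} - b^i_{11})^2 + (b^i_{21} + b^i_{12})^2\right]p + \frac{\varepsilon^2 p^2}{16}\sum_{i}\sigma_i^2\left[2(b^i_{11} + b^i_{22})^2 + (b^i_{11} - b^i_{22})^2 + (b^i_{12} + b^i_{21})^2\right].$$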
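
The projection coefficients and the factor 1/8 in the white noise expansion can be checked numerically. The sketch below is an illustration under stated assumptions rather than the authors' calculation: it takes a single noise term, assumes every trigonometric coefficient carries the factor 1/2, and assumes the phase is uniformly distributed on the circle at leading order, so that the Stratonovich-to-Ito drift correction $\tfrac{1}{2}h\,q'$ averages to $\tfrac{1}{8}[(b_{22} - b_{11})^2 + (b_{21} + b_{12})^2]$. The function names and the sample matrix are hypothetical.

```python
import math


def q_coeff(b, theta):
    """Radial coefficient q(theta) = x.Bx / |x|^2 written in double-angle form."""
    (b11, b12), (b21, b22) = b
    return (0.5 * (b11 + b22) + 0.5 * (b12 + b21) * math.sin(2 * theta)
            + 0.5 * (b11 - b22) * math.cos(2 * theta))


def h_coeff(b, theta):
    """Angular coefficient h(theta) = (x1 (Bx)_2 - x2 (Bx)_1) / |x|^2."""
    (b11, b12), (b21, b22) = b
    return (0.5 * (b21 - b12) + 0.5 * (b22 - b11) * math.sin(2 * theta)
            + 0.5 * (b21 + b12) * math.cos(2 * theta))


def direct_projection(b, theta):
    """Same two quantities computed directly from the unit vector x."""
    (b11, b12), (b21, b22) = b
    x1, x2 = math.cos(theta), math.sin(theta)
    y1, y2 = b11 * x1 + b12 * x2, b21 * x1 + b22 * x2
    return x1 * y1 + x2 * y2, x1 * y2 - x2 * y1


def averaged_drift_correction(b, n=360):
    """Average of h(theta) * dq/dtheta over a uniform grid on the circle."""
    (b11, b12), (b21, b22) = b
    total = 0.0
    for k in range(n):
        th = 2.0 * math.pi * k / n
        dq = (b12 + b21) * math.cos(2 * th) - (b11 - b22) * math.sin(2 * th)
        total += h_coeff(b, th) * dq
    return total / n


b = ((0.3, -1.2), (0.7, 0.5))  # arbitrary illustrative matrix
beta = (b[0][1] + b[1][0]) ** 2 + (b[1][1] - b[0][0]) ** 2
print(averaged_drift_correction(b), beta / 4.0)
```

Half of the averaged quantity, multiplied by the noise intensity, is the correction to $-\delta$ in the maximal Lyapunov exponent under these assumptions.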


For the real noise case, consider the linear system of equations

$$\dot{x} = Ax + \varepsilon f(\xi(t)) Bx, \qquad x \in \mathbb{R}^2, \qquad (48)$$

$$d\xi = X_0(\xi)\,dt + \sum_{i} X_i(\xi)\circ dW_i, \qquad (49)$$

where $A$ and $B$ are as defined for (40). The following procedure for calculating the
moment Lyapunov exponent is based heavily on the perturbation method presented in
§4.1a. Projecting the equations of motion onto the circle yields

$$\dot{\rho} = q(\theta) = q_0(\theta) + \varepsilon f(\xi)\, q_1(\theta),$$

$$\dot{\theta} = \omega + \varepsilon f(\xi)\, h_1(\theta),$$

$$d\xi = X_0(\xi)\,dt + \sum_{i} X_i(\xi)\circ dW_i,$$

and the moment Lyapunov exponent $g(p)$ solves (43) with

$$L(p) = L^{\varepsilon} + p\,q(\theta),$$

where $L^{\varepsilon}$ is the generator of the Markov diffusion process involving $\theta$ and $\xi$. We
expand this generator as in §4.1a. For the two-dimensional case, this operator becomes
$L^{\varepsilon} = L^0 + \varepsilon L^1$ with

$$L^0 = G(\xi) + \omega\frac{\partial}{\partial\theta}, \qquad L^1 = f(\xi)\, h_1(\theta)\frac{\partial}{\partial\theta}.$$

The associated eigenvalue problem and corresponding solvability conditions are

$$L\psi = \lambda - q(\theta), \qquad \text{solvability: } \lambda = \langle\mu(\xi), q(\theta)\rangle,$$

$$L\eta = g''(0) - 2(q(\theta) - \lambda)\psi, \qquad \text{solvability: } g''(0) = 2\langle\mu(\xi), (q(\theta) - \lambda)\psi\rangle,$$

where $\psi$ and $\eta$ are as defined for (47). In the above expressions, $\mu(\xi) = \mu(\xi)_0 + \varepsilon\mu(\xi)_1
+ \varepsilon^2\mu(\xi)_2 + \cdots$ is the unique ergodic invariant measure associated with the
generator $L^{\varepsilon}$, i.e. $\mu(\xi)$ solves the Fokker-Planck equation given by

$$L^{\varepsilon *}\mu(\xi) = 0,$$

provided $L^{\varepsilon}$ is hypoelliptic. $\lambda = \lambda^{\varepsilon}$ is the maximal Lyapunov exponent which may be
calculated using the method of §4.1a.

Expanding the equations for $\psi$ and $\eta$ in powers of $\varepsilon$ yields the following set of Poisson
equations:

$$L_0\psi_0 = \lambda_0 - q_0,$$

$$L_0\psi_1 = \lambda_1 - q_1 f(\xi) - L_1\psi_0,$$

$$L_0\psi_2 = \lambda_2 - q_2 - L_1\psi_1 - L_2\psi_0,$$

$$L_0\eta_0 = V_0 - 2(q_0 - \lambda_0)\psi_0,$$

$$L_0\eta_1 = V_1 - 2\left[(q_0 - \lambda_0)\psi_1 - (\lambda_1 - q_1 f(\xi))\psi_0\right] - L_1\eta_0,$$

$$L_0\eta_2 = V_2 - 2\left[(q_0 - \lambda_0)\psi_2 - (\lambda_1 - q_1 f(\xi))\psi_1 - (\lambda_2 - q_2)\psi_0\right] - L_1\eta_1 - L_2\eta_0.$$

The quantities $V_0$, $V_1$ and $V_2$ are the terms in the asymptotic expansion for $g''(0)$, i.e.
$g''(0) = V_0 + \varepsilon V_1 + \varepsilon^2 V_2$. The solvability condition for the $O(\varepsilon^0)$ Poisson equation for $\eta_0$
yields $V_0 = 0$. Using the fact that $q_0 = -\delta$, one can easily verify that $\lambda_0 = -\delta$ and $\lambda_1 = 0$.
Employing these results and the $O(\varepsilon^0)$ equations for $\psi_0$ and $\eta_0$ implies

$$\psi_0 = 1 \quad\text{and}\quad \eta_0 = \text{const}.$$

The Poisson equations reduce to

$$L_0\psi_1 = -q_1 f(\xi),$$

$$L_0\psi_2 = (\lambda_2 - q_2) - L_1\psi_1,$$

$$L_0\eta_1 = V_1 - 2 q_1 f(\xi),$$

$$L_0\eta_2 = V_2 - 2\left[q_1 f(\xi)\psi_1 + (q_2 - \lambda_2)\right] - L_1\eta_1.$$

The solvability conditions for the last two equations yield

$$V_1 = 2\langle\mu_0, q_1 f(\xi)\rangle,$$

$$V_2 = 2\langle\mu_0, (q_2 - \lambda_2) + q_1 f(\xi)\psi_1\rangle + 2\langle\mu_1, q_1 f(\xi)\rangle.$$

Due to the zero mean assumption on $f(\xi)$, $V_1 = 0$. This yields $g''(0) = \varepsilon^2 V_2$.

Define

$$\beta = (b_{12} + b_{21})^2 + (b_{22} - b_{11})^2 \quad\text{and}\quad \kappa = 2(b_{22} + b_{11})^2.$$

Employing these quantities in the expression for $V_2$ yields

$$g''(0) = \varepsilon^2\left\{\frac{\beta}{8}S(2\omega) + \frac{\kappa}{8}S(0)\right\}.$$

Recall the fact that

$$g'(0) = \lambda = \lambda_0 + \varepsilon^2\lambda_2 = -\delta + \frac{\varepsilon^2}{8}\beta S(2\omega).$$

Solving for the terms in the expression for $V_2$ then yields the following expression for $g''(0)$:

$$g''(0) = \frac{\varepsilon^2}{8}\{\beta S(2\omega) + \kappa S(0)\}.$$

Finally, the moment Lyapunov exponent is given by

$$g(p) = -\delta p + \frac{\varepsilon^2 p}{16}\{p\,\kappa S(0) + (p + 2)\beta S(2\omega)\}. \qquad (50)$$

Detailed calculations of the moment Lyapunov exponents for two-dimensional systems
under real noise excitation are given by Doyle et al (1995).
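
A short sketch of how (50) might be evaluated in practice is given below. It assumes the quadratic form $g(p) = -\delta p + (\varepsilon^2 p/16)\{p\kappa S(0) + (p + 2)\beta S(2\omega)\}$ with the spectral values $S(0)$ and $S(2\omega)$ supplied as plain numbers; the function names and the parameter values are hypothetical and chosen only for illustration. The nonzero root of $g(p) = 0$ marks the moment order at which stability is lost.

```python
def moment_lyapunov_real_noise(p, delta, eps, beta, kappa, s0, s2w):
    """Quadratic approximation (50) of the moment Lyapunov exponent g(p)."""
    return -delta * p + eps ** 2 * p / 16.0 * (p * kappa * s0 + (p + 2.0) * beta * s2w)


def maximal_lyapunov_exponent(delta, eps, beta, s2w):
    """Slope of g(p) at p = 0, i.e. g'(0) = -delta + eps^2 beta S(2w) / 8."""
    return -delta + eps ** 2 * beta * s2w / 8.0


def moment_stability_index(delta, eps, beta, kappa, s0, s2w):
    """Nonzero root p* of g(p) = 0 for the quadratic approximation."""
    return (16.0 * delta / eps ** 2 - 2.0 * beta * s2w) / (kappa * s0 + beta * s2w)


# Illustrative parameter values (assumptions, not taken from the paper).
params = dict(delta=0.1, eps=0.5, beta=1.0, kappa=2.0, s0=1.0, s2w=0.8)
lam = maximal_lyapunov_exponent(params["delta"], params["eps"],
                                params["beta"], params["s2w"])
p_star = moment_stability_index(**params)
print(lam, p_star, moment_lyapunov_real_noise(2.0, **params))
```

For these values the sample paths are stable (negative slope at the origin) while the second moment grows, because 2 exceeds the computed root.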

4.1c Shallow arch under stochastic excitation: In this section, the results obtained
previously are applied to study the dynamics of the shallow arch structure when the
time-dependent portion of the lateral loading is white noise, i.e. $f(t) = w(t)$. In this case, the
stationary solutions of (3) are given by $Q_1 = \xi(t)$ and $Q_2 = 0$. Perturbing the equations of
motion about this solution by substituting

$$Q_1 = \xi(t) + \eta_1(t) \quad\text{and}\quad Q_2 = 0 + \eta_2(t)$$

yields the following set of linearized perturbation equations:

$$\ddot{\eta}_i + \omega_i^2\eta_i + \beta_i\dot{\eta}_i + \sum_{j=1}^{2} k_{ij}\,\eta_j\, f(\xi(t)) = 0, \qquad (51)$$

where $k_{11} = 3$, $k_{22} = 4$ and $k_{ij} = 0$ for $i \neq j$, and $f(\xi(t)) = \xi(t)^2$. Since $f(\xi(t))$ results from the
nonlinearities in the original equations, it can be assumed to be of $O(\varepsilon)$, i.e. $f(\xi(t)) \to \varepsilon f(\xi(t))$.
Furthermore, the dissipation may be rescaled as $\beta_i \to \varepsilon^2\beta_i$. Using this rescaling, (51) can be
brought to the form of (33) with

$$A_0 = \begin{bmatrix} 0 & \omega_1 & 0 & 0 \\ -\omega_1 & 0 & 0 & 0 \\ 0 & 0 & 0 & \omega_2 \\ 0 & 0 & -\omega_2 & 0 \end{bmatrix}, \qquad A_1 = \begin{bmatrix} 0 & 0 & 0 & 0 \\ 0 & \beta_1 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & \beta_2 \end{bmatrix}$$

and

$$B = \begin{bmatrix} 0 & 0 & 0 & 0 \\ -k_{11}/\omega_1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & -k_{22}/\omega_2 & 0 \end{bmatrix}.$$

These equations describe a set of uncoupled oscillators, i.e. there is no $\theta$ process coupling
the modes at the linear level. In this case, as stated in section 4.1a, the Lyapunov exponents
are given by


,=u - 

The maximal Lyapunov exponent is then

$$\lambda^{\varepsilon} = \max(\lambda_1, \lambda_2).$$

In order to calculate the results pertaining to the original problem of the shallow arch,
make use of (51) along with the appropriate parameters. It is obvious that for the arch
problem, the linear variational equations about the stationary solutions are decoupled as
shown in (51). For each degree of freedom, the parametric coefficients can be expressed as
$b^i_{21} = -k_{ii}/\omega_i$ and the remaining $b^i_{jk}$'s are identically zero, where the superscript $i$ indicates
the first or second mode. The moment Lyapunov exponents of the $i$th oscillator for the
white and real noise cases can be written as


and 


g i (p) = e 2 p\-8 + (2 + p) < ^-(—'\ 1, 
1 16 \cOiJ J 

ft(p) = e 2 p -J - d + (2 + p) ^ 


respectively. 


4.2 Nonlinear results based on averaging 


The aim of this section is to shed some light on the so called stochastic bifurcation. The 
questions that must be answered are, (i) What are the hypotheses for the occurrence of 
stochastic bifurcation? and (ii) What does one see as an outcome of the bifurcation 
phenomena, both mathematically and physically? There has been considerable work in this 
line from both theoretical and physical points of view. The effects of stochastic perturbations on 
a dynamical system exhibiting Hopf bifurcation have been studied by Graham (1982), Baris 
et al (1982) and Sri Namachchivaya (1988,1990) to name a few. The original nonlinear equation 
cannot be solved explicitly unless there are some assumptions made with respect to the 
nonlinearities and the noise terms. These assumptions are based on the physics of the 
problem and the phenomena that one is interested in observing. To this end, this section 
will focus on the effects of noise on systems that are close to certain bifurcation points. 

The method of averaging involves the convergence of a parameterized sequence of 
processes to a limit process in some weak sense under strong mixing conditions on the 
noise process. The so called method of averaging, which was obtained on a physical basis 
by Stratonovich has been widely used to approximate physical processes with certain 
smoothness, differentiability, and mixing conditions by a diffusive Markov process. 
A mathematically rigorous proof of the main results was given by Khas’minskii (1968). 
Since then several authors have developed the averaging theory in various directions 
using various assumptions. This method has some advantages because the limit process
obtained by this procedure is more tractable mathematically, for two main reasons:
(i) it approximates a real noise with strong mixing conditions by a Markov
diffusive process, and (ii) owing to the averaging procedure the averaged equations have an
$S^1$ symmetry, which allows one to examine the amplitudes by themselves and reduces
the dimension of the system. However, the disadvantage of this procedure is the weak
convergence to the limit process. This allows one only to compare the original system 
with the averaged system from the point of view of the distributions. 

4.2a Nonlinear stationary response : In this section, some of the results of Sri 
Namachchivaya (1990) and Arnold et al (1995) that are available based on the method 
of stochastic averaging are summarized. This will provide some general trend of the 
actual behaviour. We shall first recast the original equations in the first order form

$$\dot{x} = A_0 x + \sum_{i=1}^{2}\xi_i(t) A_i x + F(x), \qquad x \in \mathbb{R}^2, \qquad (52)$$

where

$$A_0 = \begin{bmatrix} \delta'\eta & \omega \\ -\omega & \delta'\eta \end{bmatrix}, \qquad (53)$$

where $\eta$ is a system parameter, the matrices $A_i$ are assumed symmetric and $\xi_1$ and $\xi_2$ are
assumed independent stationary noise processes with zero mean. The method of
averaging allows more general processes provided the dependence between $\xi(\omega, t)$ and
$\xi(\omega, t + \tau)$ becomes weaker in some sense with increasing $\tau$, where the $\xi_i$ are taken to be real
processes. Using an appropriate linear transformation $x \to Sx$ where


$$S = \begin{bmatrix} \cos(\omega t) & -\sin(\omega t) \\ \sin(\omega t) & \cos(\omega t) \end{bmatrix} \qquad (54)$$

in (52) and making an appropriate scaling for the noise and dissipative terms yields a set
of equations in so called standard form

$$\dot{x} = \varepsilon f(x, \xi, t), \qquad (55)$$

where the explicit form of $f$ is omitted and can be easily calculated using this transformation.

The method of stochastic averaging can be applied to the above equations, provided 
the noise processes are stationary with zero mean and correlation functions satisfy the 
strong mixing conditions. Thus by applying the results of Khas’minskii (1968) the 
solution of this equation can be uniformly approximated in a weak sense on the time 

interval of by a Markov process which is continuous w.p.1 and satisfies the
following Stratonovich equations

$$dx = \left[(\lambda_0 - R\|x\|^2) I x + (m_0 - S\|x\|^2) J x\right]dt + (\sigma^0_{aa})^{1/2} I x\circ dW_1(t) - (\sigma^0_{\phi\phi})^{1/2} J x\circ dW_2(t), \qquad (56)$$


where 


V- = 6 + \I & s i(2co), m° = cD'n- l -Z PiTiilco), 


R = 


16c a 


2,XiX 2 (F 2,XiXi ^2 ,x 2 xJ x lX2 (Fi t x lXl + F 1,A 


’ F 4- F F 1 

I,***! 2,x 2 x 2 — 1 l.xixi-* 2,XiXii 


L , 

"b Jg c l.x,XiXi 4" F l.xiATjX! 4" F 2,x lX iX2 F 2,.* 2 X 2 X 2 } 


s= 


32cu 




2,X2X2 


^F 2 ,x iXi )(F2,X2X2 ■b F2, XlXl ) 


fiF i, X 2X2 F i iXlXl ^F 2xiX2 )(F lXiXi + F lXiXt ) 

+ F ZxiXi — F 2x2X2 )(F 2yX2X2 — F 2xiXi — F 2 XiX2 ) 
- ~ F l<XlXl + 2 F 2 ' XlX2 )(F UX2Xi - F UxiXi + F 2>XiX2 )-\ } 


F {Fi' X2XlX2 + F i x ^ X Xi F 


2 ,XiX 2 X 2 F 2 ,XiXiXi } ’ 



$$\sigma^0_{aa} = \frac{1}{8}\sum_{i}\left[\kappa_i S_i(0) + \beta_i S_i(2\omega)\right], \qquad \sigma^0_{\phi\phi} = \frac{1}{8}\sum_{i}\beta_i S_i(2\omega),$$

$$\kappa_i = 2(A_{i22} + A_{i11})^2, \qquad \beta_i = (A_{i12} + A_{i21})^2 + (A_{i22} - A_{i11})^2,$$

and $F_{i,x_j x_k}$ denotes $(\partial^2 F_i/\partial x_j\,\partial x_k)(0, 0)$. The stationary density function for the nonlinear
problem can be written as $p(x_1, x_2) = p(r, \phi)\,|J|$ where $J = 1/r$. Thus, an explicit
expression for the density function in terms of $x_1$ and $x_2$ is

$$P(x_1, x_2) = 2\,\frac{A^{\nu}}{\Gamma(\nu)}\,(x_1^2 + x_2^2)^{\nu - 1}\exp\left[-A(x_1^2 + x_2^2)\right],$$

provided $\nu$ is positive. In the above expression,

$$\alpha = g'(0) = \lambda_0, \qquad \gamma = g''(0) = \sigma^0_{aa}, \qquad \nu = \alpha/\gamma, \qquad A = R/\sigma^0_{aa}.$$

It is clear from the expression that the change in the shape of the density function
depends on $\nu$, thus on the quantities $g'(0)$ and $g''(0)$. The above results indicate that for
$\nu < 0$, although the probability density may exist, it is not normalizable. For $0 < \nu < 1$,
a normalizable probability density exists with infinite value at the origin. At $\nu = 1$, the
density exhibits a finite maximum at the origin with value $2A$. For values of $\nu > 1$, there
is a maximum value of the probability density away from zero. These are similar to the
amplitude equations discussed by Stratonovich (1967).
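
The dependence of the density shape on $\nu$ can be explored numerically. The sketch below assumes the radial form $p(r) = 2\,(A^{\nu}/\Gamma(\nu))\,r^{2\nu - 1}\exp(-Ar^2)$, which is the normalization consistent with the amplitude equation $dr = [\lambda_0 r - Rr^3]dt + (\sigma^0_{aa})^{1/2} r\circ dW_1$; the helper names, the midpoint integration rule and the parameter values are assumptions introduced here. It checks that the density integrates to one, that $E[r^2] = \nu/A$, and where the planar density peaks when $\nu > 1$.

```python
import math


def radial_density(r, nu, A):
    """Assumed stationary amplitude density 2 A^nu / Gamma(nu) r^(2 nu - 1) exp(-A r^2)."""
    return 2.0 * A ** nu / math.gamma(nu) * r ** (2.0 * nu - 1.0) * math.exp(-A * r * r)


def integrate(f, a, b, n=20000):
    """Composite midpoint rule; midpoints avoid the possible singularity at r = 0."""
    h = (b - a) / n
    return h * sum(f(a + (k + 0.5) * h) for k in range(n))


def mode_radius(nu, A):
    """Radius maximizing the planar density (x1^2 + x2^2)^(nu - 1) exp(-A r^2)."""
    return math.sqrt((nu - 1.0) / A) if nu > 1.0 else 0.0


nu, A = 1.5, 2.0  # illustrative values only
mass = integrate(lambda r: radial_density(r, nu, A), 0.0, 10.0)
mean_square = integrate(lambda r: r * r * radial_density(r, nu, A), 0.0, 10.0)
print(mass, mean_square, mode_radius(nu, A))
```

For $\nu \le 1$ the helper `mode_radius` returns zero, matching the statement that the peak sits at the origin until $\nu$ exceeds one.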


4.2b Stability of stationary solutions: Let us assume that the stationary solution of
$x$ is given by $\eta(t)$. Then the linear variational equations about the stationary solution
are obtained by putting $x = \eta(t) + u$, as

$$du = \left[(\lambda_0 - 2R\|\eta\|^2) I u + (m_0 - 2S\|\eta\|^2) J u + (\eta_2^2 - \eta_1^2) M u - 2\eta_1\eta_2 N u\right]dt + (\sigma^0_{aa})^{1/2} I u\circ dW_1(t) - (\sigma^0_{\phi\phi})^{1/2} J u\circ dW_2(t), \qquad (57)$$

where


-R 
S R 



0 1 

-1 0 


In order to determine the almost-sure stability of $\eta(t)$ one can examine the sign of the
top Lyapunov exponent of the above equations. To this end, projecting the above
equations on to a circle, by letting

$$\rho = \ln\|u\| \quad\text{and}\quad \theta = \tan^{-1}(u_2/u_1),$$

yields

$$d\rho = q_0(\eta, \theta)\,dt + (\sigma^0_{aa})^{1/2}\circ dW_1(t),$$

$$d\theta = h_0(\eta, \theta)\,dt + (\sigma^0_{\phi\phi})^{1/2}\circ dW_2(t),$$

where

$$q_0(\eta, \theta) = \lambda_0 - 2R\|\eta\|^2 - (R^2 + S^2)^{1/2}\left\{(\eta_1^2 - \eta_2^2)\cos(2\theta + 2\chi) + 2\eta_1\eta_2\sin(2\theta + 2\chi)\right\},$$

$$h_0(\eta, \theta) = -m_0 - 2S\|\eta\|^2 - (R^2 + S^2)^{1/2}\left\{(\eta_1^2 - \eta_2^2)\sin(2\theta + 2\chi) - 2\eta_1\eta_2\cos(2\theta + 2\chi)\right\},$$
and $2\chi = \tan^{-1}(S/R)$. Since the averaged equations governing the stationary solution
$x = \eta(t)$ have an $S^1$ symmetry, letting $\eta_1 = r\cos\phi$ and $\eta_2 = r\sin\phi$ and $\psi = \theta - \phi + \chi$
yields

$$d\rho = \left[\lambda_0 - 2Rr^2 - r^2(R^2 + S^2)^{1/2}\cos(2\psi)\right]dt + (\sigma^0_{aa})^{1/2}\circ dW_1(t),$$

$$d\psi = \left[Sr^2 + r^2(R^2 + S^2)^{1/2}\sin(2\psi)\right]dt,$$

$$dr = \left[\lambda_0 r - Rr^3\right]dt + (\sigma^0_{aa})^{1/2}\, r\circ dW_1(t),$$

$$d\phi = \left[-m_0 + Sr^2\right]dt + (\sigma^0_{\phi\phi})^{1/2}\circ dW_2(t).$$

It is clear that the Lyapunov exponents are given by

$$\lambda = \lambda_0 - 2R\,E[r^2] - (R^2 + S^2)^{1/2} E[r^2\cos(2\psi)],$$

and depending on the joint invariant measure $p(r, \phi)$ one can either obtain the
maximum or the minimum of the Lyapunov exponent. But due to the $S^1$ symmetry of
the averaged equations the amplitude process by itself is a Markov diffusive process,
and the stationary density is given by

$$p(r) = 2\,\frac{A^{\nu}}{\Gamma(\nu)}\, r^{2\nu - 1}\exp\left[-Ar^2\right], \qquad \nu = \alpha/\gamma,$$

provided $\nu$ is positive, where $\nu$ and $A$ are as defined previously.

However, the stationary states of the fictitious angle $\psi$ for a given value of the
amplitude $r$ are given by the angle

$$\psi = \sin^{-1}[-S/(R^2+S^2)^{1/2}] + n\pi, \quad n = 0, 1, 2, \ldots,$$

and the stability of the stationary states is given by the sign of $\cos(2\psi)$. The signs of the
constants R and S define the location of the fixed points on the circle. In general, for the
deterministic case, it is clear that for a stable periodic orbit it is essential to keep $R > 0$.
For convenience, if one assumes $S \geq 0$, then there are four distinct points on the circle,
namely:


^5,1=1 + ^°> <A s ,2 = y+ = { l / u,2 = 2n -'l /0 ■ 

It is obvious that for all initial values, except for $\psi = \psi_{u,i}$, the flow on the
circle will reach one of the two stable fixed points $\psi_{s,i}$. It is well known that the
stationary measure $p(r,\psi) := p(r)\,p(\psi|r)$, where $p(r)$ is the marginal and $p(\psi|r)$ is the
conditional density. Thus one can define

D(r^=l pW{ ^.i + ^> i = U, 

= <!/„■„ i = 1,2. 

The Lyapunov exponents are then

$$\lambda_{1,2} = \lambda_0 - 2R\,E[r^2] - E[r^2](\pm R),$$

where the negative sign corresponds to the maximal exponent since it is calculated from
the invariant measure associated with the stable fixed point. By evaluating the expected
value of the amplitude process, i.e. $E[r^2] = \lambda_0/R$, it is obvious that the Lyapunov
exponents for the trivial and nontrivial solutions, respectively, are

$$\lambda_1^0 = \lambda_2^0 = \lambda_0, \qquad \lambda_2 = 0, \quad \lambda_1 = -2\lambda_0,$$
Figure 5. The moment Lyapunov exponent g(p) for v — 1.

where 1 0 := f + k( 2co). The sum of the Lyapunov exponents can be calculated using the 
trace formula from the Stratonovich equation for the perturbation u 

$$\Sigma = \lambda_1 + \lambda_2 = 2(\lambda_0 - 2R\,E[\|\eta\|^2]) = -2\lambda_0,$$

which matches the sum of the individual exponents obtained from the invariant
measures. This implies that the bifurcating solution is asymptotically stable. Such
results have been obtained by Sri Namachchivaya (1990) by perturbing the amplitude
equation itself.


4.3 Discussion of results 

As indicated earlier, the linear problem has been solved using asymptotic and averaging
methods while the nonlinear problem has been solved only by the method of stochastic
averaging. The results presented in the stochastic part of this paper based on these two
methods will be discussed and compared in this section only for systems in $\mathbb{R}^2$.

The results in $\mathbb{R}^2$ are shown in figures 5-8. Figures 5 and 6 show the moment
Lyapunov and Lyapunov exponents obtained using asymptotic methods while figures 7
and 8 depict the results obtained using averaging. Due to symmetry in $x_1$ and $x_2$, the
stationary density is shown only as a function of $x_1$. The top Lyapunov exponent for



Figure 6. Lyapunov exponents and asymptotic results.

Figure 7. Stationary probability density function $p(x_1, x_2)$.


the trivial solution of the averaged system, $\lambda_1^0$, is identical to that of the original system
using the asymptotic analysis. However, the second Lyapunov exponent of the
averaged system is the same as the top Lyapunov exponent due to the existence of $S^1$
symmetry in the averaged equations. This symmetry of the averaged system erroneously
predicts that the Lyapunov exponents will cross the imaginary axis simultaneously.
However, the asymptotic results, from the trace formula, accurately predict two distinct
crossings with $\lambda_1 + \lambda_2 = -2b$. For consistency, $\lambda_1^0$ and $\lambda_2^0$ obtained from the
asymptotic analysis can be expressed in terms of the parameter $\nu$ as

x° = vg"(0) and X° 2 = vg"{0)-[3S(2co)/4. 

The moment Lyapunov exponents are the same for the averaged and original 
equations. Although the moment Lyapunov exponents are obtained from the linear 
problem, it has been shown that they play an important role in predicting the 
parameter value necessary for the existence of a normalizable density function for the 
nonlinear response. The relationship between the parameter value at which the 
stationary density attains a finite maximum and the zero of the moment Lyapunov 
exponent evaluated at $-d$, where $d$ is the dimension of the problem, was first shown by
Baxendale (1990) and Arnold (1984). It is interesting to note that the value of $\nu = 1$ at
which the density first has a finite maximum corresponds to the value of $\nu$ for which
$g(-2) = 0$. In addition, a nontrivial normalizable density is shown to exist for positive $\nu$.
Since the trivial solution is asymptotically stable for v < 0, one expects a nontrivial 





Figure 8. Lyapunov exponents $\lambda_1^0$ and $\lambda_2^0$: averaging results.

solution to exist only for $\nu > 0$. This indicates that the densities obtained from
averaging preserve these transitions in the system behaviour although some other
details may be lost due to averaging (see Arnold et al 1995, for details). It is clear from
the analysis that, for the deterministic system, a Hopf bifurcation takes place at $\delta = 0$.
However, for the perturbed system a bifurcation of a nontrivial measure from the
trivial measure $p = \delta_0$ (where $\delta_0$ denotes the Dirac measure at the origin) takes place at
v = 0(<5 = fS( 2co)/8). This new measure displays a peak away from the origin only for 
v> 1 (<5 < —kS{ 0)/8). This second transition can be predicted from the moment 
Lyapunov exponent evaluated at $-d$, and this corresponds to Baxendale’s large
deviation phenomenon.

It is obvious from the moment Lyapunov exponent diagram, figure 5, that there are
parameter values (for example, $\nu = -1$) at which $\lambda_1^0 < 0$, indicating that the system is
sample stable, while $g(p) > 0$ for large $p$ (for example, $p > 2$), indicating that the pth mean
response is unstable. This implies that, although $|x(t; x_0)| \to 0$ as $t \to \infty$ with probability
one at an exponential rate $\lambda_1^0$, there is a small probability that $|x(t; x_0)|$ is large. This
makes the expected value of this rare event large for large values of $p$ and results in the
pth mean instability.
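
The coexistence of sample stability and pth mean instability can be illustrated on a much simpler system than the one treated in this paper. The sketch below assumes the scalar Itô equation dx = a x dt + σ x dW, which is not the structural model analysed above; for it the Lyapunov exponent a − σ²/2 and the moment Lyapunov exponent g(p) = p(a − σ²/2) + p²σ²/2 are known in closed form. The function names and parameter values are illustrative only.

```python
def sample_lyapunov_exponent(a, sigma):
    """Almost-sure growth rate of dx = a*x*dt + sigma*x*dW (Ito form)."""
    return a - 0.5 * sigma ** 2


def moment_lyapunov_exponent(p, a, sigma):
    """g(p) = lim (1/t) ln E|x(t)|^p for the same scalar equation."""
    return p * (a - 0.5 * sigma ** 2) + 0.5 * p ** 2 * sigma ** 2


# Illustrative (assumed) parameters: the drift is weaker than the noise.
a, sigma = 0.25, 1.0
lam = sample_lyapunov_exponent(a, sigma)      # negative: sample stable
p_crit = 1.0 - 2.0 * a / sigma ** 2           # nonzero root of g(p)

for p in (0.25, p_crit, 2.0):
    print(p, moment_lyapunov_exponent(p, a, sigma))
```

For these values the trajectories decay with probability one, yet every moment of order above p_crit grows, which is the same qualitative picture as described for figure 5.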


5. Conclusions 

In the first part of this paper, global bifurcations that arise in the presence of
internal and parametric resonances for a shallow arch structure under periodic forcing
are examined. It is found that, in the neighbourhood of 1:2 internal resonance in the
absence of dissipation, the shallow arch exhibits a typical KAM phase portrait. The
analysis of similar structural problems in the presence of 1:1 internal resonance and
dissipation indicates Šilnikov-type behaviour in the response. These results are
presented by Malhotra & Sri Namachchivaya (1995).

The second half of this paper examines the response, stability and bifurcation 
behaviour of structural systems subjected to white or real noise excitation. The 
maximal Lyapunov exponent and moment Lyapunov exponent are approximated 
using a perturbation approach. This approach is described in more detail in Arnold
et al (1986), Doyle & Sri Namachchivaya (1994) and Doyle et al (1995). The behaviour
of the bifurcated solution is examined using the method of stochastic averaging. 

The practice of examining just the qualitative changes in a family of stationary density 
functions of the Fokker-Planck equation (for example, the transition from a single 
peak to double peak or crater type density function) is not enough to understand 
stochastic bifurcations. This is primarily due to the fact that solutions of the Fokker- 
Planck equation measure the proportion of time spent in a volume element by the 
one-point motion. These have little to do with the asymptotic behaviour and its 
stability which are obtained from two-point motions, hence the Lyapunov exponents. 
Thus, one needs to establish a procedure for finding nontrivial invariant measures for the 
original nonlinear problem without averaging and determine their stability properties. 


The authors would like to acknowledge the support of the Air Force Office of Scientific
Research through grant 93-0063 monitored by Dr Spencer Wu and the National
Science Foundation through grant MSS-PYI-90-57437.
Analysis of nonlinear structural dynamics by the concept of
stability measure 

W SCHIEHLEN and B HU 

Institute B of Mechanics, University of Stuttgart, D-70550 Stuttgart, 
Germany 

Abstract. Engineering structures may collapse under dynamic loading,
resulting in large displacement motions. The generation of the equations of
motion, and an approach for the stability assessment of nonlinear vibrations
are presented. The Duffing oscillator is used as an example for the
application of the method presented.

Keywords. Multibody systems; stability assessment; Duffing oscillator. 


1. Introduction 

Mechanical systems like collapsing engineering structures can be modelled properly 
as multibody systems for dynamical analysis, see Sedlacek & Neuenbaus (1993). 
The complexity of the dynamical equations called for the development of computer- 
aided formalisms a quarter of a century ago. The theoretical background is 
today available from a number of textbooks authored e.g. by Wittenburg (1977), 
Schiehlen (1986), Roberson & Schwertassek (1988), Nikravesh (1988), Haug (1989) and 
Shabana (1989). 

In addition, a number of commercially distributed computer codes were developed, 
a summary of which is given in the Multibody Systems Handbook edited by Schiehlen 
(1990). The computer codes available show different capabilities; some of them generate 
only the equations of motion in numerical or symbolical form, respectively, some of 
them provide numerical integration and simulation codes, too. Moreover, there are 
also extensive software systems in the market which offer graphical data input, 
animation of body motions and automated signal data analysis. 

For the analysis of nonlinear systems in recent years new mathematical tools have 
been developed, see e.g. Holmes & Guckenheimer (1983), Moon (1983), Schiehlen 
(1993a) and Kreuzer & Schmidt (1993). However, these sophisticated tools are restricted
mostly to low dimensional systems hardly found in structural dynamics. Therefore,
the concept of stability assessment is introduced, see Schiehlen (1993b). In this paper 
this concept is applied to the Duffing oscillator representing a wide range of dynamic 
systems with progressive restoring forces. 
2. Modelling of multibody systems

According to the free body diagram of a mechanical system, first, all constraints are
omitted and the system of p bodies holds 6p degrees of freedom. The position of the
system is given relative to the inertial frame by the 3 x 1 translation vector

$$r_i = [r_{i1}\ r_{i2}\ r_{i3}]^T, \quad i = 1(1)p, \tag{1}$$

of the centre of mass $C_i$ and the 3 x 3 rotation tensor

$$S_i = S_i(\alpha_i, \beta_i, \gamma_i), \tag{2}$$

written down for each body. The rotation tensor $S_i$ depends on three angles $\alpha_i$, $\beta_i$, $\gamma_i$ and
corresponds with the direction cosine matrix relating the inertial frame I and the
body-fixed frame i to each other. The 3p translational coordinates and the 3p rotational
coordinates (angles) can be summarized in a 6p x 1 position vector

$$x = [r_{11}\ r_{12}\ r_{13}\ r_{21}\ \cdots\ \alpha_p\ \beta_p\ \gamma_p]^T. \tag{3}$$

Equations (1) and (2) read now

$$r_i = r_i(x), \quad S_i = S_i(x). \tag{4}$$

Secondly, the q holonomic, rheonomic constraints are added to the mechanical system
given explicitly by

$$x = x(y, t), \tag{5}$$

where the f x 1 position vector

$$y = [y_1\ y_2\ y_3\ \cdots\ y_f]^T \tag{6}$$

is used summarizing the f generalized coordinates of the system. The number of
generalized coordinates corresponds to the number of degrees of freedom, $f = 6p - q$,
with respect to the system’s position. Then, translation and rotation of each body
follow from (4) and (5) as

$$r_i = r_i(y, t), \quad S_i = S_i(y, t), \tag{7}$$

and the velocities are found by differentiation with respect to the inertial frame:

$$v_i = \dot{r}_i = \frac{\partial r_i}{\partial y^T}\dot{y} + \frac{\partial r_i}{\partial t} = J_{Ti}(y, t)\,\dot{y} + \bar{v}_i(y, t), \tag{8}$$

$$\omega_i = \dot{s}_i = \frac{\partial s_i}{\partial y^T}\dot{y} + \frac{\partial s_i}{\partial t} = J_{Ri}(y, t)\,\dot{y} + \bar{\omega}_i(y, t). \tag{9}$$

The 3 x f Jacobian matrices $J_{Ti}$ and $J_{Ri}$ defined by (8) and (9) characterize the virtual
translational and rotational displacement of the system, respectively. They are also
needed later for the application of d’Alembert’s principle. The infinitesimal 3 x 1
rotation vector $s_i$ used in (9) follows analytically from the corresponding infinitesimal
skew-symmetrical 3 x 3 rotation tensor. However, the matrix $J_{Ri}$ can also be found by
a geometrical analysis of the angular velocity vector $\omega_i$ with respect to the angles
$\alpha_i$, $\beta_i$, $\gamma_i$, see e.g. Schiehlen (1986).
The accelerations are obtained by a second differentiation with respect to the inertial
frame:

$$a_i = J_{Ti}(y, t)\,\ddot{y} + \frac{\partial v_i}{\partial y^T}\dot{y} + \frac{\partial v_i}{\partial t}, \tag{10}$$

$$\boldsymbol{\alpha}_i = J_{Ri}(y, t)\,\ddot{y} + \frac{\partial \omega_i}{\partial y^T}\dot{y} + \frac{\partial \omega_i}{\partial t}. \tag{11}$$

For scleronomic constraints the partial time-derivatives in (8), (9) and (10), (11) vanish.

For the application of Newton’s and Euler’s equations to multibody systems the free 
body diagram has to be used again. Now the rigid bearings and supports are replaced 
by adequate constraint forces and torques as discussed later in this section. 

Newton’s and Euler’s equations read for each body in the inertial frame

$$m_i a_i = f_i^e + f_i^r, \quad i = 1(1)p, \tag{12}$$

$$I_i \boldsymbol{\alpha}_i + \tilde{\omega}_i I_i \omega_i = l_i^e + l_i^r, \quad i = 1(1)p. \tag{13}$$

The inertia is represented by the mass $m_i$ and the 3 x 3 inertia tensor $I_i$ with respect to
the centre of mass $C_i$ of each body. The external forces and torques in (12) and (13) are
composed of the 3 x 1 applied force vector $f_i^e$ and torque vector $l_i^e$ due to springs,
dampers, actuators, weight etc. and of the 3 x 1 constraint force vector $f_i^r$ and torque
vector $l_i^r$. All torques are related to the centre of mass $C_i$. The applied forces and
torques, respectively, depend on the motion by different laws and they may be coupled
to the constraint forces and torques in the case of friction.

The proportional forces are characterized by the system’s position and time
functions

$$f_i^e = f_i^e(x, t). \tag{14}$$

For example, conservative spring and weight forces as well as purely time-varying
forces are proportional forces.

The proportional-differential forces depend on the position and the velocity,

$$f_i^e = f_i^e(x, \dot{x}, t). \tag{15}$$

A parallel spring-dashpot configuration is a typical example of this kind of force.

The Newton-Euler equations of the complete system are summarized in matrix
notation by the following vectors and matrices. The inertia properties are written in the
6p x 6p diagonal matrix

$$\bar{M} = \mathrm{diag}\{m_1 E\ \ m_2 E\ \cdots\ I_1\ \cdots\ I_p\}, \tag{16}$$

where the 3 x 3 identity matrix E is used. The 6p x 1 force vectors $\bar{q}^c$, $\bar{q}^e$, $\bar{q}^r$ representing
the Coriolis forces, the ideal applied forces and the constraint forces, respectively, are
given by the following scheme,

$$\bar{q} = [f_1^T\ f_2^T\ \cdots\ l_1^T\ \cdots\ l_p^T]^T. \tag{17}$$

Further the 6p x f matrix $\bar{J}$ is introduced as a global matrix

$$\bar{J} = [J_{T1}^T\ J_{T2}^T\ \cdots\ J_{R1}^T\ \cdots\ J_{Rp}^T]^T. \tag{18}$$
Now, the Newton-Euler equations can be represented as follows for holonomic
systems in the inertial frame

$$\bar{M}\bar{J}\ddot{y} + \bar{q}^c(y, \dot{y}, t) = \bar{q}^e(y, \dot{y}, t) + \bar{q}^r. \tag{19}$$

Holonomic systems with proportional or proportional-differential forces result in
ordinary multibody systems.

The equations of motion of holonomic systems are found according to d’Alembert’s
principle by premultiplication of (19) with $\bar{J}^T$ as

$$M(y, t)\,\ddot{y} + k(y, \dot{y}, t) = q(y, \dot{y}, t). \tag{20}$$

Here, the number of equations is reduced from 6p to f, the f x f inertia matrix $M(y, t)$
is completely symmetrized, $M(y, t) = \bar{J}^T \bar{M} \bar{J} > 0$, and the constraint forces and torques
are eliminated. The remaining f x 1 vector k describes the generalized Coriolis forces
and the f x 1 vector q includes the generalized applied forces.
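
The reduction from 6p to f equations can be made concrete for a single rigid body. The following sketch assumes a planar pendulum (p = 1, f = 1) whose centre of mass sits at r = [l sin y, -l cos y, 0] and which rotates about the inertial z-axis; the function names, the diagonal inertia tensor and the numerical values are illustrative assumptions, not taken from the paper. It builds the global Jacobian and the 6 x 6 inertia matrix and evaluates the reduced inertia matrix by the premultiplication described above.

```python
import math


def matmul(a, b):
    """Plain-Python matrix product of two nested lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def transpose(a):
    return [list(row) for row in zip(*a)]


def pendulum_reduced_inertia(mass, length, inertia_diag, y):
    """Reduced 1 x 1 inertia matrix J^T M J of a planar pendulum.

    Assumed model: r = [l sin y, -l cos y, 0], rotation about the z-axis.
    """
    # Translational Jacobian: derivative of r with respect to y.
    j_t = [length * math.cos(y), length * math.sin(y), 0.0]
    # Rotational Jacobian: the angular velocity is y_dot about the z-axis.
    j_r = [0.0, 0.0, 1.0]
    j_bar = [[entry] for entry in j_t + j_r]            # 6 x 1 global Jacobian
    diag = [mass, mass, mass] + list(inertia_diag)      # m*E and I on the diagonal
    m_bar = [[diag[i] if i == j else 0.0 for j in range(6)] for i in range(6)]
    return matmul(matmul(transpose(j_bar), m_bar), j_bar)


reduced = pendulum_reduced_inertia(2.0, 0.5, (0.05, 0.05, 0.1), y=0.7)
print(reduced)  # a 1 x 1 matrix close to m*l**2 + I_zz
```

For this model the result is independent of y and equals m l² + I_zz, the familiar moment of inertia about the pivot, which gives a quick plausibility check of the Jacobians.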

In addition to the mechanical representation (20) of a multibody system, there exists
also the possibility to use the more general representation of dynamical systems in the
state space, i.e.,

$$\dot{x} = f(x, t), \tag{21}$$

where x means the n x 1 state vector composed of generalized coordinates and
velocities, and t the time, respectively.

3. Stability assessment 

The dynamical equations of multibody systems describing nonautonomous nonlinear
oscillations are represented in a canonical form as

$$\dot{x} = f(x, t), \quad x(t_0) = x_0 \tag{22}$$

following from (21). Here x is the n x 1 state vector, f an n x 1 vector function and
t means the time. At initial time $t_0$ the initial state $x_0$ is given. It is assumed that
$f(0, t) = 0$ represents an equilibrium position $x = 0$. Due to the nonlinearity of the
system, there may exist additional equilibria $x = x^*$.

The stability in the sense of Ljapunov characterizes the qualitative behaviour of
the equilibrium position $x = 0$ of the dynamical system (22). For the stability definition
the absolute value norm of a vector is used. The time-variant norm of the n x 1
state vector

$$x(t) := [x_1(t)\ \cdots\ x_n(t)]^T, \quad t \in [t_0, \infty) \tag{23}$$

is defined as

$$\|x(t)\| := \max_{1 \le i \le n} |x_i(t)|. \tag{24}$$

The time-interval norm reads as

$$\|x(t)\|_T := \max_{t_0 \le t \le T} \|x(t)\|, \tag{25}$$

where time T may approach infinity,

$$\|x(t)\|_\infty := \lim_{T \to \infty} \|x(t)\|_T. \tag{26}$$

These definitions are also valid for matrices, e.g.,

$$\|A\| := \max_{1 \le i \le n} \sum_{j=1}^{n} |a_{ij}|. \tag{27}$$
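
The norms (24), (25) and (27) translate directly into a few lines of code once a trajectory is available as a finite list of sampled states. The sketch below makes that sampling assumption explicitly; the function names are illustrative and the time-interval norm is only approximated by the maximum over the samples.

```python
def vector_norm(x):
    """Absolute value (maximum) norm of a state vector, cf. (24)."""
    return max(abs(component) for component in x)


def interval_norm(samples):
    """Approximate time-interval norm (25) from sampled states x(t_k)."""
    return max(vector_norm(x) for x in samples)


def matrix_norm(a):
    """Maximum absolute row sum of a matrix, cf. (27)."""
    return max(sum(abs(entry) for entry in row) for row in a)


trajectory = [[0.0, 1.0], [2.0, -0.5], [-4.0, 1.0]]   # assumed sample data
print(vector_norm(trajectory[1]), interval_norm(trajectory))
print(matrix_norm([[1.0, -2.0], [3.0, 0.5]]))
```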

The dynamical system (22) is called stable (in the sense of Ljapunov) if for every
$\varepsilon > 0$ there exists a positive number $\delta = \delta(\varepsilon) > 0$ such that for all initial conditions
bounded by

$$\|x_0\| < \delta = \delta(\varepsilon) \tag{28}$$

the corresponding trajectories $x(t)$ remain bounded for all t:

$$\|x(t)\| < \varepsilon. \tag{29}$$

The dynamical system (22) is asymptotically stable if it is stable and for all bounded
initial conditions (28) the corresponding trajectory tends to zero

$$\lim_{t \to \infty} \|x(t)\| = 0. \tag{30}$$

If the dynamical system (22) is not stable it will be called unstable.

Based on these definitions, there exists a large literature on stability problems, and 
quite a number of textbooks, e.g., Hahn (1967) and Muller (1977). However, the 
stability analysis provides only a qualitative answer. For engineering applications 
some quantitative global information on the dynamical behaviour is of interest. This 
can be obtained by a stability assessment. 

From the stability definitions in the sense of Ljapunov the following local and global 
stability measures are defined. 

The local stability measure

$$S1(x_0, t_0) := \frac{\|x_0\|}{\|x(t)\|_\infty} \tag{31}$$

characterizes the ratio between a given initial state $x_0$ and the corresponding maximal
displacement of the trajectory. The local stability measure S1 depends on $x_0$ and $t_0$.
The global stability measure

$$S2(r, t_0) := \min_{x_0 \in \{x : \|x\| = r\}} S1(x_0, t_0) \tag{32}$$

is defined for a subspace of the initial conditions’ state space. The global stability
measure S2 characterizes the maximal displacement of all trajectories starting out of
the initial conditions’ subspace which is by definition a hyper-cube with respect to the
equilibrium point $x = 0$.

By definition it yields

$$0 \le S1 \le 1, \quad 0 \le S2 \le 1. \tag{33}$$



In a numerical analysis the integration interval is limited. Then, the local stability
measure (31) and the global stability measure (32) have to be replaced by

$$S1_T(x_0, t_0) := \frac{\|x_0\|}{\|x(t)\|_T}, \tag{34}$$

$$S2_T(r, t_0) := \min_{x_0 \in \{x : \|x\| = r\}} S1_T(x_0, t_0). \tag{35}$$

For autonomous systems

$$\dot{x} = f(x), \quad x(t_0) = x_0 \tag{36}$$

the initial time can be chosen as $t_0 = 0$ without loss of generality.

There is a direct relation between the stability and the above defined stability
measures. For an unstable system one may choose a series of initial conditions
satisfying $\|x_{01}\| > \|x_{02}\| > \cdots > \|x_{0n}\|$. Then it yields

$$\lim_{n \to \infty} \|x_{0n}\| = 0 \quad \text{and} \quad \lim_{n \to \infty} S1(x_{0n}, t_0) = 0. \tag{37}$$

On the other hand, if the local stability measure S1 has the nonzero minimum s, then

$$\|x(t)\|_\infty \le \frac{\|x_0\|}{s}. \tag{38}$$

This means $\|x(t)\|_\infty \to 0$ for $\|x_0\| \to 0$, i.e., the system is stable in the sense of Ljapunov.

Usually the components of the state vector, $x_1(t), \ldots, x_n(t)$, have different units. For
the applications of the stability measures it is necessary that all components have the
same unit. This can be achieved by standardizing operations.

The stability measures defined characterize dynamical systems of arbitrary dimension
by a scalar number. Therefore, they are especially well suited for the multibody
system analysis. More sophisticated geometric measures of analysis are usually restricted
to two or at most three dimensions. The above defined stability measures may be
applied to linear systems, too, see Schiehlen & Hu (1995).


4. The Duffing oscillator 

The Duffing oscillator represents a wide range of dynamic systems with progressive
restoring forces. The equation of motion of the Duffing oscillator with a negative
stiffness modelling a vertical Euler beam support column loaded over its buckling point
reads as

$$\ddot{x} + d\dot{x} - x + x^3 = a\cos\omega t. \tag{39}$$

This equation describes a damped, periodically forced nonlinear and nonautonomous
vibrating buckled beam. It is used here as an example for the application of the method
presented. If not mentioned otherwise, the parameters of this Duffing equation are
chosen as follows,

$$d = 0.15, \quad a = 0.3, \quad \omega = 1.$$
The Duffing oscillator has a very complex dynamical behaviour. Figure 1 shows
a typical steady-state chaotic trajectory in the phase plane. It turns out that there is an
attractive chaotic region.

According to the definitions of the stability measure, it is proved that the local
stability measure S1 and the global stability measure S2 are periodic with respect to the
initial time $t_0$,

$$S1(x_0, \dot{x}_0, t_0) = S1(x_0, \dot{x}_0, t_0 + 2\pi/\omega), \tag{40}$$

$$S2(r, t_0) = S2(r, t_0 + \pi/\omega). \tag{41}$$

Moreover, the local stability measure has one symmetry, i.e.

$$S1(x_0, \dot{x}_0, t_0) = S1(-x_0, -\dot{x}_0, t_0 + \pi/\omega). \tag{42}$$
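
A finite-time local stability measure for the Duffing equation (39) can be estimated with a few lines of code, and the symmetry (42) offers a convenient check. The sketch below is only one possible realisation: it assumes a classical fourth-order Runge-Kutta scheme with fixed step, a window of length T measured from the initial time, and the maximum norm applied directly to the state (x, x_dot) without any unit standardisation. The function names, the step count and the short horizon are illustrative choices, not the authors' settings.

```python
import math


def duffing_rhs(t, x, v, d=0.15, a=0.3, omega=1.0):
    """Right-hand side of (39) written as a first-order system."""
    return v, -d * v + x - x ** 3 + a * math.cos(omega * t)


def local_stability_measure(x0, v0, t0, horizon, steps=4000, **params):
    """Estimate S1_T = ||x0|| / max ||x(t)|| on [t0, t0 + horizon] with RK4."""
    h = horizon / steps
    t, x, v = t0, x0, v0
    norm0 = max(abs(x0), abs(v0))       # must be nonzero
    peak = norm0
    for _ in range(steps):
        k1x, k1v = duffing_rhs(t, x, v, **params)
        k2x, k2v = duffing_rhs(t + h / 2, x + h / 2 * k1x, v + h / 2 * k1v, **params)
        k3x, k3v = duffing_rhs(t + h / 2, x + h / 2 * k2x, v + h / 2 * k2v, **params)
        k4x, k4v = duffing_rhs(t + h, x + h * k3x, v + h * k3v, **params)
        x += h / 6 * (k1x + 2 * k2x + 2 * k3x + k4x)
        v += h / 6 * (k1v + 2 * k2v + 2 * k3v + k4v)
        t += h
        peak = max(peak, abs(x), abs(v))
    return norm0 / peak


s_direct = local_stability_measure(1.0, 1.0, 0.0, horizon=20.0)
s_mirror = local_stability_measure(-1.0, -1.0, math.pi, horizon=20.0)
print(s_direct, s_mirror)   # the two values should agree, cf. (42)
```

A global measure in the sense of (35) would follow by minimising this function over initial states on the boundary of the hyper-cube, at the cost of one integration per sampled initial state.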

An overall view of the stability measures is gained after the calculation for all initial
conditions. Using a drawing technique for colour postscript figures developed by
Schaub (1990), one can graphically represent the local stability measure S1, see figure 2.
It shows clearly that the local stability measure S1 is more sensitive to initial
displacements than to initial velocities. Moreover, figure 3 and figure 4 show the
influence of initial time $t_0$ on the local stability measure S1. Then, a more detailed analysis
has to be performed in the parameter region found from the stability measures. Using the
result on the stability measure S1, the global stability measure S2 can also be evaluated and
the influence of initial time $t_0$ on the stability measures can be more easily seen (figure 5).

Numerical results indicate that the initial time $t_0$ does not play an important role in
the stability measures of the Duffing oscillator for small initial conditions and large
initial conditions. However, in the middle range of the initial conditions the stability
measures are sensitive to the initial time $t_0$, see figure 5. Sometimes, the Duffing
Figure 1. The phase curve of the Duffing oscillator for the initial time $t_0 = 0$ and the initial
condition $x_0 = \dot{x}_0 = 2$.

Figure 2. The local stability measure $S1_T$ of the Duffing oscillator at the initial time
$t_0 = 0$ for $T = 200$, $-5 \le x_0 \le 5$ and $-5 \le \dot{x}_0 \le 5$.

Figure 3. The local stability measure $S1_T$ of the Duffing oscillator at the initial time
$t_0 = \pi/4$ for $T = 200$, $-2 \le x_0 \le 2$ and $-2 \le \dot{x}_0 \le 2$.

Figure 4. The local stability measure $S1_T$ of the Duffing oscillator at the initial time
$t_0 = \pi/2$ for $T = 200$, $-2 \le x_0 \le 2$ and $-2 \le \dot{x}_0 \le 2$.

Figure 5. The global stability measure $S2_T$ of the Duffing oscillator at different
initial times $t_0$ for $T = 200$.































oscillator even shows highly different dynamical characteristics. For example, the
phase curves for two slightly different initial conditions, (1, 1) and $(1 - 10^{-8}, 1)$, with
different initial times $t_0 = 0$ and $t_0 = \pi/2$, are markedly different. For the initial time
$t_0 = 0$, the Duffing oscillator shows a periodic limit cycle, figure 6. However, the
motion for the initial time $t_0 = \pi/2$ at the same initial condition is chaotic, see figure 7.

In addition, the influence of the frequency of the excitation on the stability measures 
is shown in figure 8, where the frequency is chosen zero. From the phase curves in 
figure 9, one can more easily understand the curious green hole in figure 8. Again, the 
stability measures proved to be valuable in identifying critical parameters. 

5. Conclusion 

Engineering structures may result in highly nonlinear equations of motion, but in 
applications only bounded motions are acceptable. The approach of stability measures 
allows the systematic computation of a basin of bounded motions for systems of 
arbitrary dimension. Then, the sensitivity of the systems under consideration to initial 
conditions and parameters can be investigated in more detail. The stability measures
are not related to the frequency of the system response; they consider the absolute value
of all state variables, which is often important in the engineering design of mechanical
systems. Some fundamental phenomena have been analysed for the Duffing oscillator
representing structures with progressive stiffness.
Dynamical behaviour of a friction oscillator with simultaneous
self and external excitation 

K POPP, N HINRICHS and M OESTREICH 

Institute of Mechanics, University of Hannover, Appelstraße 11, D-30167
Hannover, Germany 

Abstract. Friction-induced self-sustained oscillations, also known as 
stick-slip vibrations, occur in mechanical systems as well as in everyday life. 
In engineering applications these vibrations are undesirable and should be 
avoided. In the present paper it is shown how the very robust limit cycles of 
stick-slip vibrations can be broken up by a harmonic disturbance. Based on 
a simple model of a friction oscillator with simultaneous self and external 
excitation the resulting bifurcation behaviour and the routes to chaos are 
investigated for a wide range of system parameters. The influence of 
different types of friction characteristics is elaborated and the admissibility 
of smoothing procedures is examined by comparing results gained for 
non-smooth and smoothed friction characteristics. 

Keywords. Friction oscillator; stick-slip vibrations; self-excitation; bifurcations;
chaos; non-smooth systems.


1. Introduction 

Self-sustained oscillations due to dry friction, also known as stick-slip vibrations, 
appear in everyday life as well as in engineering systems. Figure 1 shows some 
examples. The sound of bowed instruments, creaking doors or squeaking chalks and 
the squealing noise of tramways in narrow curves are caused by stick-slip vibrations. 
Other engineering examples are rattling joints of a robot, grating brakes or chattering 
machine tools. 

The explanation of the stick-slip phenomenon can be found in textbooks, e.g.
Kauderer (1958), Magnus (1961) or Hagedorn (1984). Since we are dealing with
self-sustained oscillations, the vibration system comprises an energy source, an oscillator,
and a switching mechanism triggered by the oscillator, which controls the energy flow
from the source to the vibrating system, cf. figure 2.

If the energy flow into the vibration system is greater (less) than the dissipation 
during one period, then the vibration amplitude will increase (decrease). If the energy 
input and output is balanced during each period, then an isolated periodic motion 
occurs, which is known to be a limit cycle. Let us take the violin as an example. 
Obviously, the energy source is represented by the motion of the bow and the oscillator 
Figure 2. Block diagram of a self-sustained vibration system.


colophony). Essential is a friction force with a decreasing characteristic for increasing 
relative velocities of the rubbing surfaces. This will be explained by means of the simple 
friction oscillator shown in figure 3, which has been used as a model for bowed 






Figure 3. Friction oscillator: (a) mechanical model, (b) friction force characteristic, 
(c) limit cycle, (d) time histories of motion, friction force and input power. 
instruments. The energy source is a moving belt with speed $v_0$ = constant, driving the
mass of the discrete spring-mass oscillator. The friction force $F_R$ depends on the relative
velocity $v_r = v_0 - \dot{x}$ between belt and mass. The friction force characteristic $F_R = F_R(v_r)$ is
shown in figure 3b. Since the limit value of the static friction force $F_R(v_r \to 0)$ is greater than
the kinetic friction force $F_R(v_r \neq 0)$, the friction force characteristic decreases for small
values of $v_r$. This has been observed e.g. by Conti (1875), Galton (1878), Franke (1882),
Bochet (1961) and Kraft (1967). If the belt velocity is adjusted so that the friction force
shows a negative slope for the equilibrium state $x = x_s$, $\dot{x} = 0$, then this state becomes
unstable. Thus, the amplitude grows and the trajectory in the $x, \dot{x}$-phase plane quickly reaches
a limit cycle, cf. figure 3c, where clearly the slip mode A → B and the stick mode B → A can
be distinguished. The physical reason for the instability is an energy transfer from the
belt to the mass. This is qualitatively shown in figure 3d by a sequence of time histories of
one period duration. Assuming a small mass motion around the equilibrium position with
a sinusoidal velocity results in a sinusoidal fluctuation of the relative velocity $v_r = v_0 - \dot{x}$
around the mean value $v_0$. Due to the force characteristic the corresponding friction force is
larger for smaller values of $v_r$ than for larger values. Thus, the product $F_R \dot{x}$, which denotes
the mechanical power input on the mass, shows larger positive than negative values.
Hence, a positive energy input, $\Delta E_{in} = \int_0^{T_s} F_R \dot{x}\, dt > 0$, results during each period $T_s$,
which in turn leads to increasing vibration amplitudes until the limit cycle is reached.
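
The sign of the energy input per period can be checked numerically for a small sinusoidal motion. The sketch below assumes a hypothetical linearised characteristic F_R = f0 + slope * v_r around the operating point and a prescribed velocity x_dot = A sin(ωt) with A smaller than the belt speed, so that the motion stays in the slip mode; none of these names or numbers come from the paper. For this linear model the integral evaluates to -slope * A² * π/ω, so a falling characteristic (negative slope) feeds energy into the oscillator.

```python
import math


def friction_force(v_rel, f0, slope):
    """Hypothetical linearised friction characteristic near the operating point."""
    return f0 + slope * v_rel


def energy_input_per_period(v_belt, amplitude, omega, slope, f0=1.0, n=2000):
    """Midpoint-rule estimate of the integral of F_R * x_dot over one period."""
    period = 2.0 * math.pi / omega
    dt = period / n
    total = 0.0
    for k in range(n):
        t = (k + 0.5) * dt
        x_dot = amplitude * math.sin(omega * t)      # assumed small motion
        total += friction_force(v_belt - x_dot, f0, slope) * x_dot * dt
    return total


falling = energy_input_per_period(1.0, 0.1, 1.0, slope=-0.2)
rising = energy_input_per_period(1.0, 0.1, 1.0, slope=0.2)
print(falling, rising)   # positive for the falling, negative for the rising slope
```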

Figure 4 shows three different types of friction characteristics, in the following
denoted by I, II, III, and the corresponding phase portraits. Here, the friction coefficient
$\mu(v_r) = |F_R(v_r)|/F_N$, i.e. the friction force $F_R$ related to the normal force $F_N$ in the contact
area, has been plotted. Friction characteristic I is due to Coulomb’s friction law and
consists of a static friction coefficient $\mu(v_r = 0) = \mu_0$ and a smaller constant kinetic
friction coefficient $\mu(v_r \neq 0) = \mu$. So, the transition from stick ($v_r = 0$) to slip ($v_r \neq 0$) is
not continuous. Friction characteristic II is similar to that shown in figure 3 and has
been explained earlier. Here, the transition from static to kinetic friction is continuous
but not differentiable. Friction characteristic III follows from II by a smoothing
procedure so that the characteristic is continuous and differentiable; however, there is no
pronounced static friction any more and $\mu(v_r = 0) = 0$ holds. The smoothing is done by
multiplication of the non-smooth characteristic with a smoothing function $g(v_r)$, where

$$g(v_r) = (2/\pi)\arctan(c_1 v_r) \tag{1}$$

has been chosen. The slope $g'$ of the function g for $v_r = 0$ is given by

$$g' = \left.\frac{dg(v_r)}{dv_r}\right|_{v_r = 0} = \frac{2}{\pi} c_1, \tag{2}$$

so that the slope is proportional to $c_1$. The angle $\alpha$ between the vertical axis and the
tangent to the smoothing function g in the origin is defined by

$$\alpha = \arctan\frac{1}{g'}. \tag{3}$$

The three corresponding phase portraits are qualitatively similar. Since characteristic
I has no negative slope for the equilibrium state, this state does not become unstable
and in its neighbourhood there exist periodic solutions in a pure slip mode, in contrast to the
solutions for characteristics II and III, where self-excitation takes place. However, for
initial conditions far away from the equilibrium state, in any case all trajectories ultimately




Figure 4. (a) Friction characteristics I, II, III, (b) corresponding phase portraits for pure self excitation. 
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reach the limit cycle. (In case of characteristic I it is better to say limit curve instead of limit 
cycle, since other periodic solutions exist, but this distinction shall not be made in the 
following.) The limit cycle for characteristics I and II consists of a horizontal line due to 
the stick mode whereas for characteristic III there is no line representing constant velocity 
since there exists no real stick mode. The differences in the friction characteristics are also 
reflected in the transition from stick to slip and slip to stick in the corresponding limit 
cycles. The slope of the limit cycle in the transition points is non-smooth (smooth) for 
both points for characteristic I (III), whereas for characteristic II the slope is smooth for 
the transition from stick to slip and non-smooth for the slip to stick transition. Comparing 
the three phase portraits we find the interesting result that all trajectories starting outside 
the limit cycle behave in a similar manner independent of the underlying friction charac¬ 
teristic. This robustness is important, since in applications the friction characteristic may 
vary considerably or is not known at all. 

In engineering applications stick-slip vibrations are undesirable and should be 
avoided since they debase precision of motion and safety of operation or create noise. 
In the present paper it is investigated how the very robust limit cycle of stick-slip 
vibrations can be broken up by a harmonic disturbance. Thus, the simple self-sustained 
friction oscillator of second order shown in figure 3a is extended to a third order system 
by subjecting the left end of the spring to a harmonic external excitation and the 
corresponding dynamical behaviour is investigated. It is well known that self-excited 
one-degree-of-freedom vibration systems with external excitation can exhibit rich 
bifurcational behaviour and also chaos. There exists a fast growing literature on such 
systems with smooth nonlinearities, e.g. Guckenheimer & Holmes (1983), Abraham 
& Shaw (1982), Thompson & Stewart (1986), Kunick & Steeb (1986), Neimark & 
Landa (1987), Moon (1987, 1992). Chaotic behaviour has also been found for self-excitation
due to the non-smooth nonlinearity of dry friction, cf. Popp et al
(1985), Grabec (1988), Fingberg (1989, 1990), Popp & Stelter (1990), Stelter & Popp
(1989), Stelter (1990), Stelter & Sextro (1991), Popp (1991, 1992). In the following
investigation, results will be shown for all three friction characteristics I, II, III. Special
attention will be paid to the admissibility of smoothing procedures, which will be
examined by comparing bifurcation diagrams obtained for non-smooth and smoothed
friction characteristics. The benefit of smoothed friction characteristics is the possibility
of applying common bifurcation theory and its numerical realization in readily available
computer codes, cf. Seydel (1983, 1993). However, there is a growing need for
mathematical tools for the analysis of non-smooth nonlinear systems.

2. Model of a friction oscillator with simultaneous self- and external excitation 

The mechanical model of the friction oscillator with simultaneous self- and external
excitation is shown in figure 5. The following notation is used: mass $m$, spring constant
$c$, displacement of mass $x(t)$, excitation $u(t) = u_0\cos\Omega t$, excitation frequency $\Omega$, belt
speed $v_0$, relative velocity $v_r = v_0 - \dot{x}$. The equation of motion reads

$$ m\ddot{x}(t) + c\,x(t) = F_R(v_r) + c\,u_0\cos\Omega t, \tag{4} $$

where the friction force $F_R$ depends on the relative velocity $v_r$ and the normal force
$F_N$ ($F_N \ge 0$). During the slip mode ($v_r \ne 0$) the friction force reads

$$ F_R(v_r) = \mu(v_r)\,F_N\,\mathrm{sgn}(v_r), \tag{5} $$

whereas for the stick mode ($v_r = 0$) the friction force reads

$$ F_R = c\,(x(t) - u_0\cos\Omega t), \tag{6} $$

and is bounded by

$$ |F_R| \le \mu_0 F_N. \tag{7} $$

Figure 5. Mechanical model of a friction oscillator with simultaneous self- and external excitation.

The static and kinetic friction coefficients follow from figure 4. However, for friction
characteristic III no distinct stick mode occurs; thus, (5) holds for the complete motion.
For the static equilibrium with $u_0 = 0$, the static displacement follows from (4) as

$$ x_s = \frac{F_R(v_0)}{c} = \mu(v_0)\,\frac{F_N}{c}. \tag{8} $$

The equation of motion can be normalized using

$$ \tau = \omega_0 t, \quad \omega_0 = (c/m)^{1/2}, \quad \eta = \Omega/\omega_0, \quad (\cdot)' = \frac{d(\cdot)}{d\tau} = \frac{\dot{(\cdot)}}{\omega_0}, \quad v_r = v_0 - \omega_0 x'. \tag{9} $$

From (4) and (9) it follows that

$$ x''(\tau) + x(\tau) = \frac{F_R(v_r)}{c} + u_0\cos\eta\tau. \tag{10} $$

The corresponding system of autonomous differential equations reads

$$ x_1' = x_2, \quad x_2' = -x_1 + \frac{F_R(v_r)}{c} + u_0\cos x_3, \quad x_3' = \eta, \tag{11} $$

where $x_1 = x(\tau)$, $x_2 = x'(\tau)$, $x_3 = \eta\tau$.

For friction characteristics II and III, numerical solutions have been computed
on the basis of (11) using the Advanced Continuous Simulation Language, ACSL
(1991). For friction characteristic I, however, the general solution can be given
analytically. For both solution methods special attention has to be paid to the exact
determination of the transition states from stick to slip and from slip to stick.


3. Solution methods 

3.1 Point-mapping approach 

As mentioned before, for friction characteristic I an analytic solution can be formulated
for the stick and the slip modes. The exact states of transition from one mode to the
other have to be calculated and implemented as initial values for the analytic solution
of the corresponding modes.

Figure 6. Schematic diagram for the point-mapping approach.


The system behaviour is calculated following the procedure shown in figure 6.
Starting with the stick mode ($v_r = 0$, $\dot{x} = v_0$), it follows from (6) and (7) that

$$ |x(\tau) - u_0\cos\eta\tau| \le x_0, \quad x_0 = \mu_0\,(F_N/c). \tag{12} $$

The equality sign in (12) holds if the spring force reaches the maximum friction force for
the stick mode, which characterizes the transition point A. With the choice of the
excitation angle $\eta\tau_0 \in [0, 2\pi]$, the displacement $x_{A_0}$ where slip initiates follows from

$$ |x_{A_0} - u_0\cos\eta\tau_0| = x_0. \tag{13} $$

Here, $x_{A_i}$ in each cycle $i$ is bounded by $x_0 - u_0 \le x_{A_i} \le x_0 + u_0$.

For the subsequent slip mode A → B ($v_r \ne 0$, $\mu(v_r) = \mu$ = constant) to the next
transition point B the corresponding solution reads

$$ x(\tau) = c_0\cos(\tau - \varphi_0) + [u_0/(1 - \eta^2)]\cos\eta\tau + x_s, \quad x_s = \mu\,(F_N/c), \tag{14} $$

$$ v_r = v_0 - \omega_0 x' \ge 0, \tag{15} $$

where $c_0$ and $\varphi_0$ can be determined from the initial conditions for $\tau_0 = \tau_{A_0}$
($x(\tau_0) = x_{A_0}$, $x'(\tau_0) = v_0/\omega_0$):

$$ c_0 = \left\{\left[x_{A_0} - x_s - \frac{u_0}{1 - \eta^2}\cos\eta\tau_{A_0}\right]^2 + \left[\frac{v_0}{\omega_0} + \frac{\eta u_0}{1 - \eta^2}\sin\eta\tau_{A_0}\right]^2\right\}^{1/2}, \tag{16} $$

$$ \tan(\varphi_0 - \tau_{A_0}) = \frac{v_0/\omega_0 + [\eta u_0/(1 - \eta^2)]\sin\eta\tau_{A_0}}{x_{A_0} - x_s - [u_0/(1 - \eta^2)]\cos\eta\tau_{A_0}}. \tag{17} $$

Now (14) holds until the mass velocity reaches the belt velocity again, so that the equality
sign of (15) is valid. Substituting the derivative of (14) into (15), an implicit equation for
the normalized time $\Delta\tau_{A_0 \to B_0}$ of the motion from A to B is given:

$$ x'(\tau_{B_0}) = -c_0\sin(\tau_{A_0} + \Delta\tau_{A_0 \to B_0} - \varphi_0) - [u_0\eta/(1 - \eta^2)]\sin(\eta\tau_{A_0} + \eta\Delta\tau_{A_0 \to B_0}) = v_0/\omega_0. \tag{18} $$

With the solution $\Delta\tau_{A_0 \to B_0}$ of (18) the displacement $x_{B_0}$ of point B, where the transition
from the slip mode to the stick mode takes place, can be calculated from (14).

Now the mass again sticks on the belt and moves with it until the restoring spring
force reaches the maximum friction force acting on the mass. From

$$ \frac{v_0}{\omega_0}\,\Delta\tau_{B_0 \to A_1} - u_0\cos\left(\eta(\tau_{B_0} + \Delta\tau_{B_0 \to A_1})\right) = x_0 - x_{B_0} \tag{19} $$

the time $\Delta\tau_{B_0 \to A_1}$ is given implicitly. The corresponding displacement $x_{A_1}$ of the
transition point A at the time

$$ \tau_1 = \tau_0 + \Delta\tau_{A_0 \to B_0} + \Delta\tau_{B_0 \to A_1} \tag{20} $$

can be calculated as mentioned above using (13). Hence, the iterative procedure starts
again. So, with the help of (13)-(20) a sequence of the states $x_{A_i}$ and $x_{B_i}$ can be
calculated.

In contrast to a similar mapping approach used by Guckenheimer & Holmes (1983)
for the analysis of a bouncing ball on a vertically moving surface, here the point mapping
is given in implicit form. Thus, for solving the implicit equations a suitable
Newton method has to be chosen.
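
The iteration described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the implicit equations are solved by a coarse scan followed by bisection instead of a Newton method, the absolute value in (13) is dropped (small excitation amplitude), the constants $c_0$ and $\varphi_0$ are obtained directly from the initial conditions at A, `v` stands for $v_0/\omega_0$, and the numerical values ($\mu_0$ = 0.4, $\mu$ = 0.25, $F_N/c$ = 10, $v_0/\omega_0$ = 1) are hypothetical.

```python
import math


def first_upcrossing(f, t_start, dt=1e-3, t_max=200.0):
    """First t > t_start where f goes from negative to non-negative (scan, then bisection)."""
    t_prev, f_prev = t_start, f(t_start)
    while t_prev - t_start < t_max:
        t_cur = t_prev + dt
        f_cur = f(t_cur)
        if f_prev < 0.0 <= f_cur:
            lo, hi = t_prev, t_cur
            for _ in range(60):
                mid = 0.5 * (lo + hi)
                if f(mid) < 0.0:
                    lo = mid
                else:
                    hi = mid
            return hi
        t_prev, f_prev = t_cur, f_cur
    raise RuntimeError("no transition found within t_max")


def point_map(tau_a, eta, u0, x0, xs, v):
    """One cycle A -> B -> A of the point mapping; returns (tau_a_next, x_a, x_b)."""
    x_a = x0 + u0 * math.cos(eta * tau_a)           # eq. (13), small u0 assumed
    big_u = u0 / (1.0 - eta ** 2)                   # amplitude of the forced part in (14)
    a = x_a - xs - big_u * math.cos(eta * tau_a)
    b = v + big_u * eta * math.sin(eta * tau_a)
    c0 = math.hypot(a, b)                           # from x(tau_a) = x_a, x'(tau_a) = v
    phi0 = tau_a + math.atan2(b, a)

    def x(t):                                       # slip solution, eq. (14)
        return c0 * math.cos(t - phi0) + big_u * math.cos(eta * t) + xs

    def dx(t):                                      # its derivative with respect to tau
        return -c0 * math.sin(t - phi0) - big_u * eta * math.sin(eta * t)

    # Slip -> stick, eq. (18): start just after A, where dx = v already holds.
    tau_b = first_upcrossing(lambda t: dx(t) - v, tau_a + 1e-3)
    x_b = x(tau_b)
    # Stick -> slip, eq. (19): the mass moves with the belt until the spring force
    # reaches the maximum static friction force.
    tau_a_next = first_upcrossing(
        lambda t: x_b + v * (t - tau_b) - u0 * math.cos(eta * t) - x0, tau_b)
    return tau_a_next, x_a, x_b


# Hypothetical parameters: mu_0 = 0.4, mu = 0.25, F_N/c = 10, v_0/omega_0 = 1
X0, XS, V = 0.4 * 10.0, 0.25 * 10.0, 1.0
tau = 0.0
for i in range(5):
    tau, x_a, x_b = point_map(tau, eta=0.75, u0=0.5, x0=X0, xs=XS, v=V)
    print(i, round(x_a, 4), round(x_b, 4))
```

Without external excitation ($u_0 = 0$) the slip arc is a circle around $x_s$ in the phase plane, so the sketch should return $x_B = 2x_s - x_0$, which provides a simple check. Interruptions of the stick mode, as they occur for large $u_0$, are not handled.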

3.2 Numerical integration 

For friction characteristics II and III, where the kinetic friction coefficient is not 
constant, other solution methods have to be chosen. The system behaviour for these 
characteristics has been calculated by numerical integration of (11). For the system with 
characteristic II also an exact determination of the points of transition from stick to slip 
A and from slip to stick B is necessary. Passing those points during integration 
demands starting a special routine that determines the exact states of transition by 
iteration. For the smoothed friction characteristic III no distinct stick mode occurs. 
Smoothing the friction characteristic results in stiff system equations for small values of
the relative velocity. In this case appropriate integration routines and/or special
stepsize control have to be used. In this work the numerical integration has been done
using the program package ACSL. With the supplied language element SCHEDULE
a simple and adequate implementation of the iteration to the transition points from one
mode to another is possible. With the help of stepsize control algorithms the integration
steps are adjusted dynamically in order to keep the error in each state variable
below a prescribed value. At this point it should be mentioned that solving the system
equations utilizing the point-mapping approach (if possible) leads to an enormous
saving of CPU-time of the order of a factor of 1000.
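
As an illustration of the direct integration of (11) for a smoothed characteristic, the following Python sketch uses a classical fourth-order Runge-Kutta scheme with a small fixed step in place of ACSL and its stepsize control. Everything numerical in it is an assumption made for the example: the falling friction law `mu`, its constants, the choice $\omega_0 = 1$, and the smoothing parameter $c_1 = 100$. The fixed step is chosen small enough for the stiffness caused by the smoothing function near $v_r = 0$.

```python
import math

# Illustrative parameters (assumed, not taken from the paper)
ETA = 0.75         # frequency ratio eta
U0 = 0.5           # excitation amplitude u_0
FN_OVER_C = 10.0   # F_N / c
V0 = 1.0           # belt speed v_0
OMEGA0 = 1.0       # natural frequency omega_0, so v_r = V0 - OMEGA0 * x2
C1 = 100.0         # smoothing parameter c_1 of eq. (1)
MU0, MU_INF, DELTA = 0.4, 0.25, 2.0   # hypothetical falling friction law


def mu(v_r):
    """Hypothetical friction coefficient decreasing from MU0 towards MU_INF."""
    return MU_INF + (MU0 - MU_INF) / (1.0 + DELTA * abs(v_r))


def g(v_r):
    """Smoothing function of eq. (1)."""
    return (2.0 / math.pi) * math.atan(C1 * v_r)


def rhs(state):
    """Right-hand side of the autonomous system (11)."""
    x1, x2, x3 = state
    v_r = V0 - OMEGA0 * x2
    friction = mu(v_r) * FN_OVER_C * g(v_r)   # smoothed F_R / c
    return (x2, -x1 + friction + U0 * math.cos(x3), ETA)


def rk4_step(state, h):
    """One classical Runge-Kutta step of size h."""
    def shifted(s, k, f):
        return tuple(si + f * ki for si, ki in zip(s, k))
    k1 = rhs(state)
    k2 = rhs(shifted(state, k1, h / 2))
    k3 = rhs(shifted(state, k2, h / 2))
    k4 = rhs(shifted(state, k3, h))
    return tuple(s + h / 6 * (a + 2 * b + 2 * c + d)
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))


def integrate(state, tau_end, h=1e-3):
    """Integrate from tau = 0 to tau_end and return the list of states."""
    n_steps = int(round(tau_end / h))
    trajectory = [state]
    for _ in range(n_steps):
        state = rk4_step(state, h)
        trajectory.append(state)
    return trajectory


trajectory = integrate((0.0, 0.0, 0.0), tau_end=50.0)
x1_end, x2_end, x3_end = trajectory[-1]
print(f"x = {x1_end:.4f}, x' = {x2_end:.4f}, eta*tau = {x3_end:.4f}")
```

For the non-smooth characteristic II such a plain integrator would not suffice, because the transition points between stick and slip have to be located by iteration as described above.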

3.3 Numerical bifurcation analysis 

For smooth nonlinear systems like the friction oscillator with friction characteristic III,
program packages for bifurcation analysis can be applied. Here, several computer
codes are available, for instance BIFPACK (Seydel 1993), AUTO (Doedel 1981) and
PATH (Kaas-Petersen 1987). The results of §4.5 have been obtained with BIFPACK,
a program for continuation, bifurcation and stability analysis.

For the task of continuation and branch switching, the system equation (10) can be
reformulated into a boundary-value problem as follows. Periodic oscillations obey the
boundary conditions

$$ x(0) = x(T), \quad x'(0) = x'(T), \tag{21} $$

where $T$ denotes the period and satisfies

$$ T = n\,2\pi/\eta, \quad \text{with } n \in \mathbb{N}, \tag{22} $$

so that $u(\tau = 0) = u(\tau = T)$ holds.

As a consequence of (22), the integration of the system equation (10) for one period
requires an adapted integration interval $0 \le \tau \le T$ whenever $\eta$ is changed. Shifting the
dependence on $\eta$ to the right-hand side of the differential equation, the integration
interval is transformed to unit length (Seydel 1983).

We introduce the normalized time $\xi$,

$$ \xi = (1/T)\,\tau, \tag{23} $$

so that $0 \le \xi \le 1$ holds for one period, and use the derivative

$$ \frac{d(\cdot)}{d\xi} = T\,\frac{d(\cdot)}{d\tau}. \tag{24} $$

With the notation $y_1(\xi) = x(\tau)$, $y_2(\xi) = x'(\tau)$ and $y_3(\xi) = \eta$, so that $y_3$ represents the
bifurcation parameter, the system of autonomous differential equations reads

$$ \frac{dy_1}{d\xi} = \frac{2\pi n}{y_3}\,y_2, \quad \frac{dy_2}{d\xi} = \frac{2\pi n}{y_3}\left[-y_1 + u_0\cos(2\pi n\xi) + \mu(v_r)\,g(v_r)\,\frac{F_N}{c}\right], \quad \frac{dy_3}{d\xi} = 0. \tag{25} $$

Here, $g(v_r)$ denotes the smoothing function (1) and $v_r = v_0 - \omega_0 y_2$. The boundary
conditions now read as follows:

$$ y_1(0) - y_1(1) = 0, \quad y_2(0) - y_2(1) = 0, \quad y_3(0) = \eta. \tag{26} $$

The Jacobian matrix can be determined analytically from (25). Now the required input,
cf. (25) and (26), for the use of BIFPACK is available.

The subsequent work done by BIFPACK can be summarized as follows (Seydel 1983):

- First the initial (and final) values $y_1$ and $y_2$ of the periodic trajectories for a fixed
number $n$ have to be calculated, so that the boundary conditions are satisfied. This
can be done by using shooting methods:

An initial guess for $y_1$, $y_2$ leads after the integration over the interval $0 \le \xi \le 1$ to
a nonzero residual vector

$$ r(y_1(0), y_2(0)) = \begin{pmatrix} y_1(0) - y_1(1) \\ y_2(0) - y_2(1) \end{pmatrix}. \tag{27} $$

Detecting the initial values of a periodic orbit requires a vanishing residual
vector $r(y_1(0), y_2(0))$. So, the simplest shooting method combines an integrator, which
determines the final state for a set of guessed initial values, with a Newton method, which
approximates the zero of the residual vector calculated from the output of
the integrator. In professional multiple shooting
methods the integration interval is subdivided several times, so that better
convergence of the Newton method can be obtained.

- For a suitable next step of the bifurcation parameter a predictor method calculates
an initial guess for the corresponding initial state, and the multiple shooting method
leads to the determination of the periodic orbit.

- Path tracing repeats the procedure described above again and again. During
continuation turning points can be approximated on the basis of a suitable test
function. Additional subroutines carry out the stability analysis for calculated
periodic orbits.

Thus, efficient methods are placed at the user's disposal that, compared to classical
numerical integration, supply additional information like unstable branches and the
detection of branch points.
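
The simple shooting idea, an integrator combined with a Newton iteration on the residual vector (27), can be sketched in Python as follows. This is not BIFPACK and not the friction oscillator: as a stand-in the sketch uses a linearly damped forced oscillator, $x'' + 2\zeta x' + x = u_0\cos\eta\tau$, whose periodic solution is known in closed form, so that the result can be checked. The damping ratio `ZETA`, the finite-difference Jacobian and all function names are assumptions made for the example.

```python
import math

ZETA, ETA, U0 = 0.1, 0.75, 0.5     # assumed stand-in parameters
PERIOD = 2.0 * math.pi / ETA       # T = n * 2*pi / eta with n = 1, cf. eq. (22)


def rhs(tau, y):
    """Stand-in smooth system: x'' + 2*zeta*x' + x = u0*cos(eta*tau)."""
    return (y[1], -y[0] - 2.0 * ZETA * y[1] + U0 * math.cos(ETA * tau))


def flow(y0, steps=2000):
    """Integrate over one period with classical RK4 and return the final state."""
    h = PERIOD / steps
    y, tau = tuple(y0), 0.0
    for _ in range(steps):
        k1 = rhs(tau, y)
        k2 = rhs(tau + h / 2, (y[0] + h / 2 * k1[0], y[1] + h / 2 * k1[1]))
        k3 = rhs(tau + h / 2, (y[0] + h / 2 * k2[0], y[1] + h / 2 * k2[1]))
        k4 = rhs(tau + h, (y[0] + h * k3[0], y[1] + h * k3[1]))
        y = (y[0] + h / 6 * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]),
             y[1] + h / 6 * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]))
        tau += h
    return y


def residual(y0):
    """Residual vector in the sense of eq. (27): initial minus final state."""
    y1 = flow(y0)
    return (y0[0] - y1[0], y0[1] - y1[1])


def shoot(y0, tol=1e-10, max_iter=20, eps=1e-6):
    """Newton iteration on the residual with a finite-difference Jacobian."""
    y = list(y0)
    for _ in range(max_iter):
        r = residual(y)
        if math.hypot(*r) < tol:
            break
        ra = residual((y[0] + eps, y[1]))
        rb = residual((y[0], y[1] + eps))
        j11, j21 = (ra[0] - r[0]) / eps, (ra[1] - r[1]) / eps
        j12, j22 = (rb[0] - r[0]) / eps, (rb[1] - r[1]) / eps
        det = j11 * j22 - j12 * j21
        # Solve J * d = -r by Cramer's rule and update the guess
        y[0] += (-r[0] * j22 + r[1] * j12) / det
        y[1] += (-j11 * r[1] + j21 * r[0]) / det
    return tuple(y)


y_periodic = shoot((0.0, 0.0))
print("initial state of the periodic orbit:", y_periodic)
```

For this linear stand-in the Newton iteration converges after one or two steps. For the nonlinear system (25) a good initial guess is needed, which is what the predictor step of the continuation provides.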


4. System behaviour for different friction characteristics 

In this section the system behaviour of the friction oscillator with simultaneous
self- and external excitation is examined for the different friction characteristics I, II
and III.

4.1 Friction characteristic I 

4.1a State space behaviour: The system behaviour for a fixed set of bifurcational
parameters $u_0$, $F_N/c$ and $\eta$ is represented by the phase trajectories. Figure 7 shows
different types of phase trajectories, where the velocity $v$ of the mass is shown as
a function of the displacement $x$. In contrast to the system without external excitation
represented by the phase portraits shown in figure 4, the system with external excitation
can exhibit one-periodic solutions (figure 7a), two-periodic solutions (figure 7b),
higher-periodic solutions and also chaotic system behaviour (figure 7c). As an
interesting detail one can see that for large amplitudes $u_0$ of the excitation short
interruptions of the stick mode become possible. Pushed by spring forces the mass
overtakes the belt (figure 7d) or slows down, reentering the stick mode after a small
wiggle (figure 7e). In the following we confine ourselves to small excitation amplitudes
$u_0$, so that interruptions of the stick mode as mentioned above can be excluded. For
this restriction (13) of the point-mapping approach can be simplified by assuming
$x_{A_i} > u_0\cos(\eta\tau)$.

The trajectory of a solution with high periodicity in the three-dimensional state space
$M = \mathbb{R}^2 \times S^1$ is represented by a flattened torus (figure 8). One can distinguish the
circular ring area on top of the torus characterizing the stick mode with constant
velocity of the mass ($v_r = v_0 - \omega_0 x' = 0$) and the remaining surface for the slip mode.
The thick line represents the trajectory of the one-periodic motion of figure 7a. After
one rotation around the centre axis of the torus, i.e. after one period of the excitation
($n = 1$), the trajectory has also rotated once ($p = 1$) around the body of the torus and
returned to its initial state. The ratio of revolutions along and around the torus is called
the winding number $W$; here it yields $W = n/p = 1/1$.

Figure 7. Phase plane plots for
(a) $\eta = 0.75$, $F_N/c = 10$, $u_0 = 0.5$,
(b) $\eta = 2.15$, $F_N/c = 10$, $u_0 = 0.5$,
(c) $\eta = 1.97$, $F_N/c = 10$, $u_0 = 0.5$,
(d) $\eta = 0.5$, $F_N/c = 10$, $u_0 = 4.0$,
(e) $\eta = 0.41$, $F_N/c = 5$, $u_0 = 2.6$.

Figure 8. Trajectory in the three-dimensional phase space for $\eta = 0.6$, $F_N/c = 10$, $u_0 = 0.5$,
and a one-periodic solution for $\eta = 0.75$, $F_N/c = 10$, $u_0 = 0.5$.

One way to visualize the geometric structure of the attractor is the Poincaré map $P$,
where a projection on a hypersurface $\Sigma$ of the state space $M$ is chosen that fulfills the
transversality and recurrence conditions for the trajectories, e.g. Kunick & Steeb
(1986). For the friction oscillator the choice of $\Sigma$ can be made in a simple, illustrative
way: The Poincaré map can be carried out by determining the intersection points of the
trajectories on the torus $M$ with a plane $\Sigma$ transversal to the torus corresponding to
$\eta\tau$ = constant. In other words, the trajectory is flashed stroboscopically with the
frequency of the excitation. In figure 9a the trajectory of the chaotic solution (figure 7c)
in the three-dimensional state space is shown together with the plane $\Sigma$. From figure 9b
it can be seen that the attractor of the chaotic motion exhibits a Cantor-set-like
structure.

4.1b Bifurcational behaviour: For a more global examination of the bifurcational
behaviour of the system, representative points of the trajectory for each set of
bifurcation parameters have been extracted. In the bifurcation diagram the displacements
$x_A$ (transition from stick to slip) and $x_B$ (transition from slip to stick) are
plotted as functions of the bifurcation parameters $\eta$ (figure 10a) and $F_N/c$ (figure 10b).
For clarity, once again the phase plane plot of a one-periodic solution is shown. In this
figure one can also distinguish one-periodic, higher-periodic and chaotic solutions. For
increasing values of $F_N/c$ (figure 10b) the length of the stick line grows, so the distance
between the points $x_A$ and $x_B$ becomes larger. However, the interval where the
transition from stick to slip takes place, given by $x_0 - u_0 \le x_{A_i} \le x_0 + u_0$, remains
constant.

In order to gain further insight it is useful to characterize the phenomena which 
change the system behaviour. 

Figure 9. (a) Chaotic trajectory in the three-dimensional phase space for $\eta = 1.97$, $F_N/c = 10$,
$u_0 = 0.5$. (b) Corresponding Poincaré map.

4.1c Winding number: From figure 8 it seems obvious that the number $n$ of rotations
of the trajectory around the centre axis of the torus, i.e. the number of excitation
periods, and the number $p$ of revolutions around the body of the torus are different for
different system parameters. Evaluating the winding number, e.g. Devaney (1986),
defined by

$$ W = \lim_{j \to \infty} \frac{\eta\,(\tau_{A_j} - \tau_{A_0})}{2\pi j}, \tag{28} $$

leads to the interesting result shown in figure 11. An increase of the bifurcation
parameter $\eta$ changes the bifurcational behaviour and results apparently in a monotonic
increase of the winding number. Regions of the same periodicity yield a constant
winding number. In the case of a regular system behaviour the winding number is
rational and given by the ratio $W = n/p$. Irregular system behaviour leads to an
irrational winding number. The behaviour of the winding number for changing system
parameters as shown in figure 11b is called a devil's staircase and has also been found in
other dynamical systems, e.g. Devaney (1986), Parlitz & Lauterborn (1987). A diagram
showing the inverse period $1/p$ versus the frequency ratio $\eta$ is given in figure 12 and
visualizes the self-similarity of the devil's staircase.
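
A finite-length estimate of the winding number can be computed directly from a sequence of stick-slip transition times. The Python sketch below assumes that the winding number counts excitation periods per stick-slip cycle, in line with $W = n/p$ above; the function name and the synthetic test data are illustrative only.

```python
import math


def winding_number(tau_a, eta):
    """Finite-j estimate of W from successive transition times tau_a[0], ..., tau_a[j]."""
    j = len(tau_a) - 1
    return eta * (tau_a[-1] - tau_a[0]) / (2.0 * math.pi * j)


# Synthetic check: a motion that needs n = 2 excitation periods for p = 3 cycles
eta, n, p = 1.5, 2, 3
cycle_time = (n / p) * 2.0 * math.pi / eta
times = [k * cycle_time for k in range(31)]
print(winding_number(times, eta))   # close to 2/3
```

For a regular motion the estimate settles at the rational value $n/p$ after a few cycles, whereas for irregular motion it converges only slowly with increasing $j$.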

The similarity of the results for the winding number to those of the well-known circle
map, e.g. Devaney (1986), Hale & Kocak (1991), leads to the question whether a
one-dimensional map can also be given for the friction oscillator.

Figure 10. Bifurcation diagram with variation of parameters (a) $\eta$ for $F_N/c = 10$
and $u_0 = 0.5$, (b) $F_N/c$ for $\eta = 2.1$ and $u_0 = 0.5$.

Figure 11. (a) Bifurcation diagram for $F_N/c = 10$ and $u_0 = 0.5$ with variation of $\eta$, (b)
corresponding winding number showing a devil's staircase.

4.1d A one-dimensional map: From the point-mapping approach (cf. figure 6) one can
define the map $H$ in the following way:

$$ H: S^1 = [0, 2\pi] \ni \eta\tau = \eta\tau_0 \mapsto H(\eta\tau) = \eta\tau_1 \in [0, 2\pi] = S^1. \tag{29} $$

For this one-dimensional mapping approach a more illustrative explanation can also be
given: The limit curve of the flattened top of the torus (figure 13) is given a priori by (13) for
all values $\eta\tau$. Starting the solution procedure at any limiting point $\eta\tau_0$, after some time
$\eta\tau_1$ the trajectory will return to one point of the limiting curve. So, the $H$-map plots the
arrival point $\eta\tau_1$ as a function of the start point $\eta\tau_0$.

Figure 14 shows the graph of $H(\eta\tau)$ for a one-periodic motion. This way of
description may be used to gain further insight into the different kinds of changes in the
system behaviour by means of a graphical analysis. First, one has to draw the diagonal
$D = \{(x, x) \mid x \in \mathbb{R}\}$. A vertical line from the start value of the iteration $(\eta\tau_0, 0)$ to the
graph of $H$ crosses the graph at $(\eta\tau_0, H(\eta\tau_0) = \eta\tau_1)$. Then a horizontal line from this
point to $D$ meets the diagonal at $(\eta\tau_1, \eta\tau_1)$. So, a vertical line to the graph followed by
a horizontal line back to $D$ yields the image of the point $\eta\tau_i$ under $H$ on the diagonal. An
orbit is given by drawing line segments repeatedly vertically from $D$ to the graph and
then horizontally from the graph to $D$. For the given set of parameters this procedure
yields the following result: The orbit runs into a stable fixed point representing the
one-periodic motion. The intersection points $I_{HD}$ of the curve $H(\eta\tau)$ with the diagonal
$D$ can be used as indicators for one-periodic motions. If $|H'(I_{HD})| < 1$, the fixed point is
stable, otherwise it is unstable, cf. Leven et al (1982). Besides one- and higher-periodic
motions, chaotic system behaviour can also be analysed utilizing the $H$-map. In the
following it shall be pointed out how the transition from regular motion to chaos takes
place. This scenario is called routes to chaos.
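
The graphical analysis amounts to iterating the map and testing the slope at the fixed point. Since the $H$-map of the friction oscillator is only available numerically, the following Python sketch uses a simple stand-in circle map, $\theta \mapsto \theta + A - B\sin\theta$, with assumed parameters; it is not the paper's $H$-map and serves only to illustrate the stability criterion $|H'| < 1$.

```python
import math

A, B = 0.3, 0.5   # assumed parameters of a stand-in map, not the paper's H-map


def h_map(theta):
    """Stand-in circle map on [0, 2*pi): theta -> theta + A - B*sin(theta)."""
    return (theta + A - B * math.sin(theta)) % (2.0 * math.pi)


def derivative(f, theta, eps=1e-6):
    """Central finite-difference approximation of f'(theta)."""
    return (f(theta + eps) - f(theta - eps)) / (2.0 * eps)


def iterate(f, theta, n):
    """Return the orbit theta_0, ..., theta_n (the corner points of the cobweb)."""
    orbit = [theta]
    for _ in range(n):
        theta = f(theta)
        orbit.append(theta)
    return orbit


orbit = iterate(h_map, 2.0, 200)
theta_star = orbit[-1]
slope = derivative(h_map, theta_star)
print(f"fixed point {theta_star:.6f}, |H'| = {abs(slope):.3f}, stable: {abs(slope) < 1}")
```

The stand-in map has two intersections with the diagonal, at $\sin\theta = A/B$. The orbit converges to the one with slope below one, while the other, with slope above one, repels nearby orbits.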



Figure 12. Diagram showing the inverse $1/p$ of the period $p$ of the oscillation
versus the frequency ratio $\eta$ for $F_N/c = 10$ and $u_0 = 0.5$ (compare figure 11).

Figure 13. Visualization of the definition of the map $H(\eta\tau)$.


4.1e Intermittency: One route to chaos is intermittency. This phenomenon is
characterized by intervals of apparently periodic motion interrupted by aperiodic bursts. This
behaviour was first observed by Manneville & Pomeau (1980) for the Lorenz equation,
where for a critical bifurcational value $\eta_c$ the stable periodic motion disappears and
chaos initiates. The phenomenon of intermittency can be understood intuitively if we look
at some suitable parts of the $H$-map. Figure 15a shows the situation for $\eta < \eta_c$. We can
distinguish two fixed points represented by the two intersections of the $H$-map with the
diagonal. One of them is stable, the other is unstable. For $\eta = \eta_c$ the graph of the $H$-map
becomes tangential to the diagonal. In figure 15b, which shows the situation for $\eta > \eta_c$,
these two fixed points have disappeared in a saddle-node bifurcation. Following the
iteration procedure in figure 15b near the diagonal, each iteration results in a small
change of $H(\eta\tau)$. In this region the solution seems to be stable, but after some iterations
the seemingly stationary behaviour will be interrupted by sudden bursts. Evaluating
the system response in the time domain for $\eta < \eta_c$ (figure 15c) and $\eta > \eta_c$ (figure 15d) and the
corresponding phase plane plots (figure 15e and figure 15f), the route to chaos via
intermittency can be summarized as follows.

With increasing frequency ratio $\eta$ the curve $H(\eta\tau)$ moves away from the diagonal.
After a saddle-node bifurcation no further stable fixed points exist and a chaotic motion
starts, characterized by seemingly periodic motions interrupted by sudden bursts.
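
The slow passage through the narrow channel between map and diagonal can be made tangible with a stand-in map. The Python sketch below again uses the assumed circle map $\theta \mapsto \theta + (B + \varepsilon) - B\sin\theta$, not the paper's $H$-map; for $\varepsilon > 0$ it has just lost its fixed points in a saddle-node bifurcation, and the number of iterations needed for one lap grows as $\varepsilon$ decreases.

```python
import math

B = 0.5   # assumed parameter of a stand-in map, not the paper's H-map


def lap_length(epsilon, theta=0.0):
    """Iterations of theta -> theta + (B + epsilon) - B*sin(theta) needed to advance by 2*pi."""
    a = B + epsilon                 # epsilon > 0: just past the saddle-node bifurcation
    target = theta + 2.0 * math.pi
    count = 0
    while theta < target:
        theta += a - B * math.sin(theta)
        count += 1
    return count


for eps in (1e-2, 1e-3, 1e-4):
    print(eps, lap_length(eps))
```

Most of these iterations are spent near $\theta = \pi/2$, where the increment per step is only $\varepsilon$. This corresponds to the long, seemingly periodic phases between the bursts.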

Figure 14. Graphical analysis for the $H$-map with $\eta = 1.55$, $F_N/c = 10$ and $u_0 = 0.5$.

Figure 15. Visualization of intermittency for $F_N/c = 10$, $u_0 = 0.5$ and $\eta = 1.8$ (left),
$\eta = 1.9$ (right): (a), (b) $H$-map, (c), (d) time history of displacement $x$, (e), (f) phase
plane plot.

4.1f Period doubling: Period doubling in nonlinear systems has been observed in all
branches of physics, chemistry and biology as well as in many technical problems. Also,
for the friction oscillator period doubling cascades can be found (figure 16). For
$\eta = 1.925$ the system exhibits a seven-periodic motion. For increasing $\eta$ the system
undergoes a bifurcation with a change to a periodic motion with twice the period of the
original oscillation. The seven-periodic orbit becomes unstable and a stable 14-periodic
orbit appears. Thereafter, a period doubling sequence occurs leading finally to
chaos. One can also distinguish periodic windows. Those periodic windows undergo
period doubling bifurcations, again leading to chaotic motion.

Figure 16. Bifurcation diagram with variation of the frequency ratio $\eta$ showing period
doubling for $F_N/c = 10$ and $u_0 = 0.5$.

4.2 Friction characteristic II 

For friction characteristic II the system also exhibits one-periodic (figure 17a),
two-periodic (figure 17b) or higher-periodic solutions (figure 17c). Because of the
smooth transition of the friction characteristic from stick to slip the phase trajectory is
smooth at the transition point A. The different types of motion can also be
distinguished by the power spectrum of the time series. For the periodic motions the
fundamental frequency $\Omega$ and the corresponding higher harmonics are visible as sharp
peaks (figure 18a, b). For the chaotic time series, the energy of the oscillator is spread over
a broad band of frequencies. Thus, the onset of broad-band spectra can serve as an
indicator of chaotic motion.

Figure 17. Phase plane plots ($F_N/c = 10$, $u_0 = 0.5$) for (a) $\eta = 0.94$, (b) $\eta = 1.15$, (c)

Figure 18. Frequency spectra ($F_N/c = 10$, $u_0 = 0.5$) for (a) $\eta = 0.94$, (b) $\eta = 1.15$, (c) $\eta = 1.05$.

The investigation of the bifurcational behaviour for friction characteristic II leads to
results similar to those presented for friction characteristic I, e.g. figure 19 shows two
bifurcation diagrams where the parameters $\eta$ (figure 19a) and $F_N/c$ (figure 19b) have
been varied. The results can be compared to those shown in figure 10. Here, a
self-similarity appears as follows: At each side of the one-periodic solution in figure 19b,
periodic orbits with period two can be found. Each of these periodic motions again is
surrounded by two periodic orbits of period three, and so on. As can be seen, the
intervals of periodic system behaviour in the bifurcation diagram are interrupted by
chaotic parameter regions.
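
The spectral signature of a periodic motion, sharp peaks at the fundamental frequency and its harmonics, can be reproduced with a few lines of Python. The sketch below uses a naive discrete Fourier transform on a synthetic signal; the signal, its amplitudes and the peak threshold are assumptions chosen for illustration and are not data from the friction oscillator.

```python
import cmath
import math


def amplitude_spectrum(samples):
    """Naive discrete Fourier transform; returns |X_k| / N for k = 0 .. N/2."""
    n = len(samples)
    return [abs(sum(s * cmath.exp(-2j * math.pi * k * i / n)
                    for i, s in enumerate(samples))) / n
            for k in range(n // 2 + 1)]


# Synthetic periodic signal: fundamental in bin 4 plus a weaker third harmonic in bin 12
N = 256
signal = [math.cos(2 * math.pi * 4 * i / N) + 0.3 * math.cos(2 * math.pi * 12 * i / N)
          for i in range(N)]
spectrum = amplitude_spectrum(signal)
peaks = [k for k, amp in enumerate(spectrum) if amp > 0.1]
print(peaks)   # [4, 12]
```

Applied to a chaotic time series, the same evaluation would show no isolated bins but energy distributed over many neighbouring frequencies.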

4.3 Comparison of the results for friction characteristics I and II 

Comparing the results gained for the different friction characteristics shows that the
global bifurcational behaviour is independent of the kind of friction characteristic
investigated. The bifurcational behaviour given by bifurcation diagrams with variation
of $\eta$ and $F_N/c$ is qualitatively similar (cf. figures 10 and 19). The described
phenomena seem to be robust with respect to the kind of friction characteristic used. It
should also be mentioned that the more local phenomena like period doubling and
intermittency described in §4.1 can be found for both types of friction characteristics. In
order to give an overview of the solution depending on the bifurcational parameters
$\eta$ and $F_N/c$, corresponding parameter maps have been calculated, cf. figures 20a and b.
Here, the periodicity of the solution is visualized by a colour code. Regions of
one-periodic orbits are marked in blue, regions of five- or higher-periodic orbits
including chaos are represented in red. In this diagram the results shown in figures 10
and 19 are marked by solid lines. For each value $\eta$ ($F_N/c$) on the horizontal (vertical) line
the periodicity can be read with the help of the given colour code. Thus, by means of
these parameter maps the system behaviour can be found for any set of parameters with
a fixed excitation amplitude $u_0 = 0.5$ within the plotted range.

Figure 19. Bifurcation diagram with variation of (a) $\eta$ for $F_N/c = 10$ and $u_0 = 0.5$,
(b) $F_N/c$ for $\eta = 2.1$ and $u_0 = 0.5$.

A comparison of the two parameter maps shows that the general structure of the tongues
for one-periodic motions is similar. For small values of $\eta$ the limit curves of the lowest
two blue regions are nearly identical. For high values of $\eta$ and $F_N/c$ the tongues of
high-periodic motions disappear for friction characteristic I. However, within the
investigated parameter range the blue regions of one-periodic orbits for friction
characteristic I are larger than those for characteristic II, whereas the red intervals of
high-periodic orbits are larger for characteristic II than those for I. The number of tongues
characterizing orbits of equal periodicity is smaller for characteristic I than for II. Another
interesting difference between the results for the two friction characteristics can be
observed e.g. for a constant value $F_N/c = 10$ and $\eta > 3$. Between the regions of
one-periodic and high-periodic orbits for friction characteristic I there are two-, three-,
and then four-periodic solutions without higher-periodic solutions in between, in
contrast to the results for friction characteristic II.

Figure 20. Parameter maps depending on $\eta$ and $F_N/c$ for $u_0 = 0.5$ for (a) friction
characteristic I, (b) friction characteristic II.

4.4 Friction characteristic III 


As shown before, the smoothing procedure gives a significant change of the friction
characteristic. For increasing values of the argument $c_1$ of the smoothing function
(cf. (1)), i.e. decreasing slope angle $\alpha$ of the smoothed characteristic, it becomes obvious
that the stick mode does not exist any more (figure 21), and the transition from stick to
slip and vice versa becomes smooth. Now, for a fixed set of parameters but varying
slope angle $\alpha$, one-periodic and higher-periodic orbits can be found. Increasing
$\alpha$ reduces the diameter of the closed curve representing the trajectory in the phase plane.
The maximal velocity of the mass is also reduced. For high values of $\alpha$ the smoothed friction
characteristic is monotonic and only periodic motions can be observed.

Figure 21. Phase plane plots with $\eta = 2.1$, $F_N/c = 15$ and $u_0 = 0.5$ for
smoothed friction characteristics: $\alpha = 0.03°$ (a), $0.4°$ (b), $7.0°$ (c), $50.0°$ (d).

In the following it will be investigated whether the smoothed characteristic III is a good
approximation of the original characteristic II. If the smoothing procedure is valid, then the
question arises which argument $c_1$ or which angle $\alpha$ should be chosen. For a small angle $\alpha$
the bifurcation diagram (figure 22a) gives a good approximation of that for the original
non-smooth friction characteristic (figure 19b). As mentioned before, the transition point $x_A$
from stick to slip does not exist any more for friction characteristic III. Thus, in the
bifurcation diagrams the points $x(v = 0)$ are shown, where the transition from positive to
negative mass velocity takes place. Increasing the angle $\alpha$ results in a shift of the
periodic solutions to higher values of $F_N/c$ (figure 22b and c). The bifurcational
behaviour depending on $F_N/c$ and the slope angle $\alpha$ is shown in the parameter map
(figure 23). This plot can be interpreted in the following way: For very small values of
the angle $\alpha$ the solution approximates very well the bifurcational behaviour of friction
characteristic II shown in figure 19 (compare the parameter regions of the one-, two-, and
three-periodic orbits). With increasing slope angle $\alpha$ corresponding regions of periodic
solutions shift to higher values of $F_N/c$, but the basic structure remains. For high values
of the angle $\alpha$ (> 50°) this structure is destroyed. As a simple explanation for this, a look
at the smoothed friction characteristic shows that for all relative velocities the slope of
the characteristic is positive, cf. figure 21d. With this validation of the smoothing
procedure for stiff smoothing functions, further insight into the bifurcational behaviour
can be obtained by applying numerical bifurcation analysis.

Figure 22. Bifurcation diagram with variation of $F_N/c$ for $\eta = 2.1$, $u_0 = 0.5$:
(a) $c_1 = 1000$, $\alpha = 0.1°$, (b) $c_1 = 45$, $\alpha = 2.0°$, (c) $c_1 = 20$, $\alpha = 4.5°$.

4.5 Bifurcation analysis for friction characteristic III 

As described in §3.3, in the parameter regions of periodic solutions of known winding
numbers, and with a guess for the initial values $y_1(0)$, $y_2(0)$, path following can be done
by means of BIFPACK. Compared to numerical integration, additional information
is provided by the detection of branch points and the branch tracing for unstable
periodic solutions. As a consequence of the chosen kind of normalisation, cf. (23), for
a winding number $W = n/p$ there are $n$ initial state vectors leading to the same periodic
trajectory. In the phase portrait (figure 24a) the corresponding initial values $y_1(0)$,
$y_2(0)$ for a one-periodic solution with $n = 3$ are marked by crosses. The plots of the
excitation function (figure 24b) and the system response (figure 24c) in the time domain
show the different periods $T/n$ with $n = 3$ and $T$, respectively. The results of the
path-following procedure are given in figure 25a, where $y_1[u(\tau) = 1]$ is plotted versus $\eta$.
Here, turning points are also detected and marked by crosses. The bifurcation diagram
based on numerical simulations plotted in the same manner, cf. figure 25b, shows
similar results for stable periodic solutions. However, in contrast to the bifurcation
diagram (figure 19a), the periodic solutions with $W = n/p$ are represented by $n$ points.
Figure 24. Influence of the winding number: (a) phase plane plot for $\eta = 2.4$,
$F_N/c = 10$, $u_0 = 0.5$, $n = 3$ (initial states $y_1$, $y_2$ marked by crosses), (b) time history of
the excitation with period $T/3$, (c) time history of the system response with period $T$.


In summary, for periodic solutions there is good agreement of the bifurcation 
parameters found by BIFPACK compared to numerical simulations. Additional 
information is given by BIFPACK with respect to the determination of unstable 
branches and the detection of branch points. 
5. Conclusions

This paper deals with friction-induced vibrations, in which the friction characteristic
plays an important role. Since in reality this characteristic is not known exactly, three
different models of friction characteristics have been investigated, two of which are
non-smooth. Thus, we have to cope with non-smooth dynamics, which is presently
a challenge for dynamicists and mathematicians.

The friction characteristics chosen allow self-sustained stick-slip vibrations to build
up; these are undesired in technical applications. It has been shown that the very
robust limit cycle of stick-slip motion can be broken up by an external harmonic
disturbance. The resulting friction oscillator with simultaneous self- and external
excitation shows rich bifurcational behaviour, which has been analysed using different
methods. A first general result is that the bifurcation diagrams and bifurcation maps
for the different non-smooth friction characteristics are qualitatively similar; thus, the
bifurcation phenomena are robust. Furthermore, the results for non-smooth friction
characteristics can be approximated very well by a smoothed characteristic if the
initial slope is very steep. As a consequence, besides numerical simulation methods,
computer codes for conventional bifurcation analysis can also be applied. A
comparison of corresponding results shows good agreement with respect to the
bifurcation parameters leading to periodic motions. However, behaviour other than
periodic cannot be investigated by such codes. The most promising is the third
method developed and demonstrated in this paper, a point-mapping approach that is
superior to the other methods with respect to computing time. This approach gives
insight into the periodic as well as the chaotic behaviour. The mapping
considered clearly shows the intermittency route and the period-doubling route to
chaos. By evaluating the winding number, a devil's staircase has been found. The mapping
approach together with the analysis of the winding number gives deep insight into the
general dynamical behaviour, although the model under consideration is based on
a non-smooth mathematical description.


The authors are grateful to Prof. A Mielke, Institute of Applied Mathematics, University
of Hannover, for his helpful discussions and valuable suggestions.
Computation of the Kolmogorov entropy for MDOF systems

F BONTEMPI 1 and F CASCIATI 2

1 Polytechnic of Milan, P.zza L. da Vinci 32, 20133-Milan, Italy

2 University of Pavia, Via Abbiategrasso 211, 27100-Pavia, Italy

Abstract. The dynamics of a truss structure with several bars and nodes is 
investigated. The bars are made of some nonlinear material and the truss 
structure shows large displacements under some time-dependent loads. The 
resulting equations are an example of a model which governs the motion of 
a general MDOF structure after some discretization procedure, like FEM. 
Attention is focused on the response strangeness met during the numerical 
integration of the motion equations. As a principal tool of investigation, 
one uses the Kolmogorov entropy, the value of which is able to detect the 
sensitivity to initial conditions. The material can present either nonlinear 
elastic softening or elasto-plastic behaviour. 

Keywords. Bifurcation diagram; chaotic motion; Kolmogorov entropy; 
truss structure. 


1. Introduction 

Extensive studies have been carried out, during the last decade, on the understanding of the
so-called chaotic behaviour of a nonlinear dynamic system, i.e. the sensitivity of the
motion to the values which specify system parameters, external disturbances and,
especially, initial conditions (Lichtenberg & Liebermann 1983; Wedig 1988; Baker
& Gollub 1990). For the dynamical systems of interest within structural
engineering, a variety of methods for detecting chaos are available; they have
recently been summarized in some doctoral theses (Kapitaniak 1990; Kunert 1992;
Bontempi 1993), among others. Two parallel analyses can be conducted in order to make
the study redundant and, hence, more reliable: in the first, the largest
Lyapunov exponent is computed (Benettin et al 1980; Wolf et al 1984; Wedig 1987,
1988); the second assesses the value of the Kolmogorov entropy (Hamilton
& Brumer 1982; Termonia 1984). In the first case a positive value means chaos or
stochasticity (Bontempi et al 1991), while regular behaviour is denoted by a
non-positive value; in the second case regularity is expressed by a null value (Bontempi
& Casciati 1994).

The previous algorithms are easily implemented for structures with a few degrees of
freedom. The objective of this paper is to show how large structural systems can be
handled in a Kolmogorov entropy computation. This is done with reference to
a multi-degree-of-freedom plane truss structure, in the analysis of which both material
and geometric nonlinearities are incorporated.

2. Definition of the system 

In this paper reference is made to the truss structure of figure 1. Each node has two
degrees of freedom, but the symmetry of the structure makes it possible to write the
equations of motion by considering six of the ten nodes. Let q be the vector of the
unknown displacements $\{q_1, q_2, \ldots, q_6; \dot{q}_1, \dot{q}_2, \ldots, \dot{q}_6\}^T$. The geometrical constraints are
imposed by two hinges at nodes 1 and 2 and two horizontal supports at nodes 5 and
6. There is a single harmonic vertical force $F(t) = A \sin(\omega t)$ along $q_6$, the values of the
parameters being $\omega = 1.13$ and $A = 0.275$. All the bars have area $S = 10$. With reference
to figure 2, the material constitutive law requires the following parameters to be
assigned:

• $E$ = the modulus of the initial elastic range = 1;

• $\sigma_y$ = the yielding stress = 0.01.

This paper studies the sensitivity of the motion to changes in the values of the hardening
modulus $H$, for different natures of the constitutive relation: nonlinear elastic or
elasto-plastic. The general equation of motion is written as

$M \ddot{q} + C \dot{q} + R(q, \dot{q}) = F(t)$, (1)

in which $M$ = mass matrix and $C$ = damping matrix (both of them are diagonal matrices:
$M_{ii} = 1$, $C_{ii} = 0.02$, $i = 1, \ldots, 6$). The nonlinear restoring force vector is denoted by
$R(q, \dot{q})$ and the force vector $F(t)$ has only one non-zero component:
$F_6(t) = F(t) = A \sin(\omega t)$.



Figure 1. Geometry of the truss structure considered through the paper as 
reference example. 
Figure 2. Definition of the constitutive relation: (a) nonlinear elastic; (b) elasto-plastic.


An updated Lagrangian formulation (Bathe 1982) is adopted in order to account for the
geometrical effects, i.e. large displacements; in this way the stiffness matrix of the single
bar is computed on the basis of the updated coordinates of the nodes of the structure. In
particular, the transformation matrix from the local to the global coordinate system
takes into account the variation of the inclination of the single bar.

The integration method is the Newmark one (Bathe 1982; Argyris & Mlejnek 1991),
with the choice $\beta = 0.5$ and $\gamma = 0.25$. The integration time step is $\Delta T = T_F/100$ ($T_F$
denotes the period of the external excitation: $T_F = 2\pi/\omega$). The six natural oscillation
periods for the truss structure are $T_1 = 1.96$, $T_2 = 1.99$, $T_3 = 3.18$, $T_4 = 3.20$, $T_5 = 20.2$,
$T_6 = 72.12$; the lower periods correspond to axial behaviour, while the higher ones
characterize a flexural behaviour of the whole structure. Therefore the truss structure
behaves essentially as a slender beam, with two main possibilities to deform itself,
roughly as a single sinusoidal wave or as a half sinusoidal wave.
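
As an illustration of the time integration described above, the following minimal sketch applies the Newmark method with the two parameters 0.5 and 0.25 (constant average acceleration) to a single-degree-of-freedom linear oscillator. It is not the authors' code: the truss has six degrees of freedom and a nonlinear restoring force, whereas the scalar linear stiffness, the function name `newmark_sdof` and the argument names `velocity_weight` and `displacement_weight` are assumptions made only for this example.

```python
import math


def newmark_sdof(mass, damping, stiffness, force, dt, n_steps,
                 q0=0.0, v0=0.0, velocity_weight=0.5, displacement_weight=0.25):
    """Integrate mass*q'' + damping*q' + stiffness*q = force(t) by the Newmark method.

    Returns the list of displacements at t = 0, dt, ..., n_steps*dt.
    """
    g, b = velocity_weight, displacement_weight
    q, v = q0, v0
    a = (force(0.0) - damping * v - stiffness * q) / mass  # initial acceleration
    # Effective stiffness of the implicit step (constant for a linear system).
    k_eff = stiffness + g / (b * dt) * damping + mass / (b * dt ** 2)
    history = [q]
    for step in range(1, n_steps + 1):
        t = step * dt
        rhs = (force(t)
               + mass * (q / (b * dt ** 2) + v / (b * dt) + (0.5 / b - 1.0) * a)
               + damping * (g / (b * dt) * q + (g / b - 1.0) * v
                            + dt * (0.5 * g / b - 1.0) * a))
        q_new = rhs / k_eff
        a_new = (q_new - q) / (b * dt ** 2) - v / (b * dt) - (0.5 / b - 1.0) * a
        v = v + dt * ((1.0 - g) * a + g * a_new)
        q, a = q_new, a_new
        history.append(q)
    return history


# Usage with the excitation of the paper and an assumed unit stiffness.
omega, amplitude = 1.13, 0.275
period = 2.0 * math.pi / omega
response = newmark_sdof(1.0, 0.02, 1.0, lambda t: amplitude * math.sin(omega * t),
                        dt=period / 100.0, n_steps=1000)
```

For a nonlinear restoring force the effective stiffness changes at every step, so each step would require an iteration (for example Newton-Raphson) on the residual of (1); this is omitted here.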


3. Bifurcation diagrams 

As a first synthetic tool for understanding the dynamics of this MDOF structure, one 
builds the bifurcation diagram (Baker & Gollub 1990); in this way the dependence on 
the nature of the constitutive law (i.e. nonlinear elastic or elasto-plastic) and on the 
value of the hardening parameter H is detected. 

Consider (1) and select the kind of material and the nature of the displacements 
(either infinitesimal (small displacements) or finite (large displacements)); the four cases 
of table 1 arise. 


Table 1. Different kinds of dynamic problems to be investigated.

                      Small            Large
                      displacements    displacements

Nonlinear elastic     SE               LE
Elasto-plastic        SP               LP

Figure 3. Bifurcation diagram for large displacements: (a) the nonlinear elastic 
case (LE); (b) the elasto-plastic case (LP). 


For them, the bifurcation diagrams are built in the following manner:

• the variability range for the parameter $H$, the influence of which must be studied,
is selected: $\Delta H = H_{max} - H_{min}$;

• $N_H + 1$ values of the parameter $H$ are allocated in this range:
$H_k = H_{min} + (\Delta H/N_H) \cdot k$, $k = 0, \ldots, N_H$ (here $N_H = 150$);

• for each value $H_k$ one integrates the equation of motion; after the transient period $T_{tr}$
(numerical experience suggests $T_{tr} = 100 \cdot T_F$), one extracts from the vector q, at each
instant $t_h = T_{tr} + h \cdot T_{sn}$, $h = 1, \ldots, N_s$ (here $N_s = 100$), the component $q_6$, and
specifies a point $(H_k, q_6)$ in a diagram with abscissa $H$ and ordinate $q_6$. The choice of
the time period $T_{sn}$ is critical, because it determines the appearance of the diagram; in
the presence of a periodic force of period $T_F$, the natural choice turns out to be
$T_{sn} = T_F$.

The interval $\Delta H = H_{max} - H_{min}$ of the hardening parameter was selected to be
$[-1, 1]$ for the nonlinear elastic case and $[0, 1]$ for the elasto-plastic case. The effect of
the softening branch (i.e. the situation $H < 0$) can be observed only in the former case.
The results obtained in the large displacement case are shown in figure 3:

• in the nonlinear elastic case (figure 3a), the strangeness of the motion (that
characterizes quasiperiodic or aperiodic (i.e. chaotic) oscillations) appears in the
softening area;

• in the elasto-plastic case (figure 3b), the motion appears strictly periodic for $H \geq 0.3$,
while it is sometimes strange for $0 < H \leq 0.3$.

Figure 4 shows the same cases for small displacements. For the elastic case, no chaotic 
situation is shown in the hardening branch; some strangeness remains in the elasto- 
plastic case. Clearly, deeper computations are necessary to declare this case truly 
chaotic: in fact the time integration of elasto-plastic structures presents special features 
(Bontempi & Casciati 1995). In this paper, however, attention is focused on the measure 
of the irregular behaviour, irregularity that can also result from numerical inaccuracies. 
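
The construction of the bifurcation diagram described in this section can be sketched as follows. This is a simplified illustration under stated assumptions, not the authors' implementation: the truss is replaced by a single driven oscillator with a hypothetical bilinear nonlinear-elastic law (`restoring_force`), a fixed-step fourth-order Runge-Kutta scheme stands in for the Newmark integration, and the function and argument names are invented for the example. What it keeps from the text is the sweep over $H_k$, the discarded transient, and the sampling of the displacement once per forcing period.

```python
import math


def restoring_force(q, hardening, stiffness=1.0, yield_disp=0.01):
    """Bilinear nonlinear-elastic law (hypothetical stand-in for the truss)."""
    if abs(q) <= yield_disp:
        return stiffness * q
    return math.copysign(stiffness * yield_disp + hardening * (abs(q) - yield_disp), q)


def bifurcation_points(h_min, h_max, n_h, n_transient, n_samples,
                       amplitude=0.275, omega=1.13, damping=0.02,
                       steps_per_period=100):
    """Return the points (H_k, q) sampled once per forcing period after the transient."""
    period = 2.0 * math.pi / omega
    dt = period / steps_per_period

    def accel(t, q, v, h):
        return amplitude * math.sin(omega * t) - damping * v - restoring_force(q, h)

    points = []
    for k in range(n_h + 1):
        h = h_min + (h_max - h_min) / n_h * k  # H_k = H_min + (dH / N_H) * k
        q, v = 0.0, 0.0
        for p in range(n_transient + n_samples):
            for s in range(steps_per_period):
                t = (p * steps_per_period + s) * dt
                # Classical fourth-order Runge-Kutta step for (q, v).
                k1q, k1v = v, accel(t, q, v, h)
                k2q = v + 0.5 * dt * k1v
                k2v = accel(t + 0.5 * dt, q + 0.5 * dt * k1q, k2q, h)
                k3q = v + 0.5 * dt * k2v
                k3v = accel(t + 0.5 * dt, q + 0.5 * dt * k2q, k3q, h)
                k4q = v + dt * k3v
                k4v = accel(t + dt, q + dt * k3q, k4q, h)
                q += dt * (k1q + 2.0 * k2q + 2.0 * k3q + k4q) / 6.0
                v += dt * (k1v + 2.0 * k2v + 2.0 * k3v + k4v) / 6.0
            if p >= n_transient:
                points.append((h, q))  # one point per forcing period
    return points
```

A call such as `bifurcation_points(0.0, 1.0, 150, n_transient=100, n_samples=100)` reproduces the sweep sizes quoted in the text for the hardening range; a periodic response then shows up as a few repeated ordinates per abscissa, an irregular one as a scattered column of points.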

4. Some simulations and the definition of the embedding phase space 

Some numerical simulations show the kind of motion the structure develops and
provide quantitative measures of the strangeness (i.e. the chaoticity). The results
of these simulations, for the selected material law and for fixed values of the
hardening parameter $H$, are represented in figures 5 and 6 in terms of time history and of
the projection of the Poincaré section in the plane $(q_6, \dot{q}_6)$. The transient is retained,
to show that, in some cases, one has transient chaos.

Essentially, the aim of these simulations is to define the dimension $n_e$ of the
embedding phase space (Jackson 1991). Instead of working in the true phase space of
dimension $n$ (for the structure studied here, $n = 12$), a single time series can
give the main aspects of the dynamics of the structure in the reconstructed phase space
of dimension $n_e < n$, in which the system dynamics can be embedded. With $n_e$ known,
this is done by replacing at each instant $t_k$ ($t_k = t_0 + k \cdot \Delta T$, $k = 1, \ldots$) the true
representative vector of the system $y(t_k)$, the $2 \cdot n$ components of which are $q_i(t_k)$ and
$\dot{q}_i(t_k)$, with the vector $z(t_k)$ of $n_e$ components

$z(t_k) = \{z_{k + 0 \cdot \Delta k}, z_{k + 1 \cdot \Delta k}, z_{k + 2 \cdot \Delta k}, \ldots, z_{k + (n_e - 1) \cdot \Delta k}\}^T$, (2)

with

$z_k = q_6(t_k)$, $k = 1, \ldots$ (3)

Equation (3) represents the time series extracted from the time history of a particular
degree of freedom of the structure (here just $q_6(t)$).

Figure 4. Bifurcation diagram for small displacements: (a) nonlinear elastic behaviour
(SE); (b) elasto-plastic behaviour (SP).
Table 2. Summary of the quantitative measures for the simulations in the nonlinear
elastic case (large displacements).

  $H$    $d_G(2)$  $d_G(3)$  $d_G(4)$  $d_G(5)$  $d_G(6)$   $K$    $\lambda_{max}$

  1.0     1.93      1.90      2.20      2.47      2.28     0.00    -0.29
  0.1     1.85      2.03      2.37      2.30      2.07     0.04     0.02
  0.0     1.78      1.99      2.13      2.39      2.37     0.00    -0.98
 -0.2     2.09      3.02      3.36      3.45      3.49     0.30     0.35
 -0.5     1.86      1.89      2.14      2.08      2.38     0.03     0.02


The vector $z(t_k)$ is therefore obtained from an ordered set of values of the same
displacement, each of them sampled with a convenient time delay $T_s = \Delta k \cdot \Delta T$. The
choice of the time delay $T_s$ should be guided by the fact that there must exist a
topological correspondence between the true representation in the original phase space
and the one in the reconstructed phase space. Numerical experience shows that an
optimal value of $\Delta k$ separates cases of excessive correlation from cases that are too
scattered (Jackson 1991). In the present example the optimal value was $\Delta k = 20$; this
means a time delay $T_s = 20 \cdot \Delta T = 20 \cdot T_F/100 = T_F/5$.
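
The delay reconstruction of (2) and (3) can be sketched in a few lines. The function name `delay_embed` and its arguments are assumptions made for this illustration, and indices start at zero here, whereas the text counts $k$ from 1.

```python
def delay_embed(series, dim, lag):
    """Return the delay vectors [z_k, z_{k+lag}, ..., z_{k+(dim-1)*lag}] for every valid k."""
    n_vectors = len(series) - (dim - 1) * lag
    if n_vectors <= 0:
        raise ValueError("series too short for this dimension and lag")
    return [[series[k + j * lag] for j in range(dim)] for k in range(n_vectors)]
```

With a sampled history of $q_6$ stored in a list (say `q6_samples`, a hypothetical name), the setting of the paper would correspond to `delay_embed(q6_samples, dim=4, lag=20)`.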

The problem is how to determine the dimension $n_e$. This is done by

• selecting a trial dimension $n_c$ for the reconstructed phase space (i.e. the system
dynamics is represented by (2) with $n_e = n_c$);

• computing the correlation dimension of the underlying attractor of the system:

$d_G(n_c) = \lim_{R \to 0} [\lg_2(C(R, n_c)) / \lg_2(R)]$, (4)

where $C(R, n_c)$ is the correlation function, i.e. the fraction of points of the attractor
which are not farther away than $R$ from a sampling point of the attractor itself
(Baker & Gollub 1990).

Repeating the previous steps for increasing values of $n_c$, the embedding dimension
$n_e$ can be regarded as the smallest integer greater than all the attractor dimensions
$d_G(n_c)$ previously computed.
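
A rough numerical counterpart of (4) is sketched below. Two choices are assumptions of this example rather than statements about the authors' procedure: the correlation function is evaluated as the fraction of distinct point pairs closer than $R$, and the limit $R \to 0$ is replaced by the slope of $\lg_2 C$ against $\lg_2 R$ between two finite radii, which is the usual practical substitute. The function names are hypothetical.

```python
import math


def correlation_sum(points, radius):
    """Fraction of distinct point pairs whose Euclidean distance is at most radius."""
    n = len(points)
    close = sum(1 for i in range(n) for j in range(i + 1, n)
                if math.dist(points[i], points[j]) <= radius)
    return 2.0 * close / (n * (n - 1))


def correlation_dimension(points, r_small, r_large):
    """Slope of lg2 C(R) against lg2 R between two radii (finite-R stand-in for (4))."""
    c_small = correlation_sum(points, r_small)
    c_large = correlation_sum(points, r_large)
    return ((math.log2(c_large) - math.log2(c_small))
            / (math.log2(r_large) - math.log2(r_small)))
```

Both radii must be large enough for at least one pair to fall within them, otherwise the logarithm is undefined. Applied to the delay vectors for $n_c = 2, \ldots, 6$, the estimate would give one row of tables 2 and 3.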

In tables 2 and 3 one has some quantitative measures obtained from the time series
of (3), for large displacements. The first row of both tables corresponds to the
perfectly linear elastic case. The first five columns of these tables give the numerical
values of the correlation dimension $d_G(n_c)$, with increasing value of $n_c$ from 2 to 6.
From these measures of the dimension of the attractor, one realizes the possibility of
working in a phase space with embedding dimension $n_e = 4$ (this is in agreement with
the observation that the structure has two principal natural modes).

Table 3. Summary of the quantitative measures for the simulations in the
elasto-plastic case (large displacements).

  $H$    $d_G(2)$  $d_G(3)$  $d_G(4)$  $d_G(5)$  $d_G(6)$   $K$    $\lambda_{max}$

  1.0     1.93      1.90      2.20      2.47      2.28     0.00    -0.29
  0.3     1.91      2.19      2.30      2.44      1.99     0.47     0.52
  0.2     2.00      2.12      2.25      2.62      2.27     0.84     0.83
  0.1     1.90      2.11      2.40      2.58      2.58     0.70     0.81
  0.0     2.00      2.51      2.54      2.51      2.55     0.48     0.51

The last two columns of the two tables show the values, computed in the embedding
space, of the Kolmogorov entropy $K$ and of the maximum Lyapunov exponent $\lambda_{max}$.
The latter is computed by standard procedures (Wolf et al 1984), while $K$ is assessed
by the procedure illustrated in the next section.


5. The computation of the Kolmogorov entropy 


A quantitative evaluation of the behaviour of a dynamical system can be obtained
by partitioning its phase space. This operation is conducted through a grid of boxes
$V_i$, $i = 1, \ldots, N_{box}$, of characteristic size $\varepsilon$, which is indicative of the capacity to measure
the status of the system (1) in the phase space. The numerical quantities extracted
through the partition of the phase space are only approximate. The approximation
improves as the number of boxes increases (i.e. the size $\varepsilon$ decreases). The number $N_{box}$ of
boxes, however, is clearly related to the computational effort. The extent of the
region of the phase space considered for the partition also depends on analogous
computational considerations, despite the fact that it is necessary to extend the partition
beyond the points of the phase space visited by the system during its motion.

The first elementary measure to characterize the dynamics of the system is just the
number of boxes $N_{box}$ which are visited during the motion. More significant
information is given by the probability mass function

$p_i(t) = \int_{V_i} p(y, t) \, dV_i$, (5)

of finding the status of the system in the $i$th box of volume $V_i$. In (5) $p(y, t)$ denotes the
probability density function in the phase space. This allows one to study systems with
many attractors, with external noise and/or with sensitivity to the initial conditions.

Related to the form of the probability mass function, there is a quantity that
measures the disorder connected with the motion of the system. This quantity is the
entropy of the system (Lichtenberg & Liebermann 1983; Baker & Gollub 1990).
Mathematically, it is defined by the relation:

$I(t) = -\sum_i p_i(t) \cdot \lg_2(p_i(t))$, (6)

where the summation covers all the visited boxes (i.e. where $p_i \neq 0$). It is clear that,
numerically, this expression depends on the discretization of the phase space. Also, this
quantity is a function of time because the quantities $p_i(t)$ are continuously varying in
time. The average velocity of variation of the entropy, over the (diffusion) period
required for the system to reach a stationary status, is called the Kolmogorov entropy
(Baker & Gollub 1990; Casciati 1993)

$K(t) = \langle \partial I(t) / \partial t \rangle$. (7)




It has the physical meaning of the velocity with which one loses information about
the status of the dynamical system in the phase space. It should be remarked again that
this loss of predictability can also result from numerical inaccuracies. In this case, a
positive $K$ warns against numerical difficulties in the integration procedure.

In the presence of a high number of degrees of freedom, for computing the entropy
$I(t)$, and hence $K$, one must deal with the partition of a high-dimensional phase space.
For a space of dimension $n$, the partition with boxes of side $\varepsilon$ of the cube of side $L$
which contains the attractor requires a number of boxes

$N_{box} \propto [L/\varepsilon]^n$. (8)

With $L/\varepsilon = 10$ as a minimum, (8) leads to $N_{box} \propto 10^n$. The advantage of operating
in the embedding space (with $n_e \ll n$) is evident. Following this idea,
a computational technique has been conceived and implemented. It is based on
the renormalization of the Gibbs set (Bontempi 1993; Casciati 1993), whose evolution
is built from the knowledge of a single variable of the dynamical system (in this
paper from (3)). One assigns a characteristic size $\varepsilon$ and proceeds along the following
steps:

• a reference point $z(t_i)$ is selected in the embedding space;

• at the instant $t_i$, all the points $j \neq i$ of the series (2) that are within a single box of side
$\varepsilon$ centred around the point $i$ are considered to form the Gibbs set; note that these
points $j$ should refer to times well different from the one of the point $i$, i.e. $i \gg j$ or $i \ll j$,
because otherwise one considers points that are just the positions assumed by the
reference point during its diffusion interval. These points are initially disposed in
a single box of side $\varepsilon$: therefore the entropy of this set is zero since the set is perfectly
ordered;

• the evolution of the Gibbs set is followed until an instant $t_f = t_i + \Delta T_i$ sufficient to
expand the Gibbs set (but such that the size of the Gibbs set remains lower than
$0.1 \cdot D$, $D$ being the maximum size of the attractor in the embedding space; in other
words, $\Delta T_i$ (here $= T_F/2$) must be lower than the diffusion time interval necessary for
the system to reach the stationary status);

• at this point, one considers a local partitioning of the phase space with a grid of just
one or two boxes along each dimension, around the position reached by the
reference point. The number of boxes necessary to contain the deformed Gibbs set is
then roughly only

$N_{box} \propto 2^{n_e}$. (9)

One computes $p_i(t_f)$ and then $I(t_f)$. The Kolmogorov entropy $K$ during the interval
of time $t_f - t_i = \Delta T_i$ is just

$K = I(t_f)/(t_f - t_i)$; (10)

Repeating the previous steps along the evolution of the reference point (that means
switching from $z(t_i)$ to $z(t_f)$), one obtains an approximation of $K$ as soon as the
measure stabilizes. In particular the second step corresponds to a renormalization;
it is the trick by which only a local partition of the embedding space (instead of
a global one) is necessary.
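
The steps above can be sketched as follows for a list of delay vectors. This is a simplified reading of the procedure, not the authors' implementation: the function name and all argument names are hypothetical, the local grid is not limited to one or two boxes per dimension (boxes of side $\varepsilon$ centred on the evolved reference point are simply counted wherever the set lands), the check that the set stays smaller than $0.1 \cdot D$ is left to the caller through the choice of `horizon`, and the result is a plain average over reference points rather than a monitored, stabilizing measure.

```python
import math
from collections import Counter


def kolmogorov_entropy_estimate(vectors, box_size, horizon, dt,
                                min_separation, min_members=5):
    """Average I(t_f)/(t_f - t_i) over successive reference points.

    vectors        delay vectors z(t_k), equally spaced in time by dt
    box_size       characteristic size of the boxes
    horizon        number of samples between t_i and t_f
    min_separation minimum index distance between a member and the reference
    """
    rates = []
    last_start = len(vectors) - horizon
    for i in range(0, last_start, horizon):  # next reference = evolved reference
        ref = vectors[i]
        # Gibbs set: temporally distant points in the box of side box_size centred on ref.
        members = [j for j in range(last_start)
                   if abs(j - i) >= min_separation
                   and all(abs(a - b) <= 0.5 * box_size
                           for a, b in zip(vectors[j], ref))]
        if len(members) < min_members:
            continue  # too few neighbours for a meaningful probability estimate
        # Local partition: boxes of side box_size centred on the evolved reference point.
        ref_f = vectors[i + horizon]
        counts = Counter(
            tuple(math.floor((a - b) / box_size + 0.5)
                  for a, b in zip(vectors[j + horizon], ref_f))
            for j in members)
        total = len(members)
        info = -sum(c / total * math.log2(c / total) for c in counts.values())
        rates.append(info / (horizon * dt))  # equation (10) for this reference point
    return sum(rates) / len(rates) if rates else float("nan")
```

Because every Gibbs set is rebuilt inside a single box around the current reference point, only the few boxes reached after `horizon` steps ever need to be counted, which is the saving expressed by (9) with respect to (8).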
A complete study of the behaviour of the structure of figure 1 is shown in figure 7
for the nonlinear elastic material law, and in figure 8 for the elasto-plastic case. In
both cases, the top of the figure shows the effect of the presence of large displacements,
while the bottom gives results for small displacements. As already emphasized,
the embedding space dimension is $n_e = 4$, according to the results in tables 2 and 3.
The main remark coming from these pictures concerns the spread of the irregular
motion region along the design variable ($H$ in the case under investigation).




Figure 7. Spectrum of the Kolmogorov entropy and of the maximum Lyapunov 
exponent for the nonlinear elastic case: top, with large displacements (LE); bottom, 
small displacements (SE). 
Figure 8. Spectrum of the Kolmogorov entropy and of the maximum Lyapunov
exponent for the elasto-plastic case: top, with large displacements (LP); bottom, 
small displacements (SP). 


6. Conclusions 

This paper proposes an operative approach to the assessment of the Kolmogorov
entropy for multi-degree-of-freedom systems. The value of the Kolmogorov entropy
permits one to reach conclusions on the predictability (whose loss can be due either to
numerical inaccuracies or to truly chaotic behaviour) of the vibration of the structural
system under investigation. The idea is to make available an approach parallel to the
standard computation of the maximum Lyapunov exponent. The need for it is two-fold:
on the one hand, two alternative algorithms form a redundant, and hence more reliable,
tool of investigation; on the other hand, the calculation of the Lyapunov exponents still
suffers from numerical problems.

The algorithm for the computation of the Kolmogorov entropy is based essentially
on two concepts: (1) the Gibbs set (i.e. a collection of trajectories of the system to study)
and (2) the embedding phase space (i.e. the reduced phase space in which the dynamics
of the system can be represented). Both these ingredients are obtained from the time
history of just a single leading degree of freedom of the structure. Clearly, an exact
computation of the Kolmogorov entropy for MDOF systems would require a
multivariate analysis; in this sense, the algorithm proposed provides only an approximation
of $K$. Nevertheless, consider a set of similar structures characterized by different values of
the parameters that define the geometry, the material and the excitation. For the same
choice of the leading degree of freedom, the associated time histories are always
sufficient to detect predictability. For the structure considered in this paper, the
numerical effort to compute the Kolmogorov entropy was approximately five times
that required by the computation of the maximum Lyapunov exponent.


This research has been supported by grants from the Italian Ministry of University and 
Scientific and Technological Research (MURST). 
Nonperiodic flutter of a buckled composite panel

Abstract. The nonlinear vibrations of a composite panel subjected to
uniform edge compression and a high-supersonic coplanar flow are analysed.
Third-order piston theory aerodynamics is used and the effects of in-plane
edge restraints, small initial geometric imperfections, transverse shear
deformation, and transverse normal stress are considered in the structural
model. Periodic solutions and their bifurcations are determined using
a predictor-corrector type Shooting Technique, in conjunction with the
Arclength Continuation Method for the static state. It is demonstrated that
third-order aerodynamic nonlinearities are destabilizing, and hard flutter
oscillations (both periodic and quasiperiodic) of the buckled panel are
obtained. Furthermore, chaotic motions of an uncompressed panel, as well
as a buckled-chaotic transition, and chaos via period-doubling are possible,
and the associated Lyapunov exponents are computed. A coexistence of the
buckled state with flutter motion may also occur. Results indicate that edge
restraints parallel to the flow do not significantly affect the immediate
post-critical response, and that a higher-order shear deformation theory is required
for a moderately thick/flexible-in-transverse-shear composite panel.

Keywords. Panel flutter; non-periodic motion; chaos; shear deformation;
laminated composite plates.

1. Introduction 

For the panel flutter problem, it is well known that at post-critical flow speeds the
induced membrane forces limit the oscillation amplitudes, thus preventing immediate
panel failure. However, aerodynamic nonlinearities are more pronounced and usually
destabilizing at high-supersonic speeds, and may cause the panel to experience hard
flutter (i.e., a violent transition from equilibrium to finite motions). The application of
in-plane compression (e.g., due to aerodynamic heating) can significantly alter the
dynamic behavior of the panel (especially when the Euler buckling load is exceeded), as
could the restraining of edges from in-plane motion. Thus, transitions to various
periodic and nonperiodic motions may occur for small variations of the control
parameters (i.e., compressive load and flow speed). Furthermore, composite structural
components, which are preferred in aerospace applications due to their high
stiffness/strength to weight ratio, are usually characterized by weak rigidities in
transverse shear.

A list of symbols is given at the end of the paper

When considering the aerodynamically nonlinear analysis of panels for various edge 
restraint conditions, Bolotin (1963) obtained hard (dangerous) and soft (benign) flutter 
scenarios, Librescu (1965) determined the nature of the flutter boundary, and McIntosh 
(1973) obtained nonsymmetric limit cycles and the critical loads yielding amplitude- 
sensitive instability. Holmes (1977) showed that a buckled equilibrium may coexist with 
flutter about the flat state, and that the transition from a buckled state to non-sinusoidal 
(periodic) flutter [see also Dowell (1966, 1984) and Shiau & Lu (1992)] occurs via a 
sudden jump. 

Dowell (1982) showed that chaotic motions [also obtained by Shiau & Lu (1992) for 
composite panels, and by Yuen & Lau (1991)] may be suppressed by increasing the static 
pressure load. Chaos via period-doubling and intermittency was observed by Dowell 
(1984) while increasing the compressive load [cf. Swinney (1983) and Seydel (1988) for chaos 
scenarios]. Aperiodic motions and coexistence of many stable and unstable limit cycles 
were discovered by Yuen & Lau (1991). When penetrating the chaotic regime by increasing 
the compression, Dowell & Virgin (1990) observed a temporal loss of correlation (i.e., 
temporal chaos) followed by one that is spatial also (i.e., spatio-temporal chaos due to 
prominent higher modes). Zhao & Yang (1990) demonstrated that chaos is more likely 
for supersonic flow over a buckled panel than incompressible flow past an airfoil. For 
a fluttering plate undergoing a pull-up manoeuvre, Sipcic (1990) and Sipcic & Morino 
(1991) increased the manoeuvre load factor and obtained chaotic responses via
intermittency, torus-bifurcation, period-doubling, and a jump phenomenon.

The primary forcing-flutter resonance problem was studied by Dzygadlo (1970). 
Kuo et al (1973) obtained higher flutter speeds (beyond which forcing-flutter and 
pure-forced response may coexist) when considering forcing-flutter interaction. 
The effect of the chord-to-length ratio on modal convergence and flutter amplitudes 
of a cantilevered plate was studied by Weiliang & Dowell (1991). Barai & Durvasula 
(1994) analysed the effects of shear deformation, ply orientation, and aspect ratio 
on the critical speeds of hybrid composite panels, for which they found that piston 
theory was the most appropriate aerodynamic model. Finite element method based 
analyses were done by Rao & Rao (1980), Han & Yang (1983), and Sarma & Varadan 
(1988) for various in-plane and out-of-plane boundary conditions, by Gray et al 
(1991) who presented results indicating the destabilizing effect of third-order piston 
theory aerodynamics, and by Liaw & Yang (1993) for laminated panels possessing 
structural uncertainties and subjected to uniform and non-uniform compression. 

In the present study, the nonlinear dynamic response of a flat, laminated panel subjected 
to a high-supersonic gas flow of arbitrary orientation is analyzed. The model incorporates 
the combined effects of geometric and aerodynamic nonlinearities, transverse shear 
deformation, in-plane edge loads and edge restraints, and small initial geometric
imperfections (e.g., manufacturing defects). Arclength continuation is used to determine the
static state, periodic motions about which are obtained by a shooting technique that affords a
systematic investigation of bifurcating motions as well. Direct numerical integration
is considered for nonperiodic states, which are characterized using Lyapunov
exponents.

Figure 1. (a) Cross-section of a symmetric laminate; (b) Panel with flow and edge-compression.


2. Panel flutter equations 

Consider a flat rectangular panel ($l_1 \times l_2$) of thickness $h$, symmetrically laminated with
$2m + 1$ transversely isotropic layers (Librescu 1975) having their plane of isotropy
parallel to the mid-plane. Furthermore, $(x_\omega, x_3)$ are orthogonal Cartesian axes, and the
undeformed mid-plane ($x_3 = 0$) contains $x_\omega$ ($\omega = 1, 2$) (cf. figure 1).

The governing equations are obtained by following Librescu & Reddy (1989),
Librescu & Stein (1991), Chandiramani (1993), and Chandiramani et al (1994), and by
considering quasi-steady, third-order piston theory aerodynamics (e.g., Bolotin 1963).
Thus, for a shear-deformable, simply-supported panel subjected to a high-supersonic
flow, the HSDT (higher-order shear deformation theory) equations describing the large
lateral motion $u_3[x_\omega, t]$ (measured from the initially unstressed, imperfect state
$\mathring{u}_3[x_\omega]$) are

(■b "F c)C ,ioo}nn "F 0‘5(n 3iU£!) u 3i7t7C -F 2 m 3c0<0 u 3>jnt — tt 3oJn u 3oy7l — 2u 3 W7t /i 3iW7C ) + 

2^//^( W 3,juoa> W 3 ,7t “F W 3,j«oto^3,7t "F ^3,ncom U 3,n “F U 3,nm U 3,tio> “F ^ U 3,itu} ^3,tico) == ^ 
^ U 3,tomaa ~ 8 aco 8 /}p[C ,mp{ U 3,a() *F ^3,<x/)} ~ ^ ,cap{ U 3,ocp ~F &3,ap } ),**] ~ 

j/[u 3 + u 3 ] 4- Sj/[u 3 + u 3 ], aa + m 0 [ti 3 + cu 3 - T(u 3>aa + cw 3i£ J] = 0 
Equations (1)$_{1,2}$, expressed in terms of the Airy stress function $C[x_\omega, t]$ and the
transverse deflection $u_3$, are referred to herein as the von Kármán-type compatibility
and transverse force equilibrium equations, respectively (see Librescu & Stein 1991),
while $\mathcal{A}[\cdot]$ denotes the nonlinear aerodynamic operator whose expression (1)$_3$ is
consistent with high supersonic flight Mach numbers. Here, $\mathring{u}_3$ is assumed small
relative to $u_3$, and $\delta_H = 1$ for the HSDT. Furthermore, $\varepsilon_{\alpha\omega}$, $c$, $\kappa$, $p_\infty$, $c_\infty$, $U_\infty$, $\Lambda$, and
$M$ ($= U_\infty/c_\infty$) are the 2D permutation symbol, viscous structural damping coefficient,
polytropic gas coefficient, undisturbed air pressure, speed of sound, flow speed,
coplanar flow angle (cf. figure 1b), and flight Mach number, respectively, and $\delta_f = 0$,
$\hat{\delta}_f = 2$ [$\delta_f = \hat{\delta}_f = 1$] when flow occurs past both faces [upper face only]. This affords
a consistent representation of either one-sided or two-sided high-supersonic flow, which
is modelled through the presence of aerodynamically nonlinear terms in $u_3$ (see (1)$_{2,3}$).
The coefficients $m_0$, $b$, $c$, $d$, $D$, $S$, and $T$ are given in Chandiramani (1993), and the
Einsteinian summation convention is implied for repeated Greek indices (which range
over values 1, 2). Terms containing the coefficients $S$ and $T$ represent static and dynamic
contributions, respectively, arising from transverse shear deformation of the composite
panel, whereas $D$ and $m_0$ denote the panel bending rigidity and mass per unit area,
respectively. The classical plate theory (CPT) equations are obtained by considering an
infinite transverse shear rigidity and $\delta_H = 0$ in (1)$_{1,2}$.

Considering the representations
$\{u_3, \mathring{u}_3\}^T = \sum_{m,n=1}^{\infty} \{A_{mn}[t], \mathring{A}_{mn}\}^T \sin(m\pi x_1/l_1) \sin(n\pi x_2/l_2)$
(which satisfy the out-of-plane boundary conditions), solving (1)$_1$ for $C$ (cf. Chandiramani
1993, Chandiramani et al 1994), and then using Galerkin's method for (1)$_2$,
the nondimensional panel flutter equations read
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r,s 

where the coefficients are defined in Chandiramani (1993), and % a implies no summa¬ 
tion over the index a. The tracer d Mm equals 1 (0) when the edges x w = 0, l a are movable 
[immovable/partially restrained] in the in-plane direction normal to the edge, and 
■^rsmn^rsmJ is 0 if the edges along the x x [x 2 ] direction are movable. Furthermore, 
$a_{mn}$ ($= A_{mn}/h$), $\mathring{a}_{mn}$ ($= \mathring{A}_{mn}/h$), $L_\alpha^*$, $\lambda$, $\varepsilon_s$, $\varepsilon_a$, and $\tau$ are the dimensionless modal and
imperfection amplitudes, applied edge loads (positive in tension), flow speed, structural
and aerodynamic damping coefficients, and time (henceforth replaced by $t$ for
convenience), respectively (cf. appendix A). The (4 × 1) and (2 × 2) cases of modal
truncation were considered using MACSYMA ($k \times l$ truncation implies $k$ chordwise
and $l$ spanwise modes).

3. Flutter response 

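For temporally constant edge loads, considering the representation $a_{ij}[t] = \bar{a}_{ij} + \tilde{a}_{ij}[t]$
and (2) yields the static aeroelastic equilibrium equations and the equations describing
the motion of the panel $\tilde{a}_{ij}[t]$ about its static equilibrium state $\bar{a}_{ij}$. Introducing the state
vectors $\mathbf{y}$ and $\mathbf{z}$ of dimension $N$ and $2N$, respectively ($N = kl$), these two systems of
equations have the form

$\hat{\mathbf{f}}[\mathbf{y}; \lambda, L_\alpha^*] = \mathbf{0}, \quad \dot{\mathbf{z}} = \mathbf{f}[\mathbf{z}; \lambda, L_\alpha^*]$ (3)

The components of $\mathbf{y}$ [$\mathbf{z}$] are $\bar{a}_{ij}$ [$\tilde{a}_{ij}$ and $\dot{\tilde{a}}_{ij}$]. For the (4 × 1) case,
$\mathbf{y} = \{\bar{a}_{11}, \bar{a}_{21}, \bar{a}_{31}, \bar{a}_{41}\}^T$ and
$\mathbf{z} = \{\tilde{a}_{11}, \tilde{a}_{21}, \tilde{a}_{31}, \tilde{a}_{41}, \dot{\tilde{a}}_{11}, \dot{\tilde{a}}_{21}, \dot{\tilde{a}}_{31}, \dot{\tilde{a}}_{41}\}^T$, and for the (2 × 2) case,
$\mathbf{y} = \{\bar{a}_{11}, \bar{a}_{12}, \bar{a}_{21}, \bar{a}_{22}\}^T$ and
$\mathbf{z} = \{\tilde{a}_{11}, \tilde{a}_{12}, \tilde{a}_{21}, \tilde{a}_{22}, \dot{\tilde{a}}_{11}, \dot{\tilde{a}}_{12}, \dot{\tilde{a}}_{21}, \dot{\tilde{a}}_{22}\}^T$. Furthermore, the $N$- [$2N$-]
dimensional vector $\hat{\mathbf{f}}$ [$\mathbf{f}$] is a nonlinear function of $\mathbf{y}$ [$\mathbf{z}$].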

3.1 Determination of static equilibrium state and its stability 

The arclength continuation method (ACM, e.g., Seydel 1988, Nayfeh & Balachandran
1994) is used to solve (3)$_1$. Varying one of the control parameters, e.g., $\lambda$, parametrizing
(3)$_1$ in terms of the arclength $s$ along a solution branch, and considering the Euclidean
arclength normalization, one obtains

$\hat{\mathbf{f}}[\mathbf{y}[s]; \lambda[s]] = \mathbf{0}, \quad (y_1 - y_1^*)^2 + \cdots + (y_N - y_N^*)^2 + (\lambda - \lambda^*)^2 - ds^2 = 0,$ (4)

where $(\mathbf{y}^*, \lambda^*)$ and $(\mathbf{y}, \lambda)$ denote solutions for parameter values $s^*$ and $s$ ($= s^* + ds$),
respectively. The continuation scheme is initiated by solving (3)$_1$ for a specified $\lambda$.
Thereafter, the solution branch is generated by considering $\mathbf{y}^* = \mathbf{y}$ and $\lambda^* = \lambda$ for each
subsequent arclength step of specified size ($ds$), and solving (4) for $(\mathbf{y}, \lambda)$ using the
Newton-Raphson method. The direction of continuation is governed by the initial
guess provided for any one unknown, e.g., $\lambda$. A good initial guess for $\lambda$ is $\lambda^* \pm ds$, with
the choice $+/-$ determining the continuation direction, which is suitably altered in
order to negotiate a turning point.
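
A minimal sketch of one way to code the continuation step is given here in Python. It assumes a scalar toy equilibrium equation, f(y, lam) = y^2 + lam - 1, in place of the panel equations (3)_1, and the names f, arclength_step, and trace_branch are hypothetical. Each step solves the equilibrium equation together with the Euclidean arclength constraint of (4) by Newton-Raphson. The initial guess is a secant predictor built from the two most recent points; this is a substitution for the lam* +/- ds guess described in the text, chosen so that the turning point at lam = 1 is passed without a manual change of direction.

```python
def f(y, lam):
    """Toy scalar equilibrium equation (assumed, not the panel model)."""
    return y * y + lam - 1.0

def df_dy(y, lam):
    return 2.0 * y

def df_dlam(y, lam):
    return 1.0

def arclength_step(y_star, lam_star, y_guess, lam_guess, ds, tol=1e-12, max_iter=50):
    """Newton-Raphson on f = 0 plus the Euclidean arclength constraint."""
    y, lam = y_guess, lam_guess
    for _ in range(max_iter):
        r1 = f(y, lam)
        r2 = (y - y_star) ** 2 + (lam - lam_star) ** 2 - ds ** 2
        if abs(r1) < tol and abs(r2) < tol:
            return y, lam
        # 2 x 2 Jacobian of (r1, r2) with respect to (y, lam)
        a, b = df_dy(y, lam), df_dlam(y, lam)
        c, d = 2.0 * (y - y_star), 2.0 * (lam - lam_star)
        det = a * d - b * c
        y += (-r1 * d + b * r2) / det
        lam += (-a * r2 + c * r1) / det
    raise RuntimeError("Newton-Raphson did not converge")

def trace_branch(y0, lam0, ds=0.05, n_steps=60):
    """Generate a solution branch starting from a known solution (y0, lam0)."""
    branch = [(y0, lam0)]
    y_guess, lam_guess = y0, lam0 + ds          # first guess: step in +lam
    for _ in range(n_steps):
        y_star, lam_star = branch[-1]
        y, lam = arclength_step(y_star, lam_star, y_guess, lam_guess, ds)
        branch.append((y, lam))
        # secant predictor: extrapolate along the last chord
        y_guess, lam_guess = 2.0 * y - y_star, 2.0 * lam - lam_star
    return branch

branch = trace_branch(1.0, 0.0)
lam_max = max(lam for _, lam in branch)
```

In this toy problem the branch is the parabola lam = 1 - y^2, so the largest lam on the traced branch approximates the turning-point value, and the sign of y changes once the turning point has been passed.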

The Jacobian $\mathbf{J}_1 = \nabla_{\mathbf{y}} \hat{\mathbf{f}}$, where $\nabla_{\mathbf{y}}(\cdot) = [\partial(\cdot)/\partial y_1, \ldots, \partial(\cdot)/\partial y_N]$, is singular
(with rank $N - 1$) at turning points and branch points. Moreover, the $N \times (N + 1)$ 'augmented
matrix' $\mathbf{A} = [\nabla_{\mathbf{y}} \hat{\mathbf{f}} \,|\, \partial \hat{\mathbf{f}}/\partial \lambda]$ has rank $N$ [$< N$] at a turning point [branch point]. Thus, by
monitoring the eigenvalues of both $\mathbf{J}_1$ and the matrices formed by eliminating the $i$th
column ($i = 1 \to N$) of $\mathbf{A}$, the passing of turning points and branch points is detected.
Branch points did not occur in the examples of this study.

The local stability of the static equilibrium state is determined from the eigenvalues
of the Jacobian $\mathbf{J}_2 = \nabla_{\mathbf{z}} \mathbf{f}|_{\mathbf{z} = \mathbf{0}}$, whereby if every [any] eigenvalue resides in the left-
[right-] half complex plane the state is asymptotically stable [unstable]. As the control
parameter (or arclength parameter) is varied, a Hopf bifurcation occurs at $(\mathbf{y}_0, \lambda_0)$ if $\mathbf{J}_2$
has a simple pair of pure imaginary eigenvalues (undergoing a transverse crossing of the
imaginary axis) and no other eigenvalues with zero real part. At the bifurcation point,
the static equilibrium state loses its asymptotic stability and a periodic solution of (3)$_2$ is
born (e.g., Seydel 1988), i.e., flutter ensues. A supercritical (subcritical) Hopf bifurcation
is characterized by stable (unstable) periodic solutions beyond (before) the bifurcation
point, yielding soft (hard) flutter. The control parameter values at the Hopf bifurcation
are termed the critical or flutter values.
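
The eigenvalue test can be illustrated with a short Python sketch. It assumes a hypothetical 2 x 2 Jacobian for a single mode whose net damping decreases linearly with the flow speed; this is a stand-in for the 2N x 2N matrix J_2 of the panel and does not represent the coupled-mode mechanism of panel flutter. The function names, stiffness, and damping values are illustrative assumptions. The flutter value is located by bisection on the largest real part of the eigenvalues.

```python
import cmath

def eigenvalues_2x2(J):
    """Eigenvalues of a 2 x 2 matrix from its trace and determinant."""
    (a, b), (c, d) = J
    tr, det = a + d, a * d - b * c
    disc = cmath.sqrt(tr * tr - 4.0 * det)
    return (tr + disc) / 2.0, (tr - disc) / 2.0

def jacobian(lam, stiffness=4.0, damping=0.5):
    """Toy Jacobian (assumed): net damping (damping - lam) drops with flow speed."""
    return ((0.0, 1.0), (-stiffness, lam - damping))

def max_real(lam):
    return max(ev.real for ev in eigenvalues_2x2(jacobian(lam)))

def is_asymptotically_stable(J):
    """Stable if every eigenvalue lies in the left-half complex plane."""
    return all(ev.real < 0.0 for ev in eigenvalues_2x2(J))

def find_flutter_value(lam_lo, lam_hi, tol=1e-10):
    """Bisect between a stable and an unstable value of the control parameter."""
    assert max_real(lam_lo) < 0.0 < max_real(lam_hi)
    while lam_hi - lam_lo > tol:
        mid = 0.5 * (lam_lo + lam_hi)
        if max_real(mid) < 0.0:
            lam_lo = mid
        else:
            lam_hi = mid
    return 0.5 * (lam_lo + lam_hi)

lam_F = find_flutter_value(0.0, 1.0)
critical_pair = eigenvalues_2x2(jacobian(lam_F))
```

For this toy matrix the eigenvalues at the critical value form a pure imaginary pair, which is the signature of a Hopf bifurcation; whether it is supercritical or subcritical cannot be decided from the linearization alone.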

3.2 Dynamic solutions and bifurcation behavior 

The amplitudes of flutter oscillation about the static equilibrium state may be obtained
by a direct numerical integration (NIM1) of the initial value problem (IVP) defined by
(3)$_2$ and some initial conditions. However, convergence to a steady state is often very
slow and strongly influenced by the choice of initial conditions. Thus, when seeking
a periodic solution (i.e., the state and period $T$) of the autonomous system (3)$_2$,
a shooting technique (NIM2) is considered (e.g., Parker & Chua 1989, Nayfeh
& Balachandran 1994). This consists of a predictor and a Newton-Raphson type
corrector. An initial condition $\mathbf{v}$ (a $2N$-dimensional vector) that lies on the limit cycle
(periodic solution) is sought, whereby the IVP is transformed into the two-point BVP

$\dot{\mathbf{z}} = \mathbf{f}[\mathbf{z}; \lambda, L_\alpha^*], \quad \mathbf{z}[0; \mathbf{v}] = \mathbf{v}, \quad \mathbf{z}[T; \mathbf{v}] = \mathbf{v}, \quad$ where $\mathbf{z} = \mathbf{z}[t; \mathbf{v}]$. (5)

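The corrections $(\delta\mathbf{v}, \delta T)$ to an initial guess $(\mathbf{v}_0, T_0)$ are governed by the periodicity
condition $\mathbf{z}[T_0 + \delta T; \mathbf{v}_0 + \delta\mathbf{v}] = \mathbf{v}_0 + \delta\mathbf{v}$. Introducing (5)$_1$ into the linearized equations
obtained after a Taylor expansion of this condition, and augmenting the result
with the Mees criterion, which requires that $\delta\mathbf{v}$ be normal to $\mathbf{f}$, one obtains

$\mathbf{f}[\mathbf{z}[T_0; \mathbf{v}_0]; \lambda, L_\alpha^*]\,\delta T + [\nabla_{\mathbf{v}}\mathbf{z}[T_0; \mathbf{v}_0] - \mathbf{I}]\,\delta\mathbf{v} = \mathbf{v}_0 - \mathbf{z}[T_0; \mathbf{v}_0],$

$\mathbf{f}^T[\mathbf{z}[T_0; \mathbf{v}_0]; \lambda, L_\alpha^*]\,\delta\mathbf{v} = 0.$ (6)

In the linear algebraic system (6), $\mathbf{I}$ is the identity matrix and $\nabla_{\mathbf{v}}\mathbf{z}$ is the Jacobian of
$\mathbf{z}$ with respect to $\mathbf{v}$. Differentiating (5)$_1$ with respect to $\mathbf{v}$, one obtains $\nabla_{\mathbf{v}}\dot{\mathbf{z}} = \nabla_{\mathbf{v}}\mathbf{f}$ which,
upon consideration of (5)$_4$, yields a system of linear ODEs in $\partial z_i/\partial v_j$. Further,
integration of (5)$_1$ subject to (5)$_2$, followed by differentiation with respect to $\mathbf{v}$, yields the
initial conditions for $\nabla_{\mathbf{v}}\mathbf{z}$. The IVP thus obtained reads

$\frac{d}{dt}(\nabla_{\mathbf{v}}\mathbf{z}) = \nabla_{\mathbf{z}}\mathbf{f}\,\nabla_{\mathbf{v}}\mathbf{z}, \quad \nabla_{\mathbf{v}}\mathbf{z}[0; \mathbf{v}] = \mathbf{I}.$ (7)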

Thus, periodic solutions are obtained by the following algorithm. 

Step 1-Predictor: Obtain $\mathbf{z}[T_0; \mathbf{v}_0]$ and $\nabla_{\mathbf{v}}\mathbf{z}[T_0; \mathbf{v}_0]$ by numerically integrating the
IVP defined by the $(2N)^2 + 2N$ ODEs (5)$_1$ and (7)$_1$ and associated initial conditions
(5)$_2$ and (7)$_2$.

Step 2-Corrector: Using $\mathbf{z}[T_0; \mathbf{v}_0]$ and $\nabla_{\mathbf{v}}\mathbf{z}[T_0; \mathbf{v}_0]$, solve (6) to obtain the corrections
$\delta\mathbf{v}$ and $\delta T$. If $|\delta\mathbf{v}|$ and $|\delta T|$ are below a specified tolerance, the algorithm has
converged onto a limit cycle with period $T = T_0$ and containing the state $\mathbf{v} = \mathbf{v}_0$.
Otherwise, the updated initial guesses, i.e., $\delta\mathbf{v} + \mathbf{v}_0 \to \mathbf{v}_0$ and $\delta T + T_0 \to T_0$, are used
and the algorithm is repeated.
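
The predictor-corrector loop can be sketched in Python as follows. The sketch assumes a two-dimensional toy system (a Hopf normal form with a known limit cycle of unit radius and period pi) instead of the panel equations, a fixed-step fourth-order Runge-Kutta integrator instead of the Runge-Kutta-Verner routine, and hypothetical function names. The predictor integrates the state together with the variational equations (six ODEs here, i.e., (2N)^2 + 2N with N = 1); the corrector solves the bordered linear system formed by the periodicity condition and the normality (Mees) condition.

```python
import math

MU, OMEGA = 1.0, 2.0   # toy parameters (assumed)

def f(z):
    x, y = z
    r2 = x * x + y * y
    return [MU * x - OMEGA * y - x * r2, OMEGA * x + MU * y - y * r2]

def jac(z):
    x, y = z
    r2 = x * x + y * y
    return [[MU - r2 - 2 * x * x, -OMEGA - 2 * x * y],
            [OMEGA - 2 * x * y, MU - r2 - 2 * y * y]]

def rhs(s):
    """State (2 entries) followed by the sensitivity matrix (4 entries, row-wise)."""
    z, phi = s[:2], [s[2:4], s[4:6]]
    J = jac(z)
    dphi = [[sum(J[i][k] * phi[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]
    return f(z) + dphi[0] + dphi[1]

def integrate(v, T, n=1000):
    """Predictor: RK4 integration over [0, T] starting from z = v, sensitivity = I."""
    s, h = list(v) + [1.0, 0.0, 0.0, 1.0], T / n
    for _ in range(n):
        k1 = rhs(s)
        k2 = rhs([a + 0.5 * h * b for a, b in zip(s, k1)])
        k3 = rhs([a + 0.5 * h * b for a, b in zip(s, k2)])
        k4 = rhs([a + h * b for a, b in zip(s, k3)])
        s = [a + h * (p + 2 * q + 2 * r + w) / 6.0
             for a, p, q, r, w in zip(s, k1, k2, k3, k4)]
    return s[:2], [s[2:4], s[4:6]]

def solve(A, b):
    """Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [bi] for row, bi in zip(A, b)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            m = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= m * M[c][k]
    x = [0.0] * n
    for r in reversed(range(n)):
        x[r] = (M[r][n] - sum(M[r][k] * x[k] for k in range(r + 1, n))) / M[r][r]
    return x

def shoot(v0, T0, tol=1e-10, max_iter=30):
    """Corrector: Newton-Raphson updates of the initial state and the period."""
    for _ in range(max_iter):
        zT, phi = integrate(v0, T0)
        fT = f(zT)
        A = [[phi[0][0] - 1.0, phi[0][1], fT[0]],
             [phi[1][0], phi[1][1] - 1.0, fT[1]],
             [fT[0], fT[1], 0.0]]                 # last row: correction normal to f
        dv0, dv1, dT = solve(A, [v0[0] - zT[0], v0[1] - zT[1], 0.0])
        v0, T0 = [v0[0] + dv0, v0[1] + dv1], T0 + dT
        if math.hypot(dv0, dv1) < tol and abs(dT) < tol:
            return v0, T0, phi
    raise RuntimeError("shooting did not converge")

v, T, phi = shoot([1.2, 0.0], 3.0)
```

The matrix phi returned at convergence is the sensitivity matrix over one period, so its eigenvalues give the Floquet multipliers of the computed limit cycle as a by-product.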

The stability of a periodic solution $\bar{\mathbf{z}}[t]$ is ascertained from the eigenvalues of
$\nabla_{\mathbf{v}}\mathbf{z}[T_A; \mathbf{v}_0]$ (i.e., the Floquet multipliers, cf. appendix B). At bifurcation values of the
control parameters, the nature of the non-decaying (oscillatory) disturbance $\mathbf{z}^*$ (which
yields a bifurcating solution $\mathbf{z} = \bar{\mathbf{z}} + \mathbf{z}^*$) is governed by how the critical Floquet
multiplier $\lambda_k$ leaves the unit circle in the complex plane. This determines the type of
bifurcation that may occur (e.g., Seydel 1988, Nayfeh & Balachandran 1994).

A stable limit cycle possessing inversion symmetry (appendix B) undergoes a
symmetry-breaking bifurcation when $\lambda_k$ leaves the unit circle through $+1$. Beyond the
bifurcation point, stable, symmetry-broken (unsymmetric) and unstable, symmetric
limit cycles coexist, and their periods are locally preserved. A stable, symmetric
[unsymmetric] limit cycle undergoes a tangent bifurcation when $\lambda_k$ leaves the unit circle
through $-1$ [$+1$]. At the bifurcation point (also called a turning point), a stable and
an unstable limit cycle branch merge. Beyond the bifurcation point, the solution jumps to
another state which could be a dynamic one (periodic or nonperiodic). A stable,
unsymmetric limit cycle undergoes a period-doubling bifurcation when $\lambda_k$ leaves the unit
circle through $-1$. Beyond the bifurcation point, stable, period-doubled and unstable,
periodic solutions coexist, and locally the period of the former is twice that of the latter.
A secondary Hopf (Neimark) bifurcation occurs when two complex conjugate Floquet
multipliers leave the unit circle transversely. This implies a Hopf bifurcation of a stable
periodic solution, which introduces an additional frequency into the bifurcating
solution; the latter is two-period quasiperiodic (phase-locked) if the two frequencies are
incommensurate (commensurate). Stable, quasiperiodic (QPM) and unstable, periodic
solutions usually coexist beyond the bifurcation point.
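
The classification rules stated in this paragraph can be collected in a small lookup, sketched here in Python. The function name, the tolerance, and the convention of passing the critical multiplier as a complex number are assumptions made for illustration; the mapping itself only restates the rules of the text, including the convention that the multipliers of a symmetric limit cycle are taken over half the period (appendix B).

```python
def classify_exit(multiplier, symmetric, tol=1e-6):
    """Name the bifurcation implied by where the critical multiplier meets the unit circle.

    multiplier: critical Floquet multiplier as a complex number
    symmetric:  True if the limit cycle possesses inversion symmetry
    """
    if abs(abs(multiplier) - 1.0) > tol:
        return "not critical"                     # multiplier is off the unit circle
    if abs(multiplier.imag) > tol:
        return "secondary Hopf (Neimark)"         # complex conjugate pair
    through_plus_one = multiplier.real > 0.0
    if symmetric:
        return "symmetry breaking" if through_plus_one else "tangent"
    return "tangent" if through_plus_one else "period doubling"

examples = [
    classify_exit(complex(1.0, 0.0), symmetric=True),
    classify_exit(complex(-1.0, 0.0), symmetric=True),
    classify_exit(complex(1.0, 0.0), symmetric=False),
    classify_exit(complex(-1.0, 0.0), symmetric=False),
    classify_exit(complex(0.6, 0.8), symmetric=False),
]
```

In practice the multiplier that is always on the unit circle for an autonomous system would have to be excluded before the critical one is selected; that bookkeeping is omitted from the sketch.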

It is noted that the NIM2 cannot be used to obtain nonperiodic (i.e., quasiperiodic or
chaotic) solutions, and its convergence is sensitive to the initial guess and the step size of
numerical integration (especially when multiple periodic solutions coexist). The
numerical integration was performed using the fifth and sixth order Runge-Kutta-Verner
method (IMSL-DVERK).

3.3 Characterization of motion 

The Lyapunov exponents, which measure the average exponential rates of divergence
or convergence of nearby phase space trajectories, were calculated using the algorithm
and program of Wolf et al (1985). In this method, primary and neighboring trajectories,
evolving from points at the center and surface, respectively, of an infinitesimal
$2N$-dimensional hypersphere, are obtained from the IVP (5)$_{1,2}$ and (7). The generation
of principal axes (of the hypersphere) is ensured by choosing initial conditions (7)$_2$, and
their evolution (i.e., stretching or contraction) is computed at each specified Lyapunov
time step. A Gram-Schmidt re-orthonormalization of the principal axes is done at each
Lyapunov time step in order to prevent a divergence in their magnitudes. The $j$th
Lyapunov exponent is defined as $l_j = \lim_{t \to \infty} (1/t) \log_2 (d_j[t]/d_j[0])$ ($j = 1 \to 2N$), where
$d_j[t]$ is the length of the $j$th principal axis.
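
The re-orthonormalization procedure can be sketched in Python for a case where the answer is known. The sketch assumes a two-dimensional linear toy flow dz/dt = A z, for which the tangent dynamics coincide with the flow itself and the exponents equal the eigenvalues of A divided by ln 2 (base-2 logarithms, as in the definition above). For the nonlinear panel equations the axes would instead be propagated with the variational equations (7) along the primary trajectory. The matrix A, the step sizes, and the function names are illustrative assumptions; this is not the program of Wolf et al.

```python
import math

A = [[-1.0, 1.0], [1.0, -2.0]]   # toy linear flow (assumed), eigenvalues (-3 +/- sqrt(5))/2

def mul(u):
    return [A[0][0] * u[0] + A[0][1] * u[1], A[1][0] * u[0] + A[1][1] * u[1]]

def flow_step(v, h):
    """One RK4 step of dv/dt = A v."""
    k1 = mul(v)
    k2 = mul([v[i] + 0.5 * h * k1[i] for i in range(2)])
    k3 = mul([v[i] + 0.5 * h * k2[i] for i in range(2)])
    k4 = mul([v[i] + h * k3[i] for i in range(2)])
    return [v[i] + h * (k1[i] + 2 * k2[i] + 2 * k3[i] + k4[i]) / 6.0 for i in range(2)]

def lyapunov_exponents(t_total=100.0, lyap_step=0.5, h=0.01):
    axes = [[1.0, 0.0], [0.0, 1.0]]           # orthonormal principal axes at t = 0
    log_sums = [0.0, 0.0]
    n_renorm = int(round(t_total / lyap_step))
    n_inner = int(round(lyap_step / h))
    for _ in range(n_renorm):
        for _ in range(n_inner):
            axes = [flow_step(a, h) for a in axes]
        # Gram-Schmidt re-orthonormalization at each Lyapunov time step
        n1 = math.hypot(axes[0][0], axes[0][1])
        e1 = [c / n1 for c in axes[0]]
        proj = axes[1][0] * e1[0] + axes[1][1] * e1[1]
        w = [axes[1][i] - proj * e1[i] for i in range(2)]
        n2 = math.hypot(w[0], w[1])
        e2 = [c / n2 for c in w]
        log_sums[0] += math.log2(n1)          # accumulated stretching of axis 1
        log_sums[1] += math.log2(n2)          # accumulated stretching of axis 2
        axes = [e1, e2]
    return [s / t_total for s in log_sums]

exponents = lyapunov_exponents()
```

Both exponents are negative here, as expected for a stable equilibrium point, and their sum equals the trace of A divided by ln 2, which reflects the dissipative character of the toy flow.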

The value of $l_j$ depends upon the Lyapunov time step and Runge-Kutta integration
time step, and a long-time integration is required to obtain an acceptable value of $l_j$. An
autonomous system (e.g., (3)$_2$) possesses one zero exponent (except for a stable
equilibrium point), and a dissipative system (e.g., (3)$_2$) is characterized by $\sum_{j=1}^{2N} l_j < 0$.
Stable equilibrium points possess all negative exponents, periodic solutions have one
zero and $2N - 1$ negative exponents, $K$-period quasiperiodic motion (i.e., with $K$
incommensurate frequencies) possesses $K$ zero and $2N - K$ negative exponents, whereas
a chaotic attractor has at least one positive exponent (indicating an exponential
expansion in the corresponding principal direction). The Lyapunov (Kaplan-Yorke)
dimension $d_L$ is calculated using $d_L = m + (l_1 + l_2 + \cdots + l_m)/|l_{m+1}|$ where, after
ordering the exponents as $l_1 \ge l_2 \ge \cdots \ge l_{2N}$, $m$ is defined as the largest integer
satisfying $l_1 + l_2 + \cdots + l_m \ge 0$ (e.g., Seydel 1988, Parker & Chua 1989). Thus, $d_L$ is
equal to 0, 1, and $K$ for a point, periodic, and $K$-period quasiperiodic attractor, respectively.
A chaotic attractor possesses a non-integer $d_L$.
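
The dimension formula can be written as a short Python function. The function name is hypothetical and the exponent values in the usage lines are made-up illustrations, not results of this study. Exact zeros are assumed for the zero exponents; with computed exponents, which are only approximately zero, a tolerance would be needed.

```python
def lyapunov_dimension(exponents):
    """Kaplan-Yorke dimension from a list of Lyapunov exponents."""
    ordered = sorted(exponents, reverse=True)
    partial, m = 0.0, 0
    for value in ordered:
        if partial + value < 0.0:
            break                      # adding this exponent would make the sum negative
        partial += value
        m += 1
    if m == len(ordered):
        return float(m)                # sum never becomes negative
    return m + partial / abs(ordered[m])

d_point = lyapunov_dimension([-1.0, -2.0])              # stable equilibrium point
d_periodic = lyapunov_dimension([0.0, -1.0, -2.0])      # limit cycle
d_qpm = lyapunov_dimension([0.0, 0.0, -1.0, -3.0])      # 2-period quasiperiodic motion
d_chaotic = lyapunov_dimension([0.9, 0.0, -14.5])       # one positive exponent
```

The first three calls reproduce the integer values 0, 1, and K = 2 stated in the text, while the last gives a non-integer value between 2 and 3.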

4. Discussion of results 

Unless stated otherwise, an aerodynamically nonlinear, 4 × 1 mode analysis is performed
for a square panel with movable edges and $l_1/h = 50$, and subjected to flow past
both faces with $\Lambda = 0^\circ$. A three-layered composite panel ($i = 1, 2, 3$, where indices 1 and
3 represent the outer layers) is considered. The data used are: $p_\infty = 7.092 \times 10^5$ N m$^{-2}$;
$c_\infty = 3.299 \times 10^2$ m s$^{-1}$; $\kappa = 1.4$; $\varepsilon_s/\varepsilon_a = 0.1$ (assumed); $h = 2.54$ cm; $h_{\langle 2 \rangle} = 0.5 h_{\langle 1 \rangle}$ (i.e.,
the thickness of the core layer is twice that of the face layers, cf. figure 1a); $\mathring{a}_{11} = 0.2$
(remaining $\mathring{a}_{ij} = 0$); $\rho_{\langle i \rangle} = 3.064 \times 10^3$ kg m$^{-3}$; $E_{\langle 1 \rangle} = E_{\langle 3 \rangle} = 1.01 \times 10^{12}$ N m$^{-2}$;

E <i>/ E v> = £«•)/£<«> = 5 ; V G <» =40; and ^ = = °‘ 25 - Here E <o and ^<o are 

the Young’s modulus and Poisson’s ratio in the plane of isotropy, respectively, whereas 

$E'_{\langle i \rangle}$, $\mu'_{\langle i \rangle}$, and $G'_{\langle i \rangle}$ are the Young's modulus, Poisson's ratio, and shear modulus in the
planes normal to the plane of isotropy, respectively, and $\langle i \rangle$ denotes the $i$th layer.

The following notation is used. Periodic motions are denoted as SS - stable symmetric,
SB - symmetry broken, SN - stable nonsymmetric, Pn - period-n, US - unstable
symmetric, and UN - unstable nonsymmetric, whereas QPM denotes quasiperiodic
motions. The maximum $z_1$ for which $z_5 = 0$ is termed the flutter amplitude $(z_1)_{max}$.

4.1 Results using linear and nonlinear aerodynamics 

A 2 × 2 mode, HSDT analysis of a rectangular ($\phi = l_1/l_2 = 0.5$) panel with $\Lambda = 30^\circ$ is
performed. Figure 2 considers linear aerodynamics and $l_1/h = 24$. Figure 2a shows that
the flutter motion undergoes a symmetry-breaking bifurcation followed by a period-doubling
and subsequently a bifurcation to quasiperiodic motion at $\lambda = 81.3654$
(2-period QPM, cf. table 1 for $\lambda = 81.3655$). The Poincaré section in figure 2b verifies
this transition from P2 motion to QPM by indicating a transformation of two discrete
points into two closed loops. Figure 2c displays the associated frequency spectrum.
This contains sidebands around the fundamental frequency and its higher- and
sub-harmonics, thus indicating the presence of additional, incommensurable frequencies.
At $\lambda = 92.64$ (i.e., the second symmetry-breaking bifurcation point), the US
solution regains its stability. For $\lambda = 92.54$, a phase trajectory (not included) indicating
bounded, nonperiodic motion between the US and two neighboring unstable-SB limit
cycles is obtained. Furthermore, the emerging pattern in figure 2d, the broadbanded
frequency spectrum in figure 2e, and data in table 1 indicate that this motion is chaotic.
This scenario for chaos conforms to the conjecture that chaotic motion could emerge
when all coexisting equilibria (static or dynamic) are unstable (e.g., Dowell 1984).

Figure 3 displays results for flow past the upper face (i.e., second- and third-order
aerodynamic nonlinearities present, cf. Case I) and past both faces (i.e., only third-order
aerodynamic nonlinearities present, cf. Cases II and III). The P2 motion for Case
I ($l_1/h = 100$) undergoes a bifurcation to QPM. For Case II ($l_1/h = 100$), the SB solution
bifurcates to QPM, and the SS branch (occurring after the US solution regains its
stability) approaches a turning point ($\lambda \approx 160.45$) beyond which a jump in response is
expected. A subcritical Hopf bifurcation occurs at $\lambda = \lambda_F = 74.49$ for Case III
($l_1/h = 50$). This implies that hard flutter motion occurs when $\lambda > \lambda_F$, and could also
occur for $\lambda < \lambda_F$ if the flat panel is appropriately disturbed.

4.2 Flutter of compressed and buckled panels 

A uniaxially compressed panel is considered ($L_1^* < 0$). Stability boundaries for an
imperfect panel are shown in figure 4a. As the control parameters are varied, starting
from $L_1^* = \lambda = 0$ (i.e., the initially imperfect state), the stable configuration changes
continuously. For low values of $-L_1^*$, a Hopf bifurcation occurs at the stability
boundary and flutter ensues. At high values of $-L_1^*$ (i.e., the region marked with dots),
the loss of stability results in a jump either to flutter or a new buckled state
(snap-through). Depending upon the loading path (aerodynamic and compressive)
considered, multiple flutter speeds are possible within a certain range of $L_1^*$. Furthermore,
for a given flow speed, the CPT overpredicts the critical compressive load, and is
insensitive to transverse shear flexibility variations, whereas the HSDT yields a lower
critical compressive load for a panel with greater transverse shear flexibility. For the
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Figure 4. Stability boundaries: (a) imperfect panel; (b) perfect panel. 


perfect panel, the stability of the trivial state {0} was studied, and associated stability
boundaries are shown in figure 4b. Based on [R, C], where R and C are the number of
real and complex eigenvalues of $\mathbf{J}_2$ residing in the right-half complex plane, respectively,
the $\lambda$-$L_1^*$ plane is divided into regions I-IX. The trivial solution is stable in I and
undergoes divergence instability when crossing into II, and flutter instability (as the
result of a Hopf bifurcation H{0}) when crossing into IV. However, as will be seen later,
this is not the only mechanism for the onset of flutter. Buckled equilibria exist in II and III.
Figure 4b reveals that restraining the edges parallel to the flow lowers the divergence 
Table 2. Motion past H ($\lambda = 43$, cf. figure 5a).

$-L_1^*$        $T$           $(z_1)_{max}$   Motion type   Stability
0.620518E+01    0.5005E+00    0.25023E-03     SN            Stable
0.621268E+01    0.5117E+00    0.87581E-01     UN            Unstable
0.621268E+01    0.1023E+01    0.15611E+00     P2            Stable
0.621338E+01    0.1028E+01    0.17596E+00     P2            Unstable
0.621338E+01    0.2056E+01    0.17604E+00     P4            Stable
0.621349E+01    0.2057E+01    0.17770E+00     P4            Unstable
0.621349E+01    0.4115E+01    0.17773E+00     P8            Stable



compressive load and increases the flow speed required for flattening (which occurs 
when crossing from II into I), whereas it does not visibly affect the flutter speed. 

For the imperfect panel, the region in the $\lambda$-$L_1^*$ plane where the transition from
Hopf-bifurcation instability to jump-type instability occurs was studied (i.e., transition
to the region marked with dots in figure 5a). The variation of the static solution ($y_1$)
with $L_1^*$ is shown in figure 5a. As the compressive load is increased, with $\lambda = 43$, initially
the deformation increases and then the panel tends to flatten until a Hopf bifurcation
(H) occurs, resulting in soft flutter about the equilibrium state. The motion begins as an
SN limit cycle, then undergoes a sequence of period-doubling bifurcations resulting in
P2 up to P8 motion (cf. table 2), and finally becomes chaotic (cf. figure 5b and c, and
table 1 for $L_1^* = -6.235$, which is well past the P8 motion value). Thus, chaotic motion
has resulted after a period-doubling sequence. For $\lambda = 37$, motion about TP1 and
motion past H (cf. tables 3 and 4, respectively) indicate that for a range of $L_1^*$, the static
state coexists with either periodic or quasiperiodic motions, i.e., the panel can remain
buckled or can flutter. A similar observation was made by Holmes (1977) for a perfect
panel undergoing cylindrical bending.

For the perfect panel, the region in the $\lambda$-$L_1^*$ plane where the divergence and flutter
boundaries meet was studied (cf. figure 4b). Figure 6a shows the static solution ($y_1$) for
$L_1^* = -5.5$, $-5.8$, and $-6.2$ when $\lambda$ is varied. It also shows the stable motion ($(z_1)_{max}$)
past H{0} for $L_1^* = -5.5$ (for $L_1^* = -5.8$, cf. table 5). For $L_1^* = -5.5$, as $\lambda$ is increased
the buckled panel gradually flattens out and then flutters beyond H{0}. It is seen that
for $L_1^* = -5.8$ (cf. table 5, which reveals a period-doubling sequence) there exists a range of
$\lambda$ for which the panel can either remain buckled or flutter about {0}. When $\lambda$ is increased
past the TP for $L_1^* = -5.8$, a sudden transition from the buckled state to periodic


Table 3. Stable motion about TP1 at $L_1^* = -7.712$ (cf. figure 5a).

$-L_1^*$        $T$           $(z_1)_{max}$   Motion type
0.7602E+01                                    QPM
0.7712E+01      0.2774E+00    0.5599E+00      SN
0.7912E+01                                    QPM

Table 4. Stable motion past H ($\lambda = 37$, cf. figure 5a).

$-L_1^*$        $T$           $(z_1)_{max}$   Motion type
0.643286E+01    0.6604E+00    0.1350E-03      SN
0.643415E+01    0.1343E+01    0.3968E-01      P2
0.643418E+01    0.2688E+01    0.5965E-01      P4
0.643419E+01    0.5379E+01    0.6017E-01      P8



motion about {0} occurs. The motion, shown in figure 6b for $\lambda = 36.8456$, consists of the
panel snapping back and forth, and vibrating about multiple unstable equilibria (similar
limit cycles were obtained by Sipcic & Morino 1991). This is followed by chaotic
motion when $\lambda = 36.9456$ (cf. figure 6c and table 1). For $L_1^* = -6.2$, a buckled-chaos
transition was observed when passing the TP (cf. figure 6d and table 1 for $\lambda = 34.8739$).

Next, $L_1^*$ is varied with $\lambda$ held constant. For $\lambda = 24$ and $-L_1^*$ increasing, the buckled
state (cf. figure 7a) undergoes a sudden transition to QPM (cf. figure 7b and table 1)




Figure 6. CPT, perfect panel: (a) static ($y_1$) and dynamic ($(z_1)_{max}$) modal amplitudes,
H - supercritical Hopf bifurcation point, TP - turning point; (b) $L_1^* = -5.8$,
periodic motion past TP, $\lambda = 36.8456$; (c) $L_1^* = -5.8$, chaotic motion past TP,
$\lambda = 36.9456$; (d) $L_1^* = -6.2$, chaotic motion past TP, $\lambda = 34.8739$.
Table 5. Stable motion past H{0} ($L_1^* = -5.8$, cf. figure 6a).

$\lambda$       $T$           $(z_1)_{max}$   Motion type
0.351134E+02    0.1752E+01    0.6123E-02      SS
0.352254E+02    0.1652E+01    0.6768E-01      SB
0.352494E+02    0.3309E+01    0.5316E-01      P2
0.352531E+02    0.6619E+01    0.5916E-01      P4
0.352539E+02    0.1324E+02    0.6049E-01      P8
0.352540E+02    0.2648E+02    0.6076E-01      P16




Figure 7. HSDT, perfect panel: (a) static modal amplitudes, H1-H3 (subcritical) and
H4 (supercritical) - Hopf bifurcation points, TP1-TP5 - turning points; (b) hard flutter
motion (QPM) about H1 (cf. table 7 also).


about the nontrivial equilibrium state when passing H1 (subcritical, cf. table 6). The
appearance of wrinkles in the Poincaré sections suggests the possibility of chaos via
torus bifurcation (cf. Swinney 1983) in the neighborhood of $L_1^* = -15.968$. Further
increasing $-L_1^*$ results in shrinkage of the Poincaré section (cf. figure 7b) and
eventually a transition from QPM to periodic motion (SN), followed by a sequence of
period doubling and then QPM (cf. table 7). As $-L_1^*$ is decreased, QPM continues
prior to H1, with the Poincaré section reducing in size (cf. figure 7b) until all motion
suddenly ceases when $-L_1^* < 15.893$, i.e., the fluttering panel snaps back to the buckled
state. These results show that hard flutter occurs about H1. The motion about H{0}
(supercritical, occurs at $L_1^* = -4.979$) and about H4 (supercritical), not included here,
consists of a period-doubling sequence and reveals the coexistence of flutter motion
with the buckled state. For $\lambda = 27$ (HSDT), the buckled panel experiences a transition


Table 6. Unstable motion prior to H1 (cf. figure 7a).

$-L_1^*$        $T$           $(z_1)_{max}$   Motion type
0.1592E+02      0.9226E-01    0.7868E-03      UN
0.1587E+02      0.9513E-01    -0.4812E-01     UN
















Naresh K Chandiramani et al 


Table 7. Stable motion past H1 (cf. figure 7a, b).

$-L_1^*$        $T$           $(z_1)_{max}$   Motion type
0.1607E+02                                    QPM
0.1617E+02      0.9983E-01    -0.1401E+00     SN
0.1647E+02      0.2021E+00    -0.1437E+00     P2
0.1681E+02      0.4110E+00    -0.1792E+00     P4
0.1683E+02                                    QPM



to chaos at $L_1^* = -4.965$ (chaotic at $L_1^* = -4.975$ also) which is similar to that for the
case in figure 6c.

Results for a compressed, perfect panel with edges $x_2 = 0, l_2$ either movable or
immovable, and edges $x_1 = 0, l_1$ movable are shown in figure 8. As the compressive load
is increased with $\lambda = 40$, the restrained panel buckles, then suddenly flattens out
(beyond the TP), and subsequently flutters about {0}. The unrestrained panel remains
flat prior to experiencing flutter. Moreover, the edge restraint has no visible effect on
the amplitudes of periodic motion in the vicinity of H{0} and on the associated critical
speed. This is so because the compression induced by the edge restraints is normal to
the flow.


5. Conclusions

The nonlinear dynamic response arising from a high-supersonic flow over a uniformly
compressed, shear-deformable, composite panel was studied. Third-order piston theory
aerodynamics was considered along with small geometric imperfections and
various in-plane boundary conditions. Periodic and nonperiodic motions were obtained
using the shooting technique (NIM2) and direct numerical integration, respectively,
in conjunction with arclength continuation for the static state. The results permit
the following conclusions.

• Flutter of a compressed panel occurs either due to a Hopf bifurcation instability or as
a jump from stable equilibrium to motion. Within the regions in the $\lambda$-$L_1^*$ plane



Figure 8. Effect of edge restraint on response, perfect panel, HSDT, edges $x_1 = 0, l_1$
movable (M), $x_2 = 0, l_2$ movable (M) or immovable (I), TP - turning point.
considered, it was found that periodic or quasiperiodic dynamic states may coexist
with the stable static state, i.e., the panel could remain buckled or could flutter. For
compressed panels, chaotic motions were obtained for imperfect panels via the
period-doubling scenario, and also for perfect panels where a sudden transition from
the buckled state to one of chaos (followed by complicated periodic motion)
occurred. A chaotic response in the form of bounded nonperiodic motion between
three unstable periodic solutions was observed for the uncompressed panel. Moreover,
hard flutter motions (quasiperiodic and periodic) about the nontrivial equilibrium
state can also occur.

• The restraining of edges parallel to the flow does not visibly affect the flutter speed 
and immediate post-critical response of perfect panels. However, the possibility of 
buckling prior to flutter is greater for restrained panels. 

• The presence of aerodynamic nonlinearities, especially third-order ones, may cause 
hard flutter oscillations as well as a jump in dynamic response. 

• The critical loads predicted by the HSDT [CPT] are lower (unchanged) for a panel 
with greater transverse shear flexibility. Moreover, the HSDT generally yields lower 
critical loads than the CPT. 


Appendix A 

Definitions of non-dimensional quantities 
L* ~ l \,\ — l\ KPx U °° , t — 


n 2 D 

“Too a 
Coo K )£) 1/2 


D c„ 


D 


m 0 li 


1/2 


t) c 


n j£)j 


1/2 


(Al) 


Appendix B 

Floquet theory and stability of the periodic solution 

Applying a perturbation solution $z^*[t]$ to a periodic solution $z[t]$ with period $T$, the
following linearized disturbance equations are obtained from (3)$_2$:

$$\dot{z}^*[t] = A[t; \lambda, L_1^*]\, z^*[t], \quad \text{where } A[t; \lambda, L_1^*] = \nabla_z f\,\big|_{z = z[t]}. \tag{B1}$$

The period of $A$ is $T_A = T$ except when $f$ contains only odd nonlinearities and the limit
cycle is symmetric (i.e., inversion symmetry implying $z[t] = -z[t + 0.5T]$), in which
case $T_A = T/2$. One possible fundamental matrix solution of (B1), i.e., $Z^*[t]$, is defined
by the matrix IVP:

$$\dot{Z}^* = A[t; \lambda, L_1^*]\, Z^*, \quad Z^*[0] = I. \tag{B2}$$

Then, from Floquet theory one concludes that if th 
possesses distinct eigenvalues Xj (the Floquet mul 
asymptotically stable (unstable) if every (any) Floq 
unit circle whose centre is at the origin of the comp._ 
is seen that $\Phi$ is identical to $\nabla_{v} z[T_A; v_0]$. Hence, after a periodic solution is obtained by
the shooting technique, its stability is ascertained as a by-product by determining the
eigenvalues of $\nabla_{v} z[T_A; v_0]$ (e.g., Seydel 1988, Nayfeh & Balachandran 1994). Furthermore,
for an autonomous system it can be shown that when $T_A = T$ [$T_A = T/2$], i.e., for
an unsymmetric [symmetric] limit cycle, one of the Floquet multipliers is $+1$ [$-1$].
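
As a rough illustration of the stability check described in this appendix, the following Python sketch integrates the matrix IVP (B2) over one period with a classical fourth-order Runge-Kutta scheme and then extracts the Floquet multipliers of the resulting monodromy matrix. The two-state linear system, the function names and the step count are illustrative assumptions; this is not the panel model or the shooting code used in the paper.

```python
import cmath
import math


def _matmul(P, Q):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(P[r][k] * Q[k][c] for k in range(2)) for c in range(2)]
            for r in range(2)]


def _axpy(Z, K, s):
    """Return Z + s*K for 2x2 matrices."""
    return [[Z[r][c] + s * K[r][c] for c in range(2)] for r in range(2)]


def monodromy_2x2(A, period, steps=2000):
    """Integrate Z' = A(t) Z with Z(0) = I over one period (classical RK4)."""
    Z = [[1.0, 0.0], [0.0, 1.0]]
    h = period / steps
    for n in range(steps):
        t = n * h
        k1 = _matmul(A(t), Z)
        k2 = _matmul(A(t + h / 2), _axpy(Z, k1, h / 2))
        k3 = _matmul(A(t + h / 2), _axpy(Z, k2, h / 2))
        k4 = _matmul(A(t + h), _axpy(Z, k3, h))
        Z = [[Z[r][c] + h / 6 * (k1[r][c] + 2 * k2[r][c] + 2 * k3[r][c] + k4[r][c])
              for c in range(2)] for r in range(2)]
    return Z


def floquet_multipliers(M):
    """Eigenvalues of a 2x2 monodromy matrix from its trace and determinant."""
    tr = M[0][0] + M[1][1]
    det = M[0][0] * M[1][1] - M[0][1] * M[1][0]
    disc = cmath.sqrt(tr * tr - 4.0 * det)
    return (tr + disc) / 2, (tr - disc) / 2


def is_asymptotically_stable(multipliers):
    """True when every multiplier lies strictly inside the unit circle."""
    return all(abs(m) < 1.0 for m in multipliers)


# Hypothetical example: a lightly damped linear oscillator, period 2*pi.
M = monodromy_2x2(lambda t: [[0.0, 1.0], [-1.0, -0.2]], 2 * math.pi)
print(floquet_multipliers(M), is_asymptotically_stable(floquet_multipliers(M)))
```

For a periodic solution of an autonomous system, one multiplier would sit on the unit circle as noted above, so in practice the test would be applied to the remaining multipliers.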


Appendix 
List of symbols 

^mn 

b,c,d,S 

Clx^t] 

h,l x ,l 2 ,<fr 
J l5 J 2 

Ll,X 

m 0 ,D 

s 

U„, M,A 

y,z[t] 
z[t] ,Z*[t] 

ico’df’df 

^■’Poo’^ao 


v, T 

P<i» ^<i>> P<i>’ 
*<* G <0> 

amplitudes of lateral motion and lateral imperfection;
nondimensional amplitudes of lateral motion and lateral imperfection;
static and dynamic components of $a_{mn}[t]$;
structural coefficients given in Chandiramani (1993);
Airy stress function associated with in-plane stress resultants;
viscous structural damping coefficient, and nondimensional
structural and aerodynamic damping coefficients;
thickness, length, width, and aspect ratio of panel;
Jacobian matrices for determining bifurcation and stability
behavior of static state ($y$);
nondimensional edge-load and flow speed parameters;
mass per unit area and bending rigidity of panel;
arclength parameter;
flow speed, Mach number, and coplanar flow angle;
lateral motion and lateral imperfection;
state vectors associated with the static and dynamic components of $a_{mn}[t]$;
periodic solution of $z$ and corresponding applied disturbance;
tracers associated with HSDT, in-plane edge restraints, one-sided
flow and two-sided flow;
polytropic gas coefficient, undisturbed air pressure, and speed
of sound;
Lyapunov exponents and associated dimension;
initial condition and period associated with limit cycle;
material properties of ith layer, i.e., density; Young’s
modulus and Poisson’s ratio in the plane of isotropy; Young’s
modulus, shear modulus, and Poisson’s ratio in planes normal
to the plane of isotropy;
monodromy matrix and its eigenvalues (Floquet multipliers).


References

Barai A, Durvasula S 1994 Flutter of hybrid laminated flat panels with simply supported edges in 
supersonic flow. J. Sound Vib. 169: 373-386 

Bolotin V V 1963 Nonconservative problems of the theory of elastic stability (New York: Macmillan) 
Chandiramani N K 1993 Nonlinear flutter of composite shear-deformable panels in a high- 
supersonic flow PhD dissertation, Virginia Polytechnic Institute and State University, 
Blacksburg 




Nonperiodic flutter of a buckled composite panel 




Chandiramani N K, Librescu L I, Plaut R H 1994 Flutter of geometrically-imperfect shear- 
deformable composite panels using nonlinear aerodynamics. J. Sound Vib. (accepted) 
Dowell E H 1966 Nonlinear oscillations of a fluttering plate. AIAA J. 4: 1267-1275 
Dowell E H 1982 Flutter of a buckled plate as an example of chaotic motion of a deterministic 
autonomous system. J. Sound Vib. 85: 333-344 

Dowell E H 1984 Observation and evolution of chaos for an autonomous system. J. Appl. Mech.
51: 664-673

Dowell E H, Virgin L N 1990 On spatial chaos, asymptotic modal analysis, and turbulence. 
J. Appl. Mech. 57: 1094-1097 

Dzygadlo Z 1970 Local analysis of nonlinear forced vibration of a plate of finite length in plane 
supersonic flow. Proc. Vib. Probl. 11: 353-368 

Gray C E, Mei C, Shore C P 1991 Finite element method for large-amplitude two-dimensional 
panel flutter at hypersonic speeds. AIAA J. 29: 290-298 
Han A D, Yang H T Y 1983 Nonlinear panel flutter using high-order triangular finite elements.
AIAA J. 21: 1453-1461 

Holmes P J 1977 Bifurcations to divergence and flutter in flow-induced oscillations. J. Sound Vib. 
53: 471-503 

Kuo C C, Morino L, Dugundji J 1973 Nonlinear interaction of panel flutter with harmonic
forcing excitation. AIAA J. 11: 419-420

Liaw D G, Yang H T Y 1993 Reliability and nonlinear supersonic flutter of uncertain laminated
plates. AIAA J. 31: 2304-2311 

Librescu L 1965 Aeroelastic stability of orthotropic heterogeneous thin panels in the vicinity of 
the flutter critical boundary - Part I. J. Mec. 4: 51-76 
Librescu L 1975 Elastostatics and kinetics of anisotropic and heterogeneous shell type structures 
(Leyden: Noordhoff International Publishers) 

Librescu L, Reddy J N 1989 A few remarks concerning several refined theories of anisotropic 
composite laminated plates. Int. J. Eng. Sci. 27: 515-527 
Librescu L, Stein M 1991 A geometrically nonlinear theory of transversely isotropic laminated
composite plates and its use in the postbuckling analysis. Thin-Walled Struct. 11: 177-201
McIntosh S C 1973 Effect of hypersonic nonlinear aerodynamic loading on panel flutter. AIAA
J. 11: 29-32

Nayfeh A H, Balachandran B 1994 Applied nonlinear dynamics: Analytical, computational, and 
experimental methods (New York: John Wiley) 

Parker T S, Chua L O 1989 Practical numerical algorithms for chaotic systems (New York: 
Springer-Verlag) 

Rao K S, Rao G V 1980 Large amplitude supersonic flutter of panels with ends elastically 
restrained against rotation. Comput. Struct. 11: 197-201 
Sarma B S, Varadan T K 1988 Nonlinear panel flutter by finite element method. AIAA J. 26: 
566-574 

Seydel R 1988 From equilibrium to chaos: Practical bifurcation and stability analysis (New York:
Elsevier)

Shiau L C, Lu L T 1992 Nonlinear flutter of two-dimensional simply supported symmetric 
composite laminated plates. J. Aircraft 29: 140-145 
Sipcic S R 1990 The chaotic response of a fluttering panel: The influence of maneuvering. 
Nonlinear Dyn. 1: 243-264 

Sipcic S R, Morino L 1991 Dynamic behavior of fluttering two-dimensional panels on an 
airplane in pull-up maneuver. AIAA J. 29: 1304-1312 
Swinney H L 1983 Observations of order and chaos in nonlinear systems. Physica D7: 3-15 
Weiliang Y, Dowell E H 1991 Limit cycle oscillation of a fluttering cantilever plate. AIAA J. 29: 
1929-1936 

Wolf A, Swift J B, Swinney H L, Vastano J A 1985 Determining Lyapunov exponents from a 
time series. Physica D16: 285-317 

Yuen S W, Lau S L 1991 Effects of in-plane load on nonlinear panel flutter by incremental 
harmonic balance method. AIAA J. 29: 1472-1479 
Zhao L C, Yang Z C 1990 Chaotic motions of an airfoil with nonlinear stiffness in incompressible 
flow. J. Sound Vib. 138: 245-254 




Sadhana, Vol. 20, Parts 2-4, April-August 1995, pp. 691-707. © Printed in India.


Coupled in-line and transverse flow-induced structural 
vibration: Higher order harmonic solutions 

R GHANEM¹, P SPANOS² and S SWERDON²

¹Department of Civil Engineering, The Johns Hopkins University, Baltimore,
MD 21218, USA

²Department of Civil Engineering, George R Brown School of Engineering,
Rice University, Houston, TX 77251, USA

Abstract. Coupling of in-line and transverse flow-induced vibration can
be significant in several structural engineering applications. In this study,
a particular model already proposed in the literature is adopted. The
relevant equations of this model feature nonlinearities and coupling between
the two components of motion. The level of coupling depends on the
relative properties of the flow and the structure. A harmonic balance
technique is used to obtain an equivalent coupled system of linear equations.
Higher order harmonics can be readily treated, yielding a quite
general formulation which can be tailored to match the available computational
resources. A parametric study is conducted using the proposed
technique, and insight regarding the significance of the coupling of the two
components of motion on the overall structural response is obtained. The
proposed technique is applicable to any other refined model of coupling of
in-line and transverse flow-induced vibration which may become available
based on theoretical or experimental fluid mechanics developments.

Keywords. Offshore structures; vibrations; harmonic balance; vortex
dynamics.


1. Introduction 

The mechanics of wave forces on structures is an important aspect of engineering. Of these
forces, those due to vortex-induced oscillations are probably among the most difficult to quantify.
Theoretical justifications of vortex-induced oscillations have proceeded hand in hand with
experimental verifications (Griffin et al 1973; Blevins 1990). A comprehensive review of the
state-of-the-art on this topic was presented by Sarpkaya (1979). Some
structures can be viewed as undergoing oscillations which are governed, in the in-line
direction (parallel to the flow), by Morison’s equation (Blevins 1990). This equation, by
itself, cannot account for the observed oscillations along a direction perpendicular,
or transverse, to the flow. The magnitude of these transverse oscillations can be
such that ignoring them may yield unconservative estimates of the associated structural
loads. This problem can be remedied by including vortex-induced forces in the equations
governing the motion of the structure. The inclusion of these forces results in two coupled
differential equations which must be solved simultaneously. The numerical integration of
these equations, although not trivial, is possible using specialized numerical analysis
tools (Griffin et al 1973). In many instances, however, the cost of computation is
significant, and a quick, good estimate of the behavior of the structure is more valuable
than an accurate result which may require unavailable numerical codes. This fact has
prompted a number of researchers to seek approximations to the solution of the coupled
system of equations using various techniques. The most important of these simplifications
involve transposing the problem to the frequency domain. Invariably, this procedure has
been carried out either by linearizing the equations of motion (Rajabi et al 1984) after
uncoupling them, or by linearizing the equations in the in-line direction and neglecting the
motion in the transverse direction (Chen & Lin 1989; Spanos & Chen 1980). Obviously,
these simplifications are incapable of capturing the nonlinear nature of the structural
system, which may involve oscillations at sub-harmonic frequencies. Furthermore, these
approximations neglect the effect of the coupling between the in-line and transverse
oscillations; this simplification has not been validated experimentally.

A list of symbols is given at the end of the paper

In this paper, an approximate technique is presented for the solution of the coupled 
equations governing the motion of a structure which is modelled as a two-degree-of- 
freedom system. The approximation involves an expansion in higher order harmonics, 
thus permitting a useful representation of nonlinear effects. Also, the coupling between 
the two components of motion is included, and its effect is investigated by comparing 
the results to corresponding results associated with the uncoupled equations. 

2. Equations of motion 

Figure 1. Two-degree-of-freedom model of an offshore structure.

Figure 1 shows a two-degree-of-freedom model of a vibrating structure subjected to
loads produced by oscillatory flow. The structure is represented by a cylindrical
component whose planar motion is restrained by two viscous dampers and two linear
springs. The current velocity is assumed to be zero, and the cylinder is subjected to an
oscillating flow of magnitude $U$. This assumption does not affect the development of the general
solution technique. In addition to this externally applied flow, vortex and flow-induced
oscillations will develop in the form of excitations which are nonlinear functions of the
response in both the in-line and the transverse directions. The motion of this simplified
model is governed by the equations


$$m_0\ddot{x} + 2\zeta_0\omega_0 m_0\dot{x} + \omega_0^2 m_0 x = f_x(t), \tag{1}$$

$$m_0\ddot{y} + 2\zeta_0\omega_0 m_0\dot{y} + \omega_0^2 m_0 y = f_y(t). \tag{2}$$


In the above equations $x$ and $y$ denote the motion of the structure in the x and
y directions respectively, while $f_x(t)$ and $f_y(t)$ denote the resultant forces acting,
respectively, along these directions. The excitations $f_x(t)$ and $f_y(t)$ are composed of
a drag component $f_D$, an added mass component $f_I$, and a component $f_V$ which is due
to vortex shedding at a frequency $\omega_v$. The combined excitations can be written as
(Rajabi et al 1984; Dailey et al 1987)

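$$f_x(t) = f_{xD} + f_{xI} + f_{xV} = \tfrac{1}{2}C_D\rho D\left((\dot{u}-\dot{x})^2 + \dot{y}^2\right)^{1/2}(\dot{u}-\dot{x}) + C_M\rho A(\ddot{u}-\ddot{x}) + \rho A\ddot{x} - \tfrac{1}{2}C_L\rho D\left((U_v-\dot{x})^2 + \dot{y}^2\right)^{1/2}|\dot{y}|\cos(\omega_v t), \tag{3}$$

and

$$f_y(t) = f_{yD} + f_{yI} + f_{yV} = -\tfrac{1}{2}C_D\rho D\left((\dot{u}-\dot{x})^2 + \dot{y}^2\right)^{1/2}\dot{y} + (1 - C_M)\rho A\ddot{y} + \tfrac{1}{2}C_L\rho D\left((U_v-\dot{x})^2 + \dot{y}^2\right)^{1/2}|U_v-\dot{x}|\cos(\omega_v t), \tag{4}$$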

where 

$$\dot{u} = \omega U\left(-\sin(\omega t) + \cos(\omega t)\right) \tag{5}$$

denotes the water particle velocity which is assumed, without loss of generality, to be in
the x direction. In the above equations $C_D$ denotes a drag coefficient, $C_M$ denotes an
added mass coefficient and $C_L$ denotes a lift coefficient. Furthermore, the symbols $\rho$, $D$, and
$A$ denote the density of the fluid, the diameter of the immersed cylinder, and
its cross-sectional area, respectively. It will prove practical for the subsequent development to use
the nondimensional time $\tau$ defined by the equation

$$\tau = \omega t. \tag{6}$$
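
The excitations (3) and (4) can be evaluated directly once the instantaneous velocities and accelerations of the cylinder are known. The Python sketch below is one hedged way of doing so; the function name, the argument names and the numerical values in the usage line are illustrative assumptions, and the relation $U_v = \omega U$ is taken from the normalized coefficients defined in this section.

```python
import math


def wave_forces(t, xd, xdd, yd, ydd, *, rho, D, A, C_D, C_M, C_L, U, omega, omega_v):
    """Evaluate f_x and f_y of equations (3) and (4) at time t.

    xd, xdd, yd, ydd are the structural velocities and accelerations.
    """
    # Water particle velocity, equation (5), and its time derivative.
    ud = omega * U * (math.cos(omega * t) - math.sin(omega * t))
    udd = -omega ** 2 * U * (math.sin(omega * t) + math.cos(omega * t))
    U_v = omega * U  # amplitude used in the vortex terms

    rel = math.hypot(ud - xd, yd)     # relative speed in the drag terms
    rel_v = math.hypot(U_v - xd, yd)  # relative speed in the vortex terms
    c = math.cos(omega_v * t)

    f_x = (0.5 * C_D * rho * D * rel * (ud - xd)
           + C_M * rho * A * (udd - xdd) + rho * A * xdd
           - 0.5 * C_L * rho * D * rel_v * abs(yd) * c)
    f_y = (-0.5 * C_D * rho * D * rel * yd
           + (1.0 - C_M) * rho * A * ydd
           + 0.5 * C_L * rho * D * rel_v * abs(U_v - xd) * c)
    return f_x, f_y


# Hypothetical values: cylinder at rest at t = 0.
print(wave_forces(0.0, 0.0, 0.0, 0.0, 0.0, rho=1.0, D=1.0, A=1.0,
                  C_D=1.45, C_M=1.25, C_L=1.0, U=1.0, omega=2.0, omega_v=3.0))
```

With the cylinder at rest, the vortex term drops out of $f_x$ (it is proportional to $|\dot{y}|$) while $f_y$ reduces to the pure lift term, which makes the coupling between the two directions easy to see.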

It will also prove expedient to define the following normalized coefficients

$$C_A = C_M - 1, \tag{7}$$

$$\zeta_N = \zeta_0 (m_0/m)^{1/2}, \tag{8}$$

$$\omega_N = \omega_0 (m_0/m)^{1/2}, \tag{9}$$

$$\beta = \omega/\omega_N, \tag{10}$$

$$m = m_0 + C_A\rho A, \tag{11}$$

$$\mu = U/D, \tag{12}$$

$$U_v = \omega U. \tag{13}$$
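
The normalized coefficients (7)-(13) amount to a short sequence of arithmetic steps, sketched below in Python. The function name, the dictionary keys and the sample values are illustrative assumptions; only the formulas come from the text.

```python
import math


def normalized_coefficients(m0, zeta0, omega0, omega, U, D, rho, A, C_M):
    """Return the normalized coefficients of equations (7)-(13) as a dict."""
    C_A = C_M - 1.0               # eq. (7)
    m = m0 + C_A * rho * A        # eq. (11): structural plus added mass
    ratio = math.sqrt(m0 / m)
    zeta_N = zeta0 * ratio        # eq. (8)
    omega_N = omega0 * ratio      # eq. (9)
    beta = omega / omega_N        # eq. (10)
    mu = U / D                    # eq. (12)
    U_v = omega * U               # eq. (13)
    return {"C_A": C_A, "m": m, "zeta_N": zeta_N, "omega_N": omega_N,
            "beta": beta, "mu": mu, "U_v": U_v}


# Hypothetical input values, for illustration only.
coeffs = normalized_coefficients(m0=100.0, zeta0=0.02, omega0=2.0, omega=1.0,
                                 U=0.5, D=0.25, rho=1000.0, A=0.05, C_M=1.25)
print(coeffs)
```

A useful sanity check is that $\omega_N^2 m = \omega_0^2 m_0$: the added mass changes the natural frequency but not the spring stiffness.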


Furthermore, let the function $v(\tau)$ be represented by the equation

$$v(\tau) = \cos(\tau) - \sin(\tau). \tag{14}$$

Using the above notation, and denoting differentiation with respect to $\tau$ by a prime, the
equation of motion in the x direction becomes

$$x'' + 2\zeta_0(\omega_0/\omega)x' + (\omega_0/\omega)^2 x = f_x(t)/(m_0\omega^2). \tag{15}$$

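Substituting equation (3) for $f_x(t)$ gives

$$x'' + 2\zeta_0\frac{\omega_0}{\omega}x' + \left(\frac{\omega_0}{\omega}\right)^2 x = \frac{1}{2}\frac{C_D\rho D}{m_0}\left((u'-x')^2 + y'^2\right)^{1/2}(u'-x') + \frac{C_M\rho A}{m_0}(u''-x'') + \frac{\rho A}{m_0}x'' - \frac{1}{2}\frac{C_L\rho D}{m_0}\left(\left(\frac{U_v}{\omega}-x'\right)^2 + y'^2\right)^{1/2}|y'|\cos\left(\frac{\omega_v}{\omega}\tau\right), \tag{16}$$

or equivalently

$$\left(1 + \frac{C_A\rho A}{m_0}\right)x'' + 2\zeta_0\frac{\omega_0}{\omega}x' + \left(\frac{\omega_0}{\omega}\right)^2 x = \frac{1}{2}\frac{C_D\rho D}{m_0}\left((u'-x')^2 + y'^2\right)^{1/2}(u'-x') + \frac{C_M\rho A}{m_0}u'' - \frac{1}{2}\frac{C_L\rho D}{m_0}\left(\left(\frac{U_v}{\omega}-x'\right)^2 + y'^2\right)^{1/2}|y'|\cos\left(\frac{\omega_v}{\omega}\tau\right). \tag{17}$$

Dividing both sides of the equation by the inertia term $(1 + (C_A\rho A/m_0))$, and dividing
again by $D$, yields the following equation

$$\frac{x''}{D} + \frac{2\zeta_N}{\beta}\frac{x'}{D} + \frac{1}{\beta^2}\frac{x}{D} = \frac{1}{2(1 + (C_A\rho A/m_0))}\frac{C_D\rho D^2}{m_0}\left(\left(\mu v(\tau) - \frac{x'}{D}\right)^2 + \left(\frac{y'}{D}\right)^2\right)^{1/2}\left(\mu v(\tau) - \frac{x'}{D}\right) + \frac{1}{(1 + (C_A\rho A/m_0))}\frac{C_M\rho A}{m_0}\mu v'(\tau) - \frac{1}{2(1 + (C_A\rho A/m_0))}\frac{C_L\rho D^2}{m_0}\left(\left(\mu - \frac{x'}{D}\right)^2 + \left(\frac{y'}{D}\right)^2\right)^{1/2}\left|\frac{y'}{D}\right|\cos\left(\frac{\omega_v}{\omega}\tau\right). \tag{18}$$

Noting that

$$m_0\left(1 + \frac{C_A\rho A}{m_0}\right) = m, \tag{19}$$

the above equation of motion in the x direction simplifies to

$$\frac{x''}{D} + \frac{2\zeta_N}{\beta}\frac{x'}{D} + \frac{1}{\beta^2}\frac{x}{D} = \frac{1}{2}\frac{C_D\rho D^2}{m}\left(\left(\mu v(\tau) - \frac{x'}{D}\right)^2 + \left(\frac{y'}{D}\right)^2\right)^{1/2}\left(\mu v(\tau) - \frac{x'}{D}\right) + \frac{C_M\rho A}{m}\mu v'(\tau) - \frac{1}{2}\frac{C_L\rho D^2}{m}\left(\left(\mu - \frac{x'}{D}\right)^2 + \left(\frac{y'}{D}\right)^2\right)^{1/2}\left|\frac{y'}{D}\right|\cos\left(\frac{\omega_v}{\omega}\tau\right). \tag{20}$$

Figure 2. Maximum value of the normalized in-line displacement; $V_r = 2.2$,
$\rho D^2/m = 0.2275$.

Figure 3. Maximum value of the normalized transverse displacement; $V_r = 2.2$,
$\rho D^2/m = 0.2275$.

Figure 4. Maximum value of the normalized in-line displacement; $V_r = 2.2$,
$\rho D^2/m = 0.01136$.

Figure 5. Maximum value of the normalized transverse displacement; $V_r = 2.2$,
$\rho D^2/m = 0.01136$.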

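In a similar fashion, the wave forces in the y direction can be expressed in the following
form, featuring a drag, an inertia, and a vortex term

$$f_y(t) = f_{yD} + f_{yI} + f_{yV} = -\tfrac{1}{2}C_D\rho D\left((\dot{u}-\dot{x})^2 + \dot{y}^2\right)^{1/2}\dot{y} + (1 - C_M)\rho A\ddot{y} + \tfrac{1}{2}C_L\rho D\left((U_v-\dot{x})^2 + \dot{y}^2\right)^{1/2}|U_v-\dot{x}|\cos(\omega_v t). \tag{21}$$

Repeating the normalization procedure applied to (1), equation (2), governing the motion in the
y direction, is transformed into the following normalized nondimensional form

$$\frac{y''}{D} + \frac{2\zeta_N}{\beta}\frac{y'}{D} + \frac{1}{\beta^2}\frac{y}{D} = -\frac{1}{2}\frac{C_D\rho D^2}{m}\left(\left(\mu v(\tau) - \frac{x'}{D}\right)^2 + \left(\frac{y'}{D}\right)^2\right)^{1/2}\frac{y'}{D} + \frac{1}{2}\frac{C_L\rho D^2}{m}\left(\left(\mu - \frac{x'}{D}\right)^2 + \left(\frac{y'}{D}\right)^2\right)^{1/2}\left|\mu - \frac{x'}{D}\right|\cos\left(\frac{\omega_v}{\omega}\tau\right). \tag{22}$$

Of special interest in analysing offshore structures subjected to vortex-induced vibration
is the condition of locking. This phenomenon corresponds to the case where the
frequency of vortex-induced oscillations coincides with the fundamental frequency of
vibration of the structure, in which case,

$$\omega_v = \omega_0. \tag{23}$$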


3. Solution of the equations of motion 


The equations of motion developed in the previous section can be rewritten as

$$\frac{x''}{D} + \frac{2\zeta_N}{\beta}\frac{x'}{D} + \frac{1}{\beta^2}\frac{x}{D} = F_x, \qquad \frac{y''}{D} + \frac{2\zeta_N}{\beta}\frac{y'}{D} + \frac{1}{\beta^2}\frac{y}{D} = F_y. \tag{24}$$

Figure 6. Maximum value of the normalized in-line displacement; $V_r = 16$,
$\rho D^2/m = 0.2275$.

Figure 7. Maximum value of the normalized transverse displacement; $V_r = 16$,
$\rho D^2/m = 0.2275$.


A solution will be sought to the problem by resorting to the harmonic balance
technique. This requires expanding the response functions $x(\tau)$ and $y(\tau)$ in a Fourier
series of the form

$$\frac{x(\tau)}{D} = x_0 + \sum_{i=1}^{\infty}\left[x_{2i-1}\sin(i\tau) + x_{2i}\cos(i\tau)\right], \tag{25}$$

$$\frac{y(\tau)}{D} = y_0 + \sum_{i=1}^{\infty}\left[y_{2i-1}\sin(i\tau) + y_{2i}\cos(i\tau)\right]. \tag{26}$$

The functions $F_x$ and $F_y$ are also expanded into their respective Fourier series in the
form

$$F_x(\tau) = f_0^{(x)} + \sum_{i=1}^{\infty}\left[f_{2i-1}^{(x)}\sin(i\tau) + f_{2i}^{(x)}\cos(i\tau)\right], \tag{27}$$

$$F_y(\tau) = f_0^{(y)} + \sum_{i=1}^{\infty}\left[f_{2i-1}^{(y)}\sin(i\tau) + f_{2i}^{(y)}\cos(i\tau)\right]. \tag{28}$$

Figure 8. Maximum value of the normalized in-line displacement; $V_r = 16$,
$\rho D^2/m = 0.01136$.

Substituting the last four equations back into the differential equations of motion
results in equations of the following form involving coefficients of the same order in the
expansions for $x(\tau)$

$$\frac{1}{\beta^2}x_0 + \sum_{i=1}^{\infty}\left[-i^2 x_{2i-1}\sin(i\tau) + i x_{2i-1}\frac{2\zeta_N}{\beta}\cos(i\tau) + x_{2i-1}\frac{1}{\beta^2}\sin(i\tau) - i^2 x_{2i}\cos(i\tau) - i x_{2i}\frac{2\zeta_N}{\beta}\sin(i\tau) + x_{2i}\frac{1}{\beta^2}\cos(i\tau)\right] = f_0^{(x)} + \sum_{i=1}^{\infty}\left[f_{2i-1}^{(x)}\sin(i\tau) + f_{2i}^{(x)}\cos(i\tau)\right]. \tag{29}$$

Equating coefficients of sin and cos on both sides of the above equation results in the
following algebraic equations for the Fourier coefficients of $x(\tau)$

$$\begin{aligned} \left((1/\beta^2) - i^2\right)x_{2i-1} - 2i(\zeta_N/\beta)x_{2i} &= f_{2i-1}^{(x)}, \\ 2i(\zeta_N/\beta)x_{2i-1} + \left((1/\beta^2) - i^2\right)x_{2i} &= f_{2i}^{(x)}. \end{aligned} \tag{30}$$

These can be solved in closed form to yield the following expression for the Fourier
coefficients

$$\begin{aligned} x_0 &= \beta^2 f_0^{(x)}, \\ x_{2i-1} &= \frac{f_{2i-1}^{(x)}\left((1/\beta^2) - i^2\right) + f_{2i}^{(x)}\,2i(\zeta_N/\beta)}{\left((1/\beta^2) - i^2\right)^2 + \left(2i(\zeta_N/\beta)\right)^2}, \\ x_{2i} &= \frac{f_{2i}^{(x)}\left((1/\beta^2) - i^2\right) - f_{2i-1}^{(x)}\,2i(\zeta_N/\beta)}{\left((1/\beta^2) - i^2\right)^2 + \left(2i(\zeta_N/\beta)\right)^2}. \end{aligned} \tag{31}$$

A similar treatment for the equations in the transverse direction results in the following
equations for the Fourier coefficients of $y(\tau)$

$$\begin{aligned} y_0 &= \beta^2 f_0^{(y)}, \\ y_{2i-1} &= \frac{f_{2i-1}^{(y)}\left((1/\beta^2) - i^2\right) + f_{2i}^{(y)}\,2i(\zeta_N/\beta)}{\left((1/\beta^2) - i^2\right)^2 + \left(2i(\zeta_N/\beta)\right)^2}, \\ y_{2i} &= \frac{f_{2i}^{(y)}\left((1/\beta^2) - i^2\right) - f_{2i-1}^{(y)}\,2i(\zeta_N/\beta)}{\left((1/\beta^2) - i^2\right)^2 + \left(2i(\zeta_N/\beta)\right)^2}. \end{aligned} \tag{32}$$

Figure 9. Maximum value of the normalized transverse displacement; $V_r = 16$,
$\rho D^2/m = 0.01136$.

Figure 10. Coupled vs. uncoupled predictions; maximum value of the normalized
in-line displacement; $V_r = 16$, $\rho D^2/m = 0.2275$.
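
The closed-form solution of the pair of algebraic equations for each harmonic $i$ is a 2×2 linear solve, which the following Python sketch carries out for one direction. The function names and the sample numbers are illustrative assumptions; the same routine would serve for the transverse direction by passing the coefficients of $F_y$.

```python
def mean_response(f0, beta):
    """Constant term of the response: x_0 = beta**2 * f_0."""
    return beta ** 2 * f0


def harmonic_response(f_sin, f_cos, i, beta, zeta_n):
    """Sine and cosine response coefficients (x_{2i-1}, x_{2i}) of harmonic i.

    f_sin and f_cos are the sine and cosine Fourier coefficients
    (f_{2i-1}, f_{2i}) of the forcing function for the same harmonic.
    """
    a = 1.0 / beta ** 2 - i ** 2   # stiffness minus inertia contribution
    b = 2.0 * i * zeta_n / beta    # damping contribution
    det = a * a + b * b
    x_sin = (f_sin * a + f_cos * b) / det
    x_cos = (f_cos * a - f_sin * b) / det
    return x_sin, x_cos


# Hypothetical forcing coefficients for the second harmonic.
print(harmonic_response(0.3, -0.7, i=2, beta=0.8, zeta_n=0.02))
```

Substituting the returned pair back into the two algebraic equations reproduces the forcing coefficients, which is a convenient check when the routine is embedded in an iteration.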


Obviously, in the above expressions, $f_i^{(x)}$ and $f_i^{(y)}$ denote the Fourier coefficients of the
functions $F_x$ and $F_y$, which themselves are nonlinear functions of $x(\tau)$ and $y(\tau)$.
Therefore, in the implementation of the above procedure, an iterative scheme is
followed whereby previous estimates of $x(\tau)$ and $y(\tau)$ are used in evaluating the
numerical values of $f_i^{(x)}$ and $f_i^{(y)}$. The iteration is continued until a predetermined
convergence tolerance has been achieved. In the numerical results shown in the next
section, the norm


$$\|\varepsilon\| = \left(\frac{\sum_{i=0}^{N}\left(x_i^{(k)} - x_i^{(k-1)}\right)^2}{\sum_{i=0}^{N}\left(x_i^{(k)}\right)^2}\right)^{1/2} \tag{33}$$


of the approximation error is used in establishing the convergence criterion. In the above
expression, $x_i^{(k)}$ denotes the ith Fourier coefficient as obtained at the kth iteration.
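
Each pass of the iterative scheme needs the Fourier coefficients of $F_x$ and $F_y$ evaluated with the previous estimates of the response. The paper does not state how this projection was computed; the Python sketch below assumes a uniform sampling of one period and a simple rectangle-rule quadrature, and the function name and sample count are likewise illustrative.

```python
import math


def fourier_coefficients(F, order, samples=256):
    """Project a 2*pi-periodic function F(tau) onto 1, sin(i*tau), cos(i*tau).

    Returns (f0, f_sin, f_cos), where f_sin[i-1] and f_cos[i-1] are the
    sine and cosine coefficients of harmonic i, for i = 1..order.
    """
    taus = [2.0 * math.pi * k / samples for k in range(samples)]
    values = [F(t) for t in taus]
    f0 = sum(values) / samples
    f_sin, f_cos = [], []
    for i in range(1, order + 1):
        f_sin.append(2.0 / samples * sum(v * math.sin(i * t) for v, t in zip(values, taus)))
        f_cos.append(2.0 / samples * sum(v * math.cos(i * t) for v, t in zip(values, taus)))
    return f0, f_sin, f_cos


# Hypothetical forcing with a mean, a first and a second harmonic.
f0, f_sin, f_cos = fourier_coefficients(
    lambda tau: 0.5 + 2.0 * math.sin(tau) + 3.0 * math.cos(2.0 * tau), order=3)
print(f0, f_sin, f_cos)
```

In the iteration, `F` would be the right-hand side of the normalized equation of motion with the current response estimates substituted; the number of samples should comfortably exceed twice the highest harmonic retained.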

The results obtained from the coupled equations of motion are
compared to those obtained using the uncoupled equations. These are obtained from (20)
and (22) by neglecting the effect of the vortex-induced forces along the in-line direction, and
by neglecting the contribution of the in-line motion to the transverse motion. The
uncoupled equations are given, in their normalized form, by the expressions


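$$\frac{x''}{D} + \frac{2\zeta_N}{\beta}\frac{x'}{D} + \frac{1}{\beta^2}\frac{x}{D} = \frac{1}{2}\frac{C_D\rho D^2}{m}\left|\mu v(\tau) - \frac{x'}{D}\right|\left(\mu v(\tau) - \frac{x'}{D}\right) + \frac{C_M\rho A}{m}\mu v'(\tau), \tag{34}$$

and

$$\frac{y''}{D} + \frac{2\zeta_N}{\beta}\frac{y'}{D} + \frac{1}{\beta^2}\frac{y}{D} = -\frac{1}{2}\frac{C_D\rho D^2}{m}\left|\frac{y'}{D}\right|\frac{y'}{D} + \frac{1}{2}\frac{C_L\rho D^2}{m}\mu^2\cos\left(\frac{\omega_v}{\omega}\tau\right). \tag{35}$$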


4. Numerical results 


A parametric study was carried out to assess the range of validity of the proposed
approximation scheme. Different values of the reduced velocity coefficient $V_r$ were
considered, as well as different values of the mass ratio coefficient $M_r = m/\rho D^2$. The
reduced velocity coefficient can be readily related to the coefficient $\mu$ appearing in the
equations of motion using the relation

$$V_r = (2\pi/\omega_N)(U_v/D) = 2\pi\beta\mu. \tag{36}$$

Values of $V_r$ equal to 16 and 2.2 were used in the parametric study. These values
correspond to situations selected from offshore engineering applications where vortex-induced
oscillations are expected to occur ($V_r = 16$), and situations where they are not
expected to occur ($V_r = 2.2$) (Blevins 1990). Furthermore, values of $1/M_r$ equal to
0.01136 and 0.2275 were used, being representative of a heavy structure, and a less
heavy structure, respectively. Also, a value of $\zeta_N$ equal to 0.02 was used as representative
of the amount of structural damping present in a lightweight structure. The values of
the other coefficients were: $C_D = 1.45$, $C_L = 1$ and $C_M = 1.25$.
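
Because the reduced velocity is held fixed while the frequency ratio $\beta$ is swept, relation (36) fixes $\mu$ at each point of the sweep. A minimal Python sketch of that bookkeeping follows; the function name and the sample values of $\beta$ are assumptions made for illustration.

```python
import math


def mu_from_reduced_velocity(V_r, beta):
    """Invert V_r = 2*pi*beta*mu for the nondimensional amplitude mu."""
    return V_r / (2.0 * math.pi * beta)


# Hypothetical sweep over the frequency ratio at the two reduced velocities
# used in the parametric study.
for V_r in (2.2, 16.0):
    for beta in (0.25, 0.5, 1.0):
        print(V_r, beta, mu_from_reduced_velocity(V_r, beta))
```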

In the numerical implementation of the procedure described in the previous section,
convergence problems were encountered with the iterative procedure as described
above. For a value of $V_r$ equal to 16, the iterative algorithm failed to converge. This was
due to a relatively large amplitude of oscillations. The problem was circumvented by
adding the following term to both sides of each of the equations of motion,

$$\tfrac{1}{2}\rho C_D D E_0\dot{x}, \qquad E_0 = (1/T)\int_0^T |\dot{u} - \dot{x}|\,dt, \qquad T = 2\pi/\omega. \tag{37}$$

The effect of this additional damping was to reduce the magnitude of the terms used in
evaluating the components $f_i^{(x)}$ and $f_i^{(y)}$, while having no net effect on the final result.
The nondimensional form of the above term is

$$(C_D/4\pi)(\rho D^2/m)\int_0^{2\pi}|u' - x'|\,d\tau\; x'. \tag{38}$$


Once this artificial damping was introduced into the governing equations, convergence
to within a value of $\|\varepsilon\|$ equal to 0.001 was achieved within three to five iterations,
requiring a fraction of the time needed for the numerical integration of the equations
of motion. The approximate technique was carried out for orders of the Fourier
expansion equal to 1, 2, 3, 4, and 5, respectively. Only results pertaining to the first
order and fifth order approximations are displayed in this paper. Furthermore, the
results shown herein relate to the maximum value of the displacement within one
period. The magnitude of this parameter is deemed an important indicator reflecting
the maximum level of stresses to be expected on the structure. Figures 2 and 3 show
the results corresponding to a value of $V_r$ equal to 2.2, and $\rho D^2/m$ equal to 0.2275.
Both in-line and transverse displacements are featured. Figures 4 and 5 show the results
corresponding to a value of $V_r$ equal to 2.2 and $\rho D^2/m$ equal to 0.01136. Note the
quite good agreement between the time integration results and the approximate
harmonic balance results. Figures 6-9 show the results corresponding to a value
of $V_r$ equal to 16. In this case, good results are obtained for a value of $\rho D^2/m$ equal
to 0.01136. Note that even the secondary peak at a value of $\beta$ equal to 0.35 was
successfully identified. For a value of $\rho D^2/m$ equal to 0.2275, quite good results
are again obtained for the in-line displacement. For the displacement in the transverse
direction, however, the results, although acceptable, are not of the same quality as
those for the other cases. It should be noted, in this context, that for this particular
case, the results associated with the uncoupled equations differ substantially from
the results associated with the coupled equations. Noticeable differences between
coupled and uncoupled results were also observed for the cases corresponding to the
in-line displacement with $V_r$ equal to 16 and $\rho D^2/m$ equal to 0.2275, and for the
transverse displacement with $V_r$ equal to 2.2 and $\rho D^2/m$ equal to 0.2275. The results
corresponding to these three cases are shown in figures 10-12. These differences are
such that a design approach based on the uncoupled equations of motion may prove
unreliable.

Figure 11. Coupled vs. uncoupled predictions; maximum value of the normalized
transverse displacement; $V_r = 16$, $\rho D^2/m = 0.2275$.

Figure 12. Coupled vs. uncoupled predictions; maximum value of the normalized
transverse displacement; $V_r = 16$, $\rho D^2/m = 0.01136$.


5. Conclusions 

An approximate technique is presented for evaluating the response of structures 
modelled as two-degree-of-freedom systems. The effect of vortex-induced oscillations is 
taken into consideration, as well as the coupling of the equations associated with the 
various hydrodynamic loads on the structure. Linearization of the equations of motion 
is circumvented through the development of an iterative algorithm which imple 
a Fourier series of the response along each direction; Fourier series of arbitrari 1 
order can be readily incorporated. The computer time 
proposed technique is a fraction of the time required to 
integration. This fact makes the proposed method well sui 
small personal computers that are limited in their access to s 
software. The accuracy demonstrated by the numerical examples suggests that the
technique may still be quite appealing even in a more sophisticated setting. Furthermore,
using both the proposed technique and numerical integration, the effect of the
coupling in the equations of motion was investigated. It was shown that the results
associated with the uncoupled equations can be significantly different from those
associated with the coupled equations. This fact suggests that caution should be
exercised by researchers and practitioners who base their professional recommendations
on one of the many available techniques that rely on the uncoupled form of the
equations of motion. Finally, although the proposed technique was implemented for
a simplified two-degree-of-freedom model of a structure, its extension to deal with
multi-degree-of-freedom systems can be readily accomplished. The results from the
present investigation can be regarded as warranting a research endeavor in that
direction.

List of symbols

$C_A$    $C_M - 1$;
$E_0$    energy of one period;
$f_i^{(x)}$    ith Fourier coefficient of $f_x(t)$;
$f_i^{(y)}$    ith Fourier coefficient of $f_y(t)$;
$f_{jD}$    drag force in direction j;
$f_x$    total force in direction x;
$T$    period of oscillations;
$V_r$    reduced velocity;
$x$    in-line displacement of cylinder;
$x_i$    ith Fourier coefficient of $x(t)$;
$y_i$    ith Fourier coefficient of $y(t)$;
$\beta$    non-dimensional frequency of oscillation;
$\mu$    non-dimensional water elevation;
$\zeta_0$    damping ratio;
$\zeta_N$    normalized damping ratio;
$\omega$    frequency of wave forces;
$\omega_0$    natural frequency of cylinder;
$\omega_v$    frequency of vortex oscillations;
$\varepsilon$    error in iterations;
$\rho$    density of fluid;
$\tau$    nondimensional time.

Analysis of the equations of motion of linearized controlled
structures 

FAI MA 

Department of Mechanical Engineering, University of California, Berkeley, 
California 94720, USA 

Abstract. The linearized equations of motion of controlled structures 
possess coefficient matrices that lack the familiar properties of symmetry 
and definiteness. A method is developed for the efficient analysis of linearized
controlled structures. This constructive method utilizes equivalence
transformations in Lagrangian coordinates and does not require conversion
of the equations of motion to first-order forms. Compared with the state-space 
approach, this method can offer substantial reduction in computational effort 
and ample physical insight. However, it is often necessary to draw upon some 
type of decoupling approximation for fast solution. Many numerical 
techniques involve discretized equations resembling those of linearized 
controlled structures. These numerical techniques can also be greatly 
streamlined if the method of equivalence transformations is incorporated. 

Keywords. Linearized control structures; equivalence transformation; 
classical modal analysis; Lagrangian coordinates. 


1. Introduction 

A time-honoured procedure for the approximate solution of nonlinear mechanical
systems is linearization, or in some cases piecewise linearization. The linearized
equation of motion of an n-degree-of-freedom controlled structure can be written as

$$A\ddot{q} + B\dot{q} + Cq = f(t), \tag{1}$$

where $A$, $B$ and $C$ are arbitrary square matrices of order n. Embedded in these real
coefficient matrices are the control gains, and as a result they need not possess any of
the familiar properties of symmetry or definiteness. The Lagrangian coordinate $q$ and
the generalized excitation $f(t)$ are n-dimensional vectors. The various ways in which
equations of the above type can arise have been discussed by Soong (1990) and other
authors. These equations have become widespread in recent years, owing
mostly to the increasing use of control devices in structures. The development of a fast
method for the analysis of such equations is thus well warranted.

In theory, it is always possible and in fact customary to investigate a linearized
controlled structure with the state-space approach, whereby the second-order equation
(1) is recast into a first-order system of dimension 2n. After conversion into a first-order
form, a large variety of numerical techniques are then available for subsequent analysis.
But the state-space method has never appealed to structural engineers. An inordinate
amount of computational effort is usually given as a reason. More importantly, there is
a serious absence of physical insight in tackling a first-order equation recast from the
Lagrangian formulation. The reluctance to employ state-space techniques has led to
the development of a whole array of special techniques for analysing subclasses of
linearized controlled structures in the Lagrangian coordinate q. Some of these
developments have been reported by Huseyin & Leipholz (1973), Fawzy & Bishop (1976),
Inman (1983), and Caughey & Ma (1993).
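
For reference, the state-space recasting described above can be sketched in a few lines. This is only an illustration, assuming NumPy and a nonsingular A; the helper name `state_space_matrix` and the sample 2 x 2 matrices are hypothetical and do not come from the original investigation.

```python
import numpy as np

def state_space_matrix(A, B, C):
    """Return the 2n x 2n matrix G of the first-order form x' = G x + g(t),
    with x = [q, q'] and A q'' + B q' + C q = f(t). Assumes A is nonsingular."""
    n = A.shape[0]
    top = np.hstack([np.zeros((n, n)), np.eye(n)])
    bottom = np.hstack([-np.linalg.solve(A, C), -np.linalg.solve(A, B)])
    return np.vstack([top, bottom])

# Hypothetical 2-degree-of-freedom system with nonsymmetric coefficients.
A = np.array([[2.0, 0.0], [1.0, 1.0]])
B = np.array([[0.1, 0.0], [0.0, 0.2]])
C = np.array([[3.0, -1.0], [0.5, 2.0]])

G = state_space_matrix(A, B, C)   # order 2n = 4
```

The eigenvalues of G are the roots alpha of det(alpha^2 A + alpha B + C) = 0, but the matrix to be analysed is of order 2n rather than n, which is the origin of the extra computational effort mentioned above.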

Equations of the type (1) are coincident with those governing linear nonconservative
vibrations, which have drawn relatively scant attention in classical theory.
To be sure, it would be preferable if classical modal analysis could somehow be
modified to treat systems with nonsymmetric coefficients. That might indeed have been
assumed in some earlier investigations. A literature survey in the testing of aircraft
flutter and in the stressing of ship hulls, for example, reveals rather occasional use of
terms such as logarithmic decrements, modes, or natural frequencies. But the meanings
of these terms are not clear. When does a linearized controlled structure possess
classical normal modes? What are the natural frequencies of a system with nonsymmetric
coefficients? A theoretical basis, on which these concepts can be properly explored, is
greatly desired.

Were system (1) symmetric and definite, the coefficient of acceleration would be
a positive-definite mass matrix M, and the coefficients of velocity and displacement
would respectively be positive-semidefinite damping and stiffness matrices D and K. In
this case, (1) would take the familiar form

$$M\ddot{q} + D\dot{q} + Kq = f(t). \tag{2}$$

In linear nonconservative vibrations, the difference between the matrices B and D is
sometimes accounted for by gyroscopic forces, and that between C and K by circulatory
forces (Muller & Schiehlen 1985). Hence B = 0 implies an undamped nongyroscopic
structure, and so on. Although this terminology may not strictly apply to
linearized controlled structures, it will be convenient to adhere to it.
Applications in which A is not symmetric are given, for example, by Schmitz (1973) as
well as Soom & Kim (1983).

The purpose of this article is to expound a fast method for the analysis of linearized 
controlled structures. This constructive method utilizes equivalence transformations in 
Lagrangian coordinates, and does not involve conversion of the equations of motion to 
first-order forms. As will be evident, the method represents a direct extension of 
classical modal analysis. The same method can certainly be applied to the analysis of 
linear nonconservative vibrations. The organization of this article is as follows. In § 2, it 
is shown that a controlled structure in which B = 0 can be completely decoupled by an 
equivalence transformation. In addition, the meaning of natural frequencies will be 
clarified. Equivalence transformation is applied to a general linearized structure in § 3. 
There, comparison is made with classical modal analysis of viscously damped systems. 
An illustrative example is given in §4, in which practical implications of the method 
are also discussed. A summary of findings is provided in § 5. The terms, linearized 
controlled structures and linear nonconservative vibrations, will be used interchangeably 
in exposition.


2. Undamped nongyroscopic structures

Consider a linearized controlled structure in which B = 0. The equation of motion is

$$A\ddot{q} + Cq = f(t). \tag{3}$$

This system can be decoupled if and only if there exist two nonsingular matrices U and
V such that VAU and VCU are diagonal. Two square matrices P and Q, related by
P = VQU, are said to be connected by an equivalence transformation. An equivalence
transformation between P and Q preserves the rank of the matrices. If $V = U^{-1}$, the
equivalence transformation is called a similarity transformation. In the event that
$V = U^T$, the equivalence transformation is a congruence transformation. The classical
modal transformation is an example of congruence transformation. A congruence
transformation is also a similarity transformation if U is an orthogonal matrix.
Equivalence transformations that are neither similarity nor congruence transformations
are rarely used in structural analysis. Nevertheless, in what follows, it will be
shown that a linearized controlled structure in which B = 0 can be decoupled by
equivalence transformation in practically every situation.

Let u be a column vector of order n and $\alpha$ be a scalar constant. If

$$q = u e^{\alpha t} \tag{4}$$

is a homogeneous or complementary solution to (3), the generalized eigenvalue
problem

$$Cu = \lambda Au \tag{5}$$

must be satisfied, where $\lambda = -\alpha^2$. Eigenvalue problems of this kind were traditionally
addressed in the abstract theory of matrix pencils. Emphasis was usually placed on
symmetric and definite pencils. As a consequence, results applicable to the above
eigenvalue problem are scattered and rather incomplete. An extension based on the
presentation of Zurmühl & Falk (1984) will be made here. Associated with eigenvalue
problem (5) is the adjoint eigenvalue problem

$$C^T v = \lambda A^T v. \tag{6}$$

As (5) and (6) lead to the same characteristic determinant, the corresponding
eigenvalues are identical.

At this point, the two assumptions underlying this investigation must be reviewed.
First, it is required that A is nonsingular. This is a common assumption made in earlier
investigations. Technically speaking, there is no practical loss of generality in
accommodating this assumption. Should the coefficient matrix A be singular, at least one
acceleration term can be removed from the formulation. A linearized controlled
structure is termed degenerate if its coefficient of acceleration A is singular. The
eigenvalue problem (5) possesses n eigenvalues if and only if A is nonsingular. Thus, the
implicit assumption that a system is not degenerate ensures the existence of a full set of
eigenvalues. The second requirement presumes independence of the eigenvectors
associated with eigenvalue problem (5). An eigenvalue problem is termed defective if it
does not possess a full complement of independent eigenvectors. Experience indicates
that an eigenvalue problem (5) possessing physical significance is invariably not
defective, and the assumption that there is a full complement of independent eigenvectors
can be made without fail in almost every application. A sufficient condition under
which problem (5) is not defective is for the associated eigenvalues to be distinct.
However, this is only a sufficient and not a necessary condition. In addition, the
eigenvalue problem (5) is not defective if and only if its adjoint problem (6) is not
defective. Henceforth, without practical loss of generality, it will be assumed that
a linearized controlled structure is not degenerate or defective.

Corresponding to each eigenvalue $\lambda_i$, an eigenvector $u_i$ can be found such that

$$Cu_i = \lambda_i A u_i. \tag{7}$$

Likewise, a solution to the adjoint eigenvalue problem associated with the eigenvalue $\lambda_j$
is expressed by

$$C^T v_j = \lambda_j A^T v_j. \tag{8}$$

Note that each column vector $u_i$ or $v_j$ is undetermined to the extent of an arbitrary
multiplicative constant. Transpose the above equation to obtain

$$v_j^T C = \lambda_j v_j^T A. \tag{9}$$

Premultiply (7) by $v_j^T$ and postmultiply (9) by $u_i$. It follows by subtraction that

$$(\lambda_i - \lambda_j) v_j^T A u_i = 0. \tag{10}$$

Provided $\lambda_i \neq \lambda_j$, the eigenvectors $u_i$ and $v_j$ are orthogonal, with A playing the role of
a weighting matrix. Thus, the two sets of column vectors $u_i$ and $v_j$ are biorthogonal with
respect to A if the corresponding eigenvalues are distinct. An extension to include
repeated eigenvalues has been made by Ma & Caughey (1995).

Based on the above clarification, a biorthogonality relation between the eigenvectors
$u_i$ and $v_j$ holds whether or not there are repeated eigenvalues. Normalization of these
eigenvectors leads to

$$v_j^T A u_i = \delta_{ij}, \quad i, j = 1, 2, \ldots, n. \tag{11}$$

Note that each eigenvector $u_i$ and its adjoint eigenvector $v_i$ are still determined within
an arbitrary multiplicative constant. If the multiplier for $u_i$ is $a_i$ and that for $v_i$ is $b_i$, the
above equation determines only the product $a_i b_i$. In other words, the choice of either $a_i$
or $b_i$ separately is still arbitrary. The above relation implies, in addition, that

$$v_j^T C u_i = \lambda_i \delta_{ij}, \quad i, j = 1, 2, \ldots, n. \tag{12}$$
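
The normalization in (11) and (12) can be illustrated numerically. The sketch below is not the authors' algorithm; it assumes NumPy, a nonsingular A, and distinct real eigenvalues (so that the spectra of the two adjoint problems can be paired simply by sorting). The function name `biorthogonal_modes` is hypothetical, and the matrices A and C are those of the example in § 4.

```python
import numpy as np

def biorthogonal_modes(A, C):
    """Solve C u = lam A u and C^T v = lam A^T v, then scale the adjoint
    vectors so that v_j^T A u_i = delta_ij. Assumes A is nonsingular and
    the eigenvalues are real and distinct."""
    lam_u, U = np.linalg.eig(np.linalg.solve(A, C))        # problem (5)
    lam_v, W = np.linalg.eig(np.linalg.solve(A.T, C.T))    # adjoint problem (6)
    U = U[:, np.argsort(lam_u)]
    W = W[:, np.argsort(lam_v)]
    lam = np.sort(lam_u)
    # Only the product a_i * b_i is fixed by (11): here u_i keeps its scale
    # and v_i absorbs the whole normalization.
    scale = np.einsum("ki,kl,li->i", W, A, U)   # v_i^T A u_i before scaling
    V = (W / scale).T                            # rows of V are the adjoint vectors
    return lam, U, V

A = np.array([[7.0, 1.0, 2.0], [-1.0, 7.0, 0.0], [1.0, -1.0, 6.0]])
C = np.array([[15.7158, 9.0316, 10.7895],
              [-35.0, -17.0, -24.5],
              [27.7368, 11.4737, 19.8421]])

lam, U, V = biorthogonal_modes(A, C)
gram_A = V @ A @ U    # should be close to the identity, as in (11)
gram_C = V @ C @ U    # should be close to diag(lam), as in (12)
```

Rescaling a column of U by any nonzero constant and recomputing V leaves both products unchanged, which reflects the remark that only the product of the two multipliers is determined.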

Define the following square matrices of order n by

$$U = [u_1, u_2, \ldots, u_n], \tag{13}$$

$$V = [v_1, v_2, \ldots, v_n]^T, \tag{14}$$

$$\Lambda = \mathrm{diag}[\lambda_1, \lambda_2, \ldots, \lambda_n]. \tag{15}$$

The biorthogonality relations (11) and (12) can now be expressed in a compact form:

$$VAU = I, \tag{16}$$

$$VCU = \Lambda. \tag{17}$$

Let q = Up. Equation (3) may be simplified to

$$\ddot{p} + \Lambda p = Vf(t), \tag{18}$$

which represents a completely decoupled system. Needless to emphasize, the solution
of a decoupled system is immediate. The following statement has been established.

Theorem 1. A linearized controlled structure that is not degenerate or defective and in 
which B = 0 can always be decoupled by equivalence transformation. 

The decoupling equivalence transformation is defined by two adjoint eigenvalue
problems (5) and (6). It has been proved by Ma & Caughey (1995) that any other
equivalence transformation that decouples system (3) must be derivable from these
eigenvalue problems. As explained earlier, the decoupling transformation is uniquely
determined within arbitrary multiplicative constants in the eigenvectors $u_i$ and $v_j$ if the
associated eigenvalues are distinct. The complex eigenvectors $u_i$ and $v_j$ may be termed
modes and adjoint modes respectively. In the case of repeated eigenvalues, the
decoupling transformation is not unique.

For a symmetric and definite system, (5) takes the form

$$Ku = \lambda Mu, \tag{19}$$

leading to classical normal modes and natural frequencies (Meirovitch 1967). Due to
symmetry of M and K, (5) and (6) are identical. The modes and adjoint modes are equal
and are real. Solution of only the eigenvalue problem (5) is sufficient for determining the
modes. Thus, the decoupling equivalence transformation reduces to classical modal
transformation if the coefficient matrices possess symmetry and definiteness. The method
of equivalence transformation represents a direct extension of classical modal analysis,
and the lack of symmetry in a system only approximately doubles the computational
effort.
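
The reduction to the classical modal transformation can be checked on a small symmetric and definite system. The following is a minimal sketch assuming NumPy; the matrices M and K are hypothetical values chosen only for illustration, with distinct eigenvalues so that the modes are automatically orthogonal with respect to M.

```python
import numpy as np

# Hypothetical symmetric, positive-definite mass and stiffness matrices.
M = np.diag([2.0, 1.0, 3.0])
K = np.array([[4.0, -1.0, 0.0],
              [-1.0, 3.0, -1.0],
              [0.0, -1.0, 2.0]])

lam, U = np.linalg.eig(np.linalg.solve(M, K))          # K u = lam M u, as in (19)
U = U / np.sqrt(np.einsum("ki,kl,li->i", U, M, U))     # scale so u_i^T M u_i = 1
V = np.linalg.inv(M @ U)                               # enforces V M U = I, as in (16)

# For a symmetric and definite system the equivalence transformation
# should collapse to a congruence transformation, i.e. V equals U^T.
is_congruence = np.allclose(V, U.T)
```

Here V is obtained from (16) alone, without solving the adjoint problem, and the comparison with the transpose of U is the numerical counterpart of the statement that modes and adjoint modes coincide.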

The method of equivalence transformations offers physical insight. This can be
briefly demonstrated by clarification of the meaning of natural frequencies of a
linearized controlled structure. According to theorem 1, it is legitimate to write the
equation of an undamped nongyroscopic structure in the form (18). For free vibration,
in which f(t) = 0, the coordinate p admits a harmonic solution if and only if $\lambda_i > 0$
(i = 1, 2, ..., n). Let $\lambda_i = \omega_i^2$, where each $\omega_i$ is real and positive. A component $p_i$ of the
vector p then has the solution

$$p_i = c_i \cos(\omega_i t - \phi_i), \quad i = 1, 2, \ldots, n. \tag{20}$$

The real constants $c_i$ and $\phi_i$ can be identified as amplitude and phase angle. Bearing in
mind that q = Up, one obtains

$$q = \sum_{i=1}^{n} c_i u_i \cos(\omega_i t - \phi_i). \tag{21}$$

Validity of the following statement has therefore been demonstrated.

Theorem 2. The components of a linearized controlled structure in which B = 0 can
perform harmonic vibration with identical frequency if the associated eigenvalue problem
(5) possesses positive eigenvalues.

The natural frequencies of free vibration are simply the square roots of the positive
eigenvalues. It remains to determine if there is any feature peculiar to the free
vibration of a controlled structure. In a symmetric and definite system, the eigenvectors
$u_i$ are real. Thus (21) implies that in harmonic motion with a specific frequency $\omega_i$,
all components also vibrate with identical phase angle $\phi_i$. That need not be true for
a controlled structure, whose eigenvectors $u_i$ are generally complex. As is evident from
(21), these complex eigenvectors induce phase differences among components of the
response vector. Therefore, at a specified frequency $\omega_i$, the components of an undamped
nongyroscopic structure usually vibrate with different phase angles. Amplitudes of
vibration of the components are proportional to the magnitudes of elements in $u_i$. For
this reason, each complex eigenvector $u_i$ still determines a mode shape, in analogy to
classical modal vibration. The general response in free vibration is then the
superposition of n harmonic motions of this kind.
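
As a hedged illustration of (20) and (21), the sketch below computes natural frequencies as square roots of the eigenvalues and checks that the superposed response satisfies the free-vibration form of (3). It assumes NumPy, uses the matrices A and C of the example in § 4 (whose modes happen to be real), and picks the amplitudes, phases and time instant arbitrarily; the function name `free_response` is hypothetical.

```python
import numpy as np

A = np.array([[7.0, 1.0, 2.0], [-1.0, 7.0, 0.0], [1.0, -1.0, 6.0]])
C = np.array([[15.7158, 9.0316, 10.7895],
              [-35.0, -17.0, -24.5],
              [27.7368, 11.4737, 19.8421]])

lam, U = np.linalg.eig(np.linalg.solve(A, C))   # C u = lam A u
omega = np.sqrt(lam)                            # valid here because every lam > 0

def free_response(t, c, phi):
    """Return q(t) from (21) together with its second time derivative."""
    arg = omega * t - phi
    q = U @ (c * np.cos(arg))
    q_ddot = U @ (-(omega ** 2) * c * np.cos(arg))
    return q, q_ddot

c = np.array([1.0, 0.5, 0.25])      # arbitrary amplitudes c_i
phi = np.array([0.0, 0.3, 1.0])     # arbitrary phase angles phi_i
q, q_ddot = free_response(2.0, c, phi)
residual = A @ q_ddot + C @ q       # should vanish up to round-off
```

Each term of the sum contributes a multiple of $C u_i - \lambda_i A u_i$ to the residual, which is why it vanishes for any choice of amplitudes and phases.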


3. General linearized structures 

In classical modal analysis of a viscously damped system, the modal matrix
diagonalizes the mass matrix M and stiffness matrix K but, in general, will not diagonalize the
damping matrix D. Caughey & O’Kelly (1965) showed that D can be diagonalized by
the modal matrix if and only if $M^{-1}D$ and $M^{-1}K$ commute in multiplication. In other
words,

$$DM^{-1}K = KM^{-1}D \tag{22}$$

is a necessary and sufficient condition for the modal transformation to completely
decouple a damped symmetric and definite system. Condition (22) is not usually
satisfied. A common procedure in this case is to ignore the off-diagonal elements in the
transformed damping matrix, especially when these elements are small. This procedure
is termed the decoupling approximation in damping, which allows modal transformation
to decouple an entire system. In addition, an iterative scheme can be used to
improve the accuracy of approximate solution if the off-diagonal elements in the
transformed damping matrix are not small. A discussion of this aspect was given by
Udwadia & Esfandiari (1990) as well as Hwang & Ma (1993). Practically speaking,
there is no objection to using the decoupling approximation in damping. But, as
reviewed by Park et al (1994), rigorous analysis of errors committed by such an
approximation has not been reported in the open literature.
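
Condition (22) is straightforward to test numerically. The sketch below assumes NumPy; the function name `classically_damped`, the tolerance, and the sample matrices are hypothetical. Proportional (Rayleigh-type) damping is used as a case known to satisfy (22), and an arbitrary diagonal damping matrix as a case that does not.

```python
import numpy as np

def classically_damped(M, D, K, tol=1e-10):
    """Check condition (22), D M^{-1} K = K M^{-1} D, up to a relative tolerance."""
    lhs = D @ np.linalg.solve(M, K)
    rhs = K @ np.linalg.solve(M, D)
    return bool(np.linalg.norm(lhs - rhs) <= tol * max(1.0, np.linalg.norm(lhs)))

# Hypothetical symmetric and definite system.
M = np.diag([2.0, 1.0, 3.0])
K = np.array([[4.0, -1.0, 0.0],
              [-1.0, 3.0, -1.0],
              [0.0, -1.0, 2.0]])

D_proportional = 0.1 * M + 0.02 * K      # commutes by construction
D_general = np.diag([0.1, 0.5, 0.2])     # no special structure

ok_proportional = classically_damped(M, D_proportional, K)
ok_general = classically_damped(M, D_general, K)
```

When the check fails, the decoupling approximation in damping described above is the usual fallback.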

A similar situation arises in the analysis of linearized controlled structures. Referring
to (1), an equivalence transformation diagonalizes A and C. As before, let this
equivalence transformation be defined by the two nonsingular matrices U and V.
Recall q = Up. Equation (1) may be expressed in the form

$$\ddot{p} + VBU\dot{p} + \Lambda p = Vf(t). \tag{23}$$

The transformed coefficient matrix of velocity VBU is in general not diagonal. That
means the entire controlled structure is not decoupled by equivalence transformation.
Clearly, an extension to the decoupling approximation in damping may be proposed:
the off-diagonal elements in VBU can be ignored if they are small. Intuitively, the errors
of approximation should be small if the off-diagonal elements in VBU are small. In
addition, an iterative scheme, similar to the one for symmetric and definite systems,
may be applied to improve the accuracy of approximate solution if the off-diagonal
elements in VBU are not small. These approximate techniques have immense practical
implications. However, as in classical modal analysis, these are also points of
speculation. To continue to dwell upon them would merely be distracting.

It is possible to further refine the method of equivalence transformation. The
coefficient matrices A and C in a controlled structure need not have the same divine
rights as the symmetric matrices M and K. Sometimes it is more convenient to apply an
equivalence transformation to diagonalize A and B instead. Afterwards, the entire
structure may be decoupled through approximating the transformed coefficient matrix
of displacement by a diagonal matrix. This digression will not be pursued. Henceforth,
it will be assumed that an equivalence transformation will be chosen primarily to
diagonalize A and C, in a fashion expounded earlier. An upshot at this stage is the
following statement.

Theorem 3. The linearized controlled structure (1) can be decoupled by an equivalence
transformation if and only if the matrices $A^{-1}B$ and $A^{-1}C$ commute in multiplication.

In other words, this extension of criterion (22) asserts that

$$BA^{-1}C = CA^{-1}B \tag{24}$$

is a necessary and sufficient condition for an equivalence transformation to completely
decouple linearized controlled structures. To prove this, it will be easier if condition (24)
is first recast in a more transparent form. From (16), observe that

$$A^{-1} = UV. \tag{25}$$

Condition (24) is therefore equivalent to

$$BUVC = CUVB. \tag{26}$$

Premultiply the above equation by V and postmultiply by U. This gives

$$VBUVCU = VCUVBU. \tag{27}$$

Let S = VBU. It follows, on substitution of (17) into the above equation, that

$$S\Lambda = \Lambda S. \tag{28}$$

Conditions (24) and (28) are equivalent: satisfaction of one implies satisfaction of
the other.

If the equivalence transformation defined by U and V decouples system (1), the 
matrix VBU must be diagonal. Since diagonal matrices commute in multiplication, 
condition (28) is satisfied. Condition (24) is therefore also satisfied. 

On the other hand, assume condition (24) is valid. Satisfaction of condition (28)
implies, on expansion, that

$$(\lambda_i - \lambda_j) s_{ij} = 0, \quad i, j = 1, 2, \ldots, n, \tag{29}$$

where $s_{ij}$ is the ijth element of S. If all eigenvalues $\lambda_i$ are distinct, the above equation
implies that $s_{ij} = 0$ when $i \neq j$. That means S is diagonal, and the equivalence
transformation which diagonalizes A and C decouples the entire system. The case of repeated
eigenvalues can be treated similarly and the observation that S is diagonal remains
valid (Ma & Caughey 1995). This completes the demonstration that condition (24) is
both necessary and sufficient for system (1) to be decoupled by an equivalence
transformation.
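
Theorem 3 can be exercised on a synthetic system built to satisfy condition (24). The following sketch assumes NumPy; the function name `satisfies_condition_24`, the tolerance, and all matrices are hypothetical. The coefficients are generated from prescribed nonsingular U and V and diagonal S and Lambda, so that (16), (17) and S = VBU hold by construction, and a single entry of B is then perturbed to break the condition.

```python
import numpy as np

def satisfies_condition_24(A, B, C, tol=1e-9):
    """Check B A^{-1} C = C A^{-1} B up to a relative tolerance."""
    lhs = B @ np.linalg.solve(A, C)
    rhs = C @ np.linalg.solve(A, B)
    return bool(np.linalg.norm(lhs - rhs) <= tol * max(1.0, np.linalg.norm(lhs)))

# Hypothetical nonsingular transformation matrices (determinants 2 and 3).
U0 = np.array([[1.0, 1.0, 0.0], [0.0, 1.0, 1.0], [1.0, 0.0, 1.0]])
V0 = np.array([[2.0, 0.0, 1.0], [1.0, 1.0, 0.0], [0.0, 1.0, 1.0]])
Lam = np.diag([1.0, 2.0, 3.0])       # distinct eigenvalues
S0 = np.diag([0.1, 0.2, 0.3])

A = np.linalg.inv(U0 @ V0)                          # so that V0 A U0 = I
B = np.linalg.inv(V0) @ S0 @ np.linalg.inv(U0)      # so that V0 B U0 = S0
C = np.linalg.inv(V0) @ Lam @ np.linalg.inv(U0)     # so that V0 C U0 = Lam

B_bad = B.copy()
B_bad[0, 1] += 0.5                                  # perturb one entry of B

# Recover a decoupling transformation from A and C alone and inspect V B U.
lam, U = np.linalg.eig(np.linalg.solve(A, C))
V = np.linalg.inv(A @ U)
S = V @ B @ U
S_off = S - np.diag(np.diag(S))                     # off-diagonal part of S
```

With distinct eigenvalues, the recovered U differs from the prescribed one only by column scaling and ordering, so S is diagonal whenever (24) holds, in line with (29).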








4. Discussion and example 


The exposition of linear nonconservative vibrations by Inman (1983) provides an 
inspiring account for this investigation. This article also draws upon a remarkable 
study by Fawzy & Bishop (1976). There, under more restrictive assumptions, the 
authors attempted to derive orthogonality relations involving all three matrices A, 
B and C. As a result, their orthogonality relations contained the eigenvalues and 
became rather impractical. In view of theorem 3, an equivalence transformation cannot 
be constructed to diagonalize A, B and C in every case. 

There are three practical implications of the method developed herein. First, this
constructive method represents a direct extension of classical modal analysis and only
approximately doubles the computational effort required of modal transformation. While
the method is exact if B = 0, it is necessary to use some type of decoupling approximation
for a general linearized structure. Extensive simulations have indicated that the method is
substantially more efficient than the state-space approach. Reduction in computational
effort, particularly in the case of large-scale systems, is indeed very attractive. Second, this
method appears to possess ample physical insight, more of which has yet to be uncovered.
For instance, in clarification of theorem 2, it has been pointed out that the complex
eigenvectors $u_i$ determine the mode shapes. Third, the method of equivalence
transformations can be used to streamline computational algorithms based upon the
method of weighted residuals. Many numerical techniques in this family of algorithms
generate the same type of equations as (1). As an example, the collocation method leads
to equations resembling those of a controlled structure with B = 0. These numerical
techniques can be greatly streamlined by utilizing equivalence transformations.
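
The decoupling approximation mentioned in the first implication can be sketched as follows. This is an illustrative sketch, not the authors' procedure: it assumes NumPy and a nonsingular, nondefective system, obtains V from (16) as the inverse of AU, and reports how large the discarded off-diagonal part of VBU is relative to the whole matrix. The function name `decoupling_approximation` and the 2 x 2 matrices are hypothetical, and the reported ratio depends on how the eigenvectors are scaled.

```python
import numpy as np

def decoupling_approximation(A, B, C):
    """Return (lam, s_diag, ratio): eigenvalues of C u = lam A u, the retained
    diagonal of S = V B U, and the Frobenius-norm ratio of the discarded
    off-diagonal part of S to S itself."""
    lam, U = np.linalg.eig(np.linalg.solve(A, C))
    V = np.linalg.inv(A @ U)            # gives V A U = I and V C U = diag(lam)
    S = V @ B @ U
    off = S - np.diag(np.diag(S))
    return lam, np.diag(S), np.linalg.norm(off) / np.linalg.norm(S)

# Hypothetical 2-degree-of-freedom system.
A = np.array([[2.0, 0.0], [1.0, 1.0]])
B = np.array([[0.1, 0.0], [0.0, 0.2]])
C = np.array([[3.0, -1.0], [0.5, 2.0]])

lam, s_diag, ratio = decoupling_approximation(A, B, C)
# With B proportional to A the transformed matrix is exactly diagonal.
_, s_prop, ratio_prop = decoupling_approximation(A, 0.3 * A, C)
```

A small ratio suggests that the uncoupled equations, with `s_diag` as the velocity coefficients, may be solved directly, whereas a large ratio points to the iterative correction discussed in § 3.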

As shown by Ma & Caughey (1995), any equivalence transformation that decouples
system (3) must be derivable from the adjoint eigenvalue problems (5) and (6). It follows
that a general theory of decoupling by equivalence transformations has been presented.
Equivalence transformations are already the most general nonsingular linear
transformations. It can therefore be stated that no nonsingular linear transformation will ever
diagonalize A, B and C simultaneously every time. Further research to find universal
decoupling transformations will not be necessary. Equation (23) is the simplest
representation of a linearized controlled structure in Lagrangian coordinates.


Example. A linearized controlled structure, whose equation of motion has the form (1),
is defined by

$$A = \begin{bmatrix} 7 & 1 & 2 \\ -1 & 7 & 0 \\ 1 & -1 & 6 \end{bmatrix}, \tag{30}$$

$$B = \begin{bmatrix} 2.2577 & -0.7016 & -0.0567 \\ 0.7305 & 4.4257 & 0.9294 \\ 1.4657 & 0.8812 & 3.9978 \end{bmatrix}, \tag{31}$$

$$C = \begin{bmatrix} 15.7158 & 9.0316 & 10.7895 \\ -35.0000 & -17.0000 & -24.5000 \\ 27.7368 & 11.4737 & 19.8421 \end{bmatrix}. \tag{32}$$

Solution of two adjoint eigenvalue problems (5) and (6) yields, after normalization,

$$U = \begin{bmatrix} -0.2136 & -0.5502 & -0.7315 \\ 1.0000 & -0.2893 & 0.0614 \\ -0.9580 & 1.0000 & 1.0000 \end{bmatrix}, \tag{33}$$

$$V = \begin{bmatrix} -0.3869 & -0.1728 & -0.1887 \\ -1.2311 & -0.9129 & -0.5196 \\ 0.8396 & 0.7751 & 0.5124 \end{bmatrix}. \tag{34}$$

With this equivalence transformation, the system can be reduced to the form (23),
where

$$\Lambda = \mathrm{diag}[1.9333, 0.2198, 0.0495]. \tag{35}$$

The natural frequencies are simply the square roots of the positive eigenvalues and
are 0.22, 0.47 and 1.39 radians per second. All modes $u_i$, which are columns of U,
are real in this case. Hence, if the coefficient matrix B were absent, all components
of the system could perform harmonic vibration with identical phase angle at each
natural frequency. The system still does not possess classical normal modes because
the eigenvectors $u_i$ are not orthogonal to each other with respect to A. The adjoint
modes $v_i$ constitute the rows of V. In addition to complementing biorthogonality
relations, the role of the adjoint modes is to modify the generalized excitation f(t).

An examination of the transformed coefficient matrix of velocity reveals that

$$VBU = \mathrm{diag}[0.4685, 0.5109, 0.5733], \tag{36}$$

and therefore the system has been completely decoupled. As a result, this system
can be regarded as composed of three independent single-degree-of-freedom systems.
According to theorem 3, condition (24) must be satisfied. This can be verified by
a simple calculation. In most applications, explicit verification of condition (24)
at the start of computation is not advisable. An equivalence transformation should
simply be applied to diagonalize the coefficient matrices A and C. Afterwards, one
can examine the transformed coefficient matrix of velocity VBU to determine if
the use of decoupling approximation would require a corrective scheme. Although
the coefficient matrices A, B and C in this system can be diagonalized by equivalence
transformation, they cannot be diagonalized by similarity transformation. Among
other things, A itself is not diagonalizable by similarity transformation because
there is only one eigenvector $[1, 1, -1]^T$ associated with the repeated eigenvalue 6.
In addition, the coefficient matrices A, B and C cannot be simultaneously reduced
to symmetric forms because A itself is not similar to a real symmetric matrix (Inman
1983). From this discussion, the power and generality of equivalence transformations
are clear.
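
The figures quoted in this example can be checked directly from the tabulated matrices. The sketch below assumes NumPy and uses the four-decimal values of (30) to (34) as printed, so agreement with (16), (35) and (36) is expected only to about three decimal places; the variable names are illustrative.

```python
import numpy as np

A = np.array([[7.0, 1.0, 2.0], [-1.0, 7.0, 0.0], [1.0, -1.0, 6.0]])
B = np.array([[2.2577, -0.7016, -0.0567],
              [0.7305, 4.4257, 0.9294],
              [1.4657, 0.8812, 3.9978]])
C = np.array([[15.7158, 9.0316, 10.7895],
              [-35.0, -17.0, -24.5],
              [27.7368, 11.4737, 19.8421]])
U = np.array([[-0.2136, -0.5502, -0.7315],
              [1.0, -0.2893, 0.0614],
              [-0.9580, 1.0, 1.0]])
V = np.array([[-0.3869, -0.1728, -0.1887],
              [-1.2311, -0.9129, -0.5196],
              [0.8396, 0.7751, 0.5124]])

VAU = V @ A @ U      # compare with the identity, relation (16)
VCU = V @ C @ U      # compare with Lambda in (35)
VBU = V @ B @ U      # compare with the diagonal matrix in (36)

# Relative violation of condition (24) for the rounded data.
lhs = B @ np.linalg.solve(A, C)
rhs = C @ np.linalg.solve(A, B)
rel_violation = np.linalg.norm(lhs - rhs) / np.linalg.norm(lhs)

# A has the repeated eigenvalue 6 with eigenvector [1, 1, -1]^T.
check_eigvec = A @ np.array([1.0, 1.0, -1.0])
```

The small residual off-diagonal entries reflect rounding of the printed data rather than any coupling in the system.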


5. Conclusions 

A time-honoured procedure for the approximate solution of nonlinear mechanical
systems is linearization, or in some cases piecewise linearization. Development
of an efficient method for analyzing and interpreting the equations of motion of
linearized controlled structures is timely due to increasing use of control devices
in structures in recent years. The method expounded in this article represents a direct
extension of classical modal analysis. This constructive method utilizes equivalence
transformations in Lagrangian coordinates, and does not require conversion of the
equations of motion to first-order forms. It is assumed, without practical loss of
generality, that a linearized structure is not degenerate or defective. That means the
coefficient of acceleration A is nonsingular, and the eigenvalue problem (5) possesses
a full complement of independent eigenvectors. The major results, summarized in the
following, are applicable for any generalized excitation f(t).

(1) A controlled structure in which B = 0 can always be decoupled by equivalence
transformation. Compared with classical modal transformation, the lack of symmetry
in a structure only approximately doubles the computational effort.

(2) In free vibration, all components of an undamped nongyroscopic structure can
perform harmonic vibration with identical frequency if the associated eigenvalue
problem (5) possesses positive eigenvalues. The natural frequencies are simply the
square roots of these positive eigenvalues, and the mode shapes can be determined
from the corresponding complex eigenvectors. Unlike classical modal vibration,
the system components generally vibrate with different phase angles.

(3) The linearized controlled structure (1) can be decoupled by an equivalence
transformation if and only if the matrices $A^{-1}B$ and $A^{-1}C$ commute in multiplication.
Similar to classical modal analysis of a viscously damped system, it is often
necessary to draw upon some type of decoupling approximation for fast solution.

Compared with the state-space approach, the method expounded herein offers
substantial reduction in computational effort and ample physical insight. The method
of equivalence transformations is applicable to nonsymmetric systems of any order,
just as classical modal analysis is applicable to symmetric systems of arbitrary order.
The tremendous power of equivalence transformation reflects its role as mathematically
the most general nonsingular linear transformation. In addition, many numerical
techniques involve equations resembling those of linearized controlled structures.
These numerical algorithms will economize on both core memory and computing
time if the method of equivalence transformations is incorporated. Among other things,
it is hoped that the present paper will point to directions along which further
research efforts can be profitably made. It appears feasible, for example, to examine
stability and other qualitative features of a controlled structure with equivalence
transformations. Analysis of errors committed by the application of decoupling
approximation to a controlled structure is also worthwhile in a subsequent course
of investigation.


On refined nonlinear theories of laminated composite structures
with piezoelectric laminae

J N REDDY and J A MITCHELL 

Department of Mechanical Engineering, Texas A & M University, College 
Station, TX 78843-3123, USA 

Abstract. In this paper geometrically nonlinear theories of laminated
composite plates with piezoelectric laminae are developed. The formulations
are based on thermopiezoelectricity, and include the coupling between
mechanical deformations, temperature changes, and electric displacements.
Two different theories are presented: one based on an equivalent-single-layer
third-order theory and the other based on the layerwise theory, both
of which were developed by the senior author for composite laminates
without piezoelectric laminae. In the present study, they are extended to
include piezoelectric laminae. In both theories, the electric field is expanded
layerwise through the laminate thickness. The dynamic version of the
principle of virtual displacements (or Hamilton’s principle) is used to derive
the equations of motion and associated boundary conditions of the two
theories. These theories may be used to accurately determine the response
of laminated plate structures with piezoelectric laminae and subjected to
thermomechanical loadings.

Keywords. Refined nonlinear theory; laminated composite structures; 
piezoelectric laminae; Hamilton’s principle; thermomechanical loading. 



1. Introduction 

The study of embedded or surface mounted piezoelectric materials (e.g., conventional
ferroelectric polycrystals in the form of ceramics, natural crystals, and polyvinylidene
fluoride (PVDF)) in the active control of structures has received considerable attention in
recent years. One reason for this is the possibility of creating certain types of structures
and systems that can be adapted to or corrected for changing operating conditions
using piezoelectric materials. The advantage of incorporating these special types of
materials into a structure is that the sensing and actuating mechanism becomes
part of the structure. These types of mechanisms are referred to as strain sensing
and actuating (SSA). This advantage is especially apparent for structures which
are deployed in space. Generally, space-borne structures are very flexible because
they are not designed for operations in which the force of gravity is present. Most
actuator systems other than the strain induced type add considerable weight and
possibly space to the structure and thereby change its mechanical properties
significantly.

A laminated structure with piezoelectric laminae receives actuation through an 
applied electric field, and the piezoelectric laminae send electric signals that are used to 
measure the motion or deformation of the laminate. In these problems, the electric field 
that is applied to actuate a structure provides an additional body force, which couples 
the stress analysis problem to the electrostatic problem. This coupling is similar to 
a temperature field inducing a body force that couples the mechanical problem to the 
thermal problem. 

The first nonlinear formulation of an electrically polarized elastic continuum is due
to Toupin (1956). Following this work, many authors investigated the behaviour of continua in
the presence of electric and/or magnetic fields (Eringen 1963; Tiersten 1964, 1971, 1981;
Nelson 1978). Recently, Tiersten (1993) studied electroded plates undergoing large
electric fields. In these works, thin, single-layer plates were considered with small
strains and quadratic electric fields. A quadratic approximation of the mechanical
displacements and a cubic approximation of the electric potential through the plate
thickness were used to develop the plate theory.

In order to utilize the strain sensing and actuating properties of piezoelectric
materials, the interaction between the structure and the SSA material must be well
understood. Mechanical models for studying the interaction of piezoelectric patches
that are surface mounted to beams have been developed by Crawley & de Luis (1987)
and Chandra & Chopra (1993). The SSA lamina can offer both a discrete effect similar
to patches as well as a distributed effect. Lee (1990) developed a theory for laminated
piezoelectric plates using the classical laminate theory. In this theory, the linear
piezoelectric constitutive equations were the only source of coupling between the
electric field and the mechanical displacement field. Pai et al (1993) presented a
geometrically nonlinear plate theory for the analysis of composite plates with distributed
piezoelectric lamina. By using a layerwise type displacement similar to that proposed
by Reddy (1987), they have enforced continuity of both displacements and interlaminar
stresses. However, their model does not include the charge equations of electrostatics.
In contrast, Tiersten (1969, 1993) modelled single-layer piezoelectric plates using the
charge equations, but did not study laminated plates. Tzou & Zhong (1993)
developed governing equations for piezoelectric shells using the first-order shear
deformation theory, and they included the charge equations of electrostatics. From
these equations, classical and first-order shear deformation plate theories were derived
for single-layer piezoelectric laminae.

In the present study, two different formulations are developed to study the extensional
and flexural motions of plates laminated with orthotropic layers and piezoelectric
laminae. In the first model an equivalent single-layer third-order theory (Reddy
1984, 1995) is used, and in the second one the layerwise theory (Reddy 1987) is used 
to approximate the displacement field. In both models, a layerwise expansion is used 
for the electric field. In addition, the formulations include geometric nonlinearity 
through the von Karman strains. Variational statements, which are useful in the 
development of finite element models, and the governing equations of the two models 
for piezoelectroelasticity are developed from basic principles. Recently, the authors 
have used the model based on the third-order theory but with linear strains to study the 
bending and vibration of composite laminates with piezoelectric laminae (Mitchell & 
Reddy 1994).
2. Equations of thermoelectroelasticity

2.1 Thermodynamic principles 


There are two principles of thermodynamics: the first law of thermodynamics and the 
second law of thermodynamics. The first law of thermodynamics, also known as the 
principle of conservation of energy, states that the time rate of change of the total 
energy is equal to the sum of the rate of work done by applied forces and the change of 
heat content per unit time. The second law of thermodynamics places restrictions on 
the interconvertibility of heat and work done. For irreversible heat transfer, the second 
law states that the entropy production is positive. 

The principle of conservation of energy leads to the well-known heat conduction 
equation (see Reddy & Gartling 1994) 


$$\rho c_v \frac{dT}{dt} = -\nabla\cdot\mathbf{q} + \rho r + \boldsymbol{\sigma}:\nabla\mathbf{v}, \tag{1}$$


where $T$ is the temperature, $\rho$ is the density, $\mathbf{v}$ is the velocity vector, $\mathbf{q}$ is the heat
flux vector, $r$ is the internal heat generation per unit mass, $c_v$ is the specific heat at
constant volume or constant strain, and $\boldsymbol{\sigma}$ is the stress tensor. Equation (1) is used to
determine the temperature distribution in the body. The viscous dissipation $\boldsymbol{\sigma}:\nabla\mathbf{v}$ couples the thermal
problem to the stress problem. Even when the viscous dissipation is neglected, the 
thermal problem is coupled to the stress problem through stress-strain relations, as 
explained in the next section. 

The thermal problem for the solid requires the temperature or the heat flux to be 
specified on all parts of the boundary enclosing the heat transfer region: 

$$T = \hat{T}(s,t) \quad \text{on } \Gamma_T, \tag{2}$$

$$\mathbf{n}\cdot\mathbf{q} + h_c (T - T_c) = \hat{q}_n(s,t) \quad \text{on } \Gamma_q, \tag{3}$$

where $\Gamma$ is the total boundary enclosing the heat transfer region, $\Gamma = \Gamma_T \cup \Gamma_q$, $h_c$ is the
convective heat transfer coefficient, $T_c$ is a reference (or sink) temperature for convective
transfer, $\hat{q}_n$ is the specified boundary flux, and $\mathbf{n}$ is the unit normal to the boundary.


2.2 Thermoelastic constitutive relations 


The thermoelastic problem is governed by the strain-displacement equations, equations
of motion, thermodynamic equations (1)-(3), and constitutive equations. The
constitutive equation of the thermal problem is given by the well-known Fourier's heat
conduction law (Carslaw & Jaeger 1959, Nowinski 1978, Reddy & Gartling 1994)

$$\mathbf{q} = -\mathbf{k}\cdot\nabla T, \qquad q_i = -k_{ij}\,\frac{\partial T}{\partial x_j}, \tag{4}$$

where k denotes the thermal conductivity tensor of order two. 
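
As a small illustration of Fourier's law (4), the sketch below evaluates $q_i = -k_{ij}\,\partial T/\partial x_j$ for a two-dimensional anisotropic conductivity tensor. The function name `heat_flux` and the numerical values are hypothetical and chosen only to show that, for an anisotropic material, the flux need not be parallel to the temperature gradient.

```python
def heat_flux(k, grad_T):
    """Return q_i = -sum_j k[i][j] * grad_T[j] (Fourier's law, eq. (4)).

    k      : conductivity tensor as a list of rows (hypothetical values)
    grad_T : temperature gradient components dT/dx_j
    """
    return [-sum(k_ij * g_j for k_ij, g_j in zip(row, grad_T)) for row in k]


# Hypothetical anisotropic conductivity and a gradient along x only.
k = [[2.0, 0.5],
     [0.5, 1.0]]
q = heat_flux(k, [1.0, 0.0])
# The off-diagonal term k_12 produces a flux component along y as well.
print(q)
```

With a diagonal (orthotropic, axis-aligned) tensor the second flux component would vanish for this gradient.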

The constitutive equations of thermoelasticity are derived by assuming the existence
of the free-energy function, $\Psi = \Psi(\varepsilon_{ij}, T)$, of the form

$$\Psi(\varepsilon_{ij}, T) = \tfrac{1}{2}\,C_{ijkl}\,\varepsilon_{ij}\,\varepsilon_{kl} - \beta_{ij}\,\varepsilon_{ij}\,\theta + \tfrac{1}{2}\,m\,\theta^2, \tag{5}$$

such that (see Carslaw & Jaeger 1959; Nowinski 1978)

$$\sigma_{ij} = \frac{\partial \Psi}{\partial \varepsilon_{ij}} = C_{ijkl}\,\varepsilon_{kl} - \beta_{ij}\,\theta, \tag{6}$$

where $\theta = T - T_0$, $T_0$ is the reference temperature, and $m$ and $\beta_{ij}$ are material coefficients.
Equation (6) is known as the Duhamel-Neumann law for an anisotropic body.
Inverting relations (6), we obtain

$$\varepsilon_{ij} = S_{ijkl}\,\sigma_{kl} + \alpha_{ij}\,\theta, \tag{7}$$

where $S_{ijkl}$ are the elastic compliances, and $\alpha_{ij}$ are the thermal coefficients of expansion
related to $\beta_{ij}$ by $\beta_{ij} = C_{ijkl}\,\alpha_{kl}$.
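
A one-dimensional specialization may help fix ideas. Assuming a single stress and strain component, so that $C \to E$ (Young's modulus), $S \to 1/E$ and $\beta = E\alpha$, the Duhamel-Neumann law and its inverse can be sketched as follows. The function names and material values are hypothetical.

```python
def stress_1d(strain, theta, E, alpha):
    """1-D Duhamel-Neumann law: sigma = E*strain - beta*theta, with beta = E*alpha."""
    beta = E * alpha
    return E * strain - beta * theta


def strain_1d(stress, theta, E, alpha):
    """Inverse relation: strain = stress/E + alpha*theta."""
    return stress / E + alpha * theta


# Hypothetical values: fully constrained bar (strain = 0) heated by theta = 5.
E, alpha, theta = 100.0, 0.01, 5.0
sigma = stress_1d(0.0, theta, E, alpha)      # compressive thermal stress
free = strain_1d(0.0, theta, E, alpha)       # free thermal expansion
print(sigma, free)
```

The two functions are inverses of each other: feeding the constrained-bar stress back into `strain_1d` recovers zero strain.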

2.3 Constitutive relations of hygrothermal elasticity 

Temperature and moisture concentration in fibre-reinforced composites cause reductions
of both strength and stiffness (Jost 1952; Shen & Springer 1976; Browning et al
1977). Therefore, it is important to determine the temperature and moisture concentration
in composite laminates under given initial and boundary conditions. As described
in the previous section, the heat conduction problem described by (1)-(4) can be used to
determine the temperature field.

The moisture concentration problem is mathematically similar to the heat transfer 
problem. The moisture concentration, c, in a solid is described by Fick’s second law: 

$$\frac{\partial c}{\partial t} = -\nabla\cdot\mathbf{q}_f + \phi_f, \tag{8a}$$

$$\mathbf{q}_f = -\mathbf{D}\cdot\nabla c, \tag{8b}$$

where $\mathbf{D}$ denotes the mass diffusivity tensor of order two, $\mathbf{q}_f$ is the flux vector, and $\phi_f$ is
the moisture source in the domain. The negative sign in (8b) indicates that moisture
seeps from higher concentration to lower concentration. The boundary conditions
involve specifying the moisture concentration or the flux:

$$c = \hat{c}(s,t) \quad \text{on } \Gamma_1, \tag{9a}$$

$$\mathbf{n}\cdot\mathbf{q}_f = \hat{q}_f(s,t) \quad \text{on } \Gamma_2, \tag{9b}$$

where $\Gamma = \Gamma_1 \cup \Gamma_2$ and quantities with a hat are specified functions on the respective
boundaries.

The moisture-induced strains $\{\varepsilon\}^M$ are given by

$$\{\varepsilon\}^M = \{\alpha\}^M c, \tag{10}$$

where $\{\alpha\}^M$ is the vector of coefficients of hygroscopic expansion. Thus the hygroscopic
strains have the same form as the thermal strains [see (7)]. The total strains are given by

$$\{\varepsilon\} = [S]\{\sigma\} + \{\alpha\}^T (T - T_0) + \{\alpha\}^M (c - c_0), \tag{11}$$

where $T_0$ and $c_0$ are reference values from which the strains and stresses are measured.
In view of the similarity between the thermal and moisture strains, we will use only 
thermal strains to show their contribution to governing equations. 
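
To make the analogy with heat conduction concrete, the following sketch integrates the moisture problem (8a)-(8b) in one dimension with a constant scalar diffusivity, no source term, and prescribed concentrations at both ends as in (9a). The explicit finite-difference scheme, the function name `diffuse_1d` and all numerical values are assumptions made for illustration; they are not part of the formulation above.

```python
def diffuse_1d(c, D, dx, dt, steps):
    """Explicit finite-difference update of dc/dt = D * d2c/dx2.

    c     : initial nodal concentrations; the end values are held fixed (eq. (9a))
    D     : scalar diffusivity (assumed constant)
    dx,dt : grid spacing and time step; stability needs D*dt/dx**2 <= 0.5
    """
    r = D * dt / dx**2
    if r > 0.5:
        raise ValueError("explicit scheme unstable: D*dt/dx**2 must be <= 0.5")
    c = list(c)
    for _ in range(steps):
        new = c[:]
        for i in range(1, len(c) - 1):
            new[i] = c[i] + r * (c[i + 1] - 2.0 * c[i] + c[i - 1])
        c = new
    return c


# Hypothetical slab: wet face (c = 1) on the left, dry face (c = 0) on the right.
profile = diffuse_1d([1.0, 0.0, 0.0, 0.0, 0.0], D=1.0, dx=1.0, dt=0.4, steps=200)
print(profile)  # approaches the linear steady-state profile
```

After enough steps the profile tends to the linear steady state between the two prescribed boundary values, which is the expected solution when the source term vanishes.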

2.4 Electroelasticity 

Electroelasticity deals with the phenomena caused by interactions between electric and
mechanical fields. The piezoelectric effect is a linear phenomenon of electroelasticity,
and it is concerned with the effect of the electric field and polarization on the
deformation (Nye 1957; Penfield & Hermann 1967; Tiersten 1969; Reddy & Gartling
1994). The effect is described by the polarization vector $\mathbf{P}$, which represents the electric
moment per unit volume or polarization charge per unit area. The relation between the
stress tensor and polarization vector is known as the direct effect, and it is given by

$$\mathbf{P} = \mathbf{d}:\boldsymbol{\sigma} \quad \text{or} \quad P_i = d_{ijk}\,\sigma_{jk}, \tag{12}$$

where $\mathbf{d}$ is the third-order tensor of piezoelectric moduli, $\boldsymbol{\sigma}$ is the stress tensor, and $d_{ijk}$
and $\sigma_{jk}$ denote their rectangular Cartesian components. The converse effect relates the
electric field vector $\mathbf{E}$ to the linear strain tensor $\boldsymbol{\varepsilon}$ by

$$\boldsymbol{\varepsilon} = \mathbf{E}\cdot\mathbf{d} \quad \text{or} \quad \varepsilon_{ij} = d_{kij}\,E_k. \tag{13}$$

Note that $d_{kij}$ is symmetric with respect to indices $i$ and $j$ because of the symmetry of the
strain components $\varepsilon_{ij}$.
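
The index contractions in the direct and converse effects can be spelled out in a few lines. The sketch below stores $d_{ijk}$ as a nested list with the electrical index first and evaluates $P_i = d_{ijk}\sigma_{jk}$ and $\varepsilon_{ij} = d_{kij}E_k$. The moduli used ($d_{333}$, $d_{311}$, $d_{322}$) are hypothetical values of a typical order of magnitude, not data for any particular material.

```python
def direct_effect(d, sigma):
    """Polarization P_i = d[i][j][k] * sigma[j][k] (direct effect)."""
    return [sum(d[i][j][k] * sigma[j][k] for j in range(3) for k in range(3))
            for i in range(3)]


def converse_effect(d, E):
    """Strain eps_ij = d[k][i][j] * E[k] (converse effect)."""
    return [[sum(d[k][i][j] * E[k] for k in range(3)) for j in range(3)]
            for i in range(3)]


# Hypothetical moduli (C/N): only d_333, d_311 and d_322 are nonzero here.
d = [[[0.0] * 3 for _ in range(3)] for _ in range(3)]
d[2][2][2] = 300e-12
d[2][0][0] = d[2][1][1] = -150e-12

sigma = [[0.0] * 3 for _ in range(3)]
sigma[2][2] = 1.0e6                       # 1 MPa through-thickness stress
P = direct_effect(d, sigma)               # polarization along the poling axis

eps = converse_effect(d, [0.0, 0.0, 1.0e6])   # field of 1 MV/m along the poling axis
print(P[2], eps[2][2], eps[0][0])
```

The same tensor serves both effects, which is why the strain produced by a through-thickness field has the sign pattern of $d_{3ij}$.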

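The coupling between the mechanical, thermal, and electrical fields can be established
using thermodynamic principles and Maxwell's relations. Analogous to the
strain energy density function $U_0$ for elasticity and the free-energy density function $\Psi_0$
for thermoelasticity, we assume the existence of a density function $\Phi_0$

$$\Phi_0(\varepsilon_{ij}, E_i, T) = U_0 - \mathbf{E}\cdot\mathbf{D} - \eta T
= \tfrac{1}{2}\,C_{ijkl}\,\varepsilon_{ij}\,\varepsilon_{kl} - e_{kij}\,\varepsilon_{ij}\,E_k - \beta_{ij}\,\varepsilon_{ij}\,\theta
- \tfrac{1}{2}\,\epsilon_{kl}\,E_k\,E_l - \frac{\rho c_v}{2 T_0}\,\theta^2, \tag{14}$$

called the electric Gibbs function or enthalpy function, such that

$$\sigma_{ij} = \frac{\partial \Phi_0}{\partial \varepsilon_{ij}}, \qquad
D_i = -\frac{\partial \Phi_0}{\partial E_i}, \qquad
\eta = -\frac{\partial \Phi_0}{\partial T} = -\frac{\partial \Phi_0}{\partial \theta}, \tag{15}$$

where $D_i$ are the components of the electric displacement vector, and $\eta$ is the entropy
(which is governed by the second law of thermodynamics). Use of (14) in (15) gives the
linear constitutive equations of a piezoelectric medium:

$$\sigma_{ij} = C_{ijkl}\,\varepsilon_{kl} - e_{kij}\,E_k - \beta_{ij}\,\theta, \tag{16a}$$

$$D_k = e_{kij}\,\varepsilon_{ij} + \epsilon_{kl}\,E_l, \tag{16b}$$

$$\eta = \beta_{ij}\,\varepsilon_{ij} + \frac{\rho c_v}{T_0}\,\theta, \tag{16c}$$

where $C_{ijkl}$ are the elastic moduli, $e_{kij}$ are the piezoelectric moduli, $\epsilon_{kl}$ are the dielectric
constants, $\beta_{ij}$ are the temperature expansion coefficients, $c_v$ is the specific heat per unit
mass, and $T_0$ is the reference temperature. In single subscript notation, (16) can be
expressed as

$$\sigma_i = C_{ij}\,\varepsilon_j - e_{ji}\,E_j - \beta_i\,\theta, \tag{17a}$$

$$D_k = e_{ki}\,\varepsilon_i + \epsilon_{kl}\,E_l, \tag{17b}$$

$$\eta = \beta_i\,\varepsilon_i + \frac{\rho c_v}{T_0}\,\theta. \tag{17c}$$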
Equations (17) provide linear constitutive relations for the problem at hand. For
a general anisotropic material, there are 21 independent elastic constants, 18 piezoelectric
constants, 6 dielectric constants, and 6 thermal expansion coefficients.
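
The counts quoted above follow from the index symmetries of the coefficient tensors. The sketch below enumerates the distinct components by brute force, assuming the usual symmetries ($C_{ijkl} = C_{jikl} = C_{ijlk} = C_{klij}$, $e_{kij} = e_{kji}$, $\epsilon_{kl} = \epsilon_{lk}$, $\beta_{ij} = \beta_{ji}$) and one common ordering of the single-subscript (Voigt) indices. The helper names are hypothetical.

```python
from itertools import product

# One common single-subscript ordering: 11->1, 22->2, 33->3, 23->4, 13->5, 12->6
# (stored zero-based here).
_VOIGT = {(0, 0): 0, (1, 1): 1, (2, 2): 2, (1, 2): 3, (0, 2): 4, (0, 1): 5}


def voigt(i, j):
    """Map a symmetric index pair (i, j) to a single subscript."""
    return _VOIGT[(min(i, j), max(i, j))]


def count_independent():
    """Count distinct components of C, e, epsilon and beta under their symmetries."""
    idx = range(3)
    elastic = {tuple(sorted((voigt(i, j), voigt(k, l))))
               for i, j, k, l in product(idx, repeat=4)}
    piezo = {(k, voigt(i, j)) for k, i, j in product(idx, repeat=3)}
    dielectric = {tuple(sorted((k, l))) for k, l in product(idx, repeat=2)}
    thermal = {voigt(i, j) for i, j in product(idx, repeat=2)}
    return len(elastic), len(piezo), len(dielectric), len(thermal)


print(count_independent())
```

The elastic count corresponds to a symmetric 6 x 6 matrix, the piezoelectric count to a full 3 x 6 matrix, and the dielectric count to a symmetric 3 x 3 matrix.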

In the following derivations, we account for thermal (and hence, moisture) and
piezoelectric effects only with the understanding that the material properties are
independent of temperature and electric field, and that the temperature $T$ is a known
function of position (hence, its variation is zero: $\delta T = 0$). Thus temperature enters the
formulation as a known function through constitutive equations (17). However, since
we are primarily interested in developing the coupled equations of piezoelectricity, the
electric field $\mathbf{E}$ will be treated as a dependent variable (i.e., variable subject to variation).
It can be shown that for electrostatics, the electric field $\mathbf{E}$ is derivable from a scalar
potential function $\psi$

$$\mathbf{E} = -\nabla\psi. \tag{18}$$

2.5 Approximation of electric field potential 

In the interest of representing the electric potential more accurately through the
thickness of the laminate, it is approximated using the layerwise expansion (see
Nowinski 1978; Reddy 1987; Robbins & Reddy 1993)

$$\psi(x,y,z,t) = \sum_{j=1}^{M} \psi_j(x,y,t)\,\Phi^j(z), \tag{19}$$

where $\psi_j(x,y,t)$ denote the nodal values of $\psi$, $M$ is the number of nodes through the
thickness, and $\Phi^j$ are the global Lagrange interpolation functions for the discretization
of $\psi$ through the thickness (see Reddy 1993; Reddy & Gartling 1994). This is equivalent to
modelling the through-the-thickness variation of $\psi(x,y,z,t)$ with one-dimensional finite
elements (see figure 1). Non-piezoelectric laminae can be modelled by simply setting
the piezoelectric constants to zero and retaining the dielectric permittivity constants
if necessary. In this way it is possible to model arbitrary applied potential loading
conditions as well as more than one material type, including piezoelectric material.
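
A minimal sketch of the layerwise expansion at a fixed in-plane point is given below, assuming linear (two-node) interpolation between thickness nodes. It evaluates the potential and the through-thickness field component $E_z = -\partial\psi/\partial z$ implied by $\mathbf{E} = -\nabla\psi$. The function name `potential_and_field`, the node coordinates and the nodal potentials are hypothetical.

```python
from bisect import bisect_right


def potential_and_field(z_nodes, psi_nodes, z):
    """Evaluate psi(z) = sum_j psi_j * Phi^j(z) and E_z = -dpsi/dz.

    z_nodes   : increasing thickness coordinates of the nodes
    psi_nodes : nodal values of the electric potential at a fixed (x, y, t)
    Linear Lagrange (hat) functions are assumed between adjacent nodes.
    """
    if not z_nodes[0] <= z <= z_nodes[-1]:
        raise ValueError("z lies outside the laminate")
    # Index of the numerical layer containing z (the top node belongs to the last layer).
    k = min(bisect_right(z_nodes, z) - 1, len(z_nodes) - 2)
    h = z_nodes[k + 1] - z_nodes[k]
    xi = (z - z_nodes[k]) / h
    psi = (1.0 - xi) * psi_nodes[k] + xi * psi_nodes[k + 1]
    E_z = -(psi_nodes[k + 1] - psi_nodes[k]) / h
    return psi, E_z


# Hypothetical two-layer laminate: passive bottom layer, 100 V across the top layer.
z_nodes = [-1.0, 0.0, 1.0]
psi_nodes = [0.0, 0.0, 100.0]
print(potential_and_field(z_nodes, psi_nodes, 0.5))
print(potential_and_field(z_nodes, psi_nodes, -0.5))
```

With linear interpolation the field is constant within each numerical layer; adding nodes or using higher-order interpolation refines the through-thickness variation.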



Figure 1. Layerwise approximation of a field variable, u, in a laminated plate. 
3. The third-order theory

3.1 Preliminary comments 

The equivalent single-layer (ESL) laminate theories are those in which a heterogeneous
laminated plate is treated as a statically equivalent, single layer having a (possibly) complex 
constitutive behavior, reducing the 3-D continuum problem to a 2-D problem. The ESL 
theories are developed by assuming the form of the displacement field as a linear 
combination of unknown functions and the thickness coordinate (Mitchell & Reddy 1994). 
Then the governing equations are determined using the principle of virtual displacements 
(or its dynamic version, when time dependency is to be included). For plate structures, 
laminated or not, the integration over the domain of the plate is represented, because of 
the explicit nature of the assumed displacement field in the thickness coordinate, as the 
(tensor) product of integration over the plane of the plate and integration over the thickness 
of the plate (volume integral = integral over the plane × integral over the thickness). Since
all functions of the theory are explicit functions of the thickness coordinate, the integration 
over plate thickness is carried out explicitly, reducing the problem to a two-dimensional 
one. Consequently, the Euler-Lagrange equations will consist of differential relations 
involving the dependent variables (or generalized displacements) and thickness-averaged 
stress resultants. The resultants can be written in terms of the generalized displacements 
with the help of the assumed constitutive equations and kinematic relations. More 
complete development of this procedure is presented in the coming sections. 

The classical and first-order theories of composite laminates are the most commonly 
used ones. The classical laminate theory does not account for the transverse shear 
deformation, while the first-order theory accounts for transverse shear strains as 
constant through the thickness and hence requires shear correction factors. Second- and
higher-order ESL laminated plate theories use higher-order polynomials in the expansion
of the displacement components through the thickness of the laminate. The
third-order theory of Reddy accounts for quadratic variation of transverse shear stresses
through the thickness, and thus does not require shear correction factors.

3.2 Displacements and strains 

Consider a plate of total thickness $h$ composed of $N$ orthotropic layers, some of which
may be piezoelectric, with the principal material coordinates $(x_1^k, x_2^k, x_3^k)$ of the $k$-th
lamina oriented at an angle $\theta_k$ to the laminate coordinate, $x$ (see figure 2). The $xy$-plane of
the plate coincides with the undeformed midplane $\Omega_0$ of the laminate. The $z$-axis is taken
positive upward from the midplane. The $k$-th layer is located between the points $z = z_k$ and
$z = z_{k+1}$ in the thickness direction. The total domain $\bar{\Omega}$ of the laminate is the tensor product
$\Omega_0 \times (-h/2, h/2)$. Different parts of the boundary $\bar{\Gamma} = \Gamma \times (-h/2, h/2)$ of $\bar{\Omega}$ are subjected
to different generalized forces or generalized displacements. Here $\Gamma$ denotes the boundary
of $\Omega_0$, the midplane of the laminate. A discussion of the boundary conditions is given later.

Figure 2. Schematic of a laminated plate with piezoelectric laminae (label: active layers with electrode).

The total displacement components $(u, v, w)$ along the coordinates $(x, y, z)$ are
approximated in the third-order laminate theory of Reddy (1984, 1995) as

$$u = u_0 + z\left(\varphi_x - c_0\,\frac{\partial w_0}{\partial x}\right) - c_1 z^3 \varphi_x,$$

$$v = v_0 + z\left(\varphi_y - c_0\,\frac{\partial w_0}{\partial y}\right) - c_1 z^3 \varphi_y,$$

$$w = w_0, \tag{20}$$

where $u_0$, $v_0$, $w_0$, $\varphi_x$, and $\varphi_y$ are unknown functions of position $(x, y)$ and time $t$, and
parameters $c_0$ and $c_1$ are introduced to identify the classical laminate theory ($c_0 = 1$ and
$\varphi_x = \varphi_y = 0$) or the first-order laminate theory ($c_0 = c_1 = 0$) as special cases of the present
theory. Note that $(u_0, v_0, w_0)$ denote the displacements of a material point at $(x, y, 0)$ along
the three coordinates, and $(\varphi_x - \partial w_0/\partial x)$ and $(\varphi_y - \partial w_0/\partial y)$ denote rotations of
transverse normals at $z = 0$ about the $y$ and $x$ axes, respectively.

The von Karman strains (see Reddy 1995) associated with the displacement field (20) are 



It is easy to verify that the transverse shear strains $\varepsilon_4$ and $\varepsilon_5$ are zero at the upper and
lower surfaces of the plate, and that they vary quadratically through the thickness.
Although this particular approximation was developed for laminated plates without
piezoelectric laminae, it also serves well when piezoelectric laminae are present since
contributions to the shear stress on the major surfaces due to piezoelectric laminae are
derived from the gradient of the potential function. Note that only the tangential 
component of the electric field is required to be continuous across boundaries having 
material discontinuities. A plate having piezoelectric laminae on its outer surfaces covered 
with a thin electrode will not have any component of inplane electric field on these surfaces 
because the electric field is zero inside the conductor. Further, since transverse shear 
stresses due to piezoelectric effects result from components of the inplane electric field (this 
depends upon the particular form of the constitutive relations), it follows that shear stresses 
on the free surfaces are only due to mechanical effects. 
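
The vanishing of the transverse shear strain on the plate faces can be checked numerically. The sketch assumes the displacement field $u = u_0 + z(\varphi_x - c_0\,\partial w_0/\partial x) - c_1 z^3\varphi_x$, $w = w_0$, for which the linear strain is $\gamma_{xz} = \partial u/\partial z + \partial w/\partial x = (1 - 3c_1 z^2)\varphi_x + (1 - c_0)\,\partial w_0/\partial x$. The value $c_1 = 4/(3h^2)$ is the standard choice in Reddy's third-order theory and is stated here as an assumption, since it does not appear explicitly in the text above. The function name is hypothetical.

```python
def shear_strain_xz(z, phi_x, dw0_dx, c0, c1):
    """gamma_xz = du/dz + dw/dx for the displacement field with tracers c0, c1."""
    return (1.0 - 3.0 * c1 * z * z) * phi_x + (1.0 - c0) * dw0_dx


h = 2.0                      # hypothetical plate thickness
phi_x, dw0_dx = 0.01, 0.02   # hypothetical generalized displacements at a point

# Third-order theory (assumed c0 = 1, c1 = 4/(3 h^2)): quadratic, zero on both faces.
c1 = 4.0 / (3.0 * h * h)
third = [shear_strain_xz(z, phi_x, dw0_dx, 1.0, c1) for z in (-h / 2, 0.0, h / 2)]

# First-order theory (c0 = c1 = 0): constant through the thickness.
first = [shear_strain_xz(z, phi_x, dw0_dx, 0.0, 0.0) for z in (-h / 2, 0.0, h / 2)]

# Classical theory (c0 = 1, phi_x = 0): no transverse shear strain at all.
classical = [shear_strain_xz(z, 0.0, dw0_dx, 1.0, 0.0) for z in (-h / 2, 0.0, h / 2)]
print(third, first, classical)
```

The three parameter sets reproduce the qualitative behaviour described for the third-order, first-order and classical theories.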


3.3 Equations of motion 


Hamilton’s principle (or the dynamic version of the principle of virtual displacements) is 
used to derive the equations of motion for a laminated plate with piezoelectric laminae. 
Hamilton’s principle requires that 


$$0 = \int_0^T (\delta\Phi + \delta V - \delta K)\,dt, \tag{23}$$

where $\delta\Phi$ is the virtual Gibbs energy (the volume integral of $\delta\Phi_0$), $\delta V$ is the virtual work
done by applied forces and possibly charge, and $\delta K$ is the virtual kinetic energy. For the
problem at hand, these are given by

$$\delta\Phi = \int_{\Omega_0} \left\{ \int_{-h/2}^{h/2} \left( \sigma_{ij}\,\delta\varepsilon_{ij} - D_i\,\delta E_i \right) dz \right\} dx\,dy, \tag{24}$$


where $q = q_b + q_t$, $q_b$ is the distributed force at the bottom ($z = -h/2$) of the laminate, $q_t$
is the distributed force at the top ($z = h/2$) of the laminate, $(\hat{\sigma}_{nn}, \hat{\sigma}_{ns}, \hat{\sigma}_{nz})$ are the specified
stress components along the normal, tangential, and transverse directions, respectively,
at a point on the boundary $\Gamma$; $(\delta u_n, \delta u_s)$ are the virtual displacements along the normal
and tangential directions, respectively; $\rho$ is the density of the plate material; and a
superposed dot on a variable indicates the time derivative, $\dot{u}_0 = \partial u_0/\partial t$. The subscripts
$n$ and $s$ refer to the normal and tangential directions on the boundary of the midplane $\Omega_0$, respectively.

Note that in (25)-(28), integration over the laminate thickness is carried out and
thickness-averaged resultants are introduced.

Note that in (29) $i$ takes the values of $0, 1, 2, \ldots, 6$, and in (27)-(29), $\alpha$ and $\beta$ take the
symbols $x$ and $y$. The same definitions hold for the stress resultants with a hat. Note
should be made of the fact that the stress resultants defined above are the total
resultants that contain the mechanical, thermal as well as the piezoelectric parts. We
shall separate these parts later.
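
Because every function of the theory is an explicit polynomial in $z$, the thickness integrals reduce to sums over the layers. As an illustration, the sketch below evaluates mass inertias of the form $I_i = \int_{-h/2}^{h/2} \rho\, z^i\, dz$ for $i = 0, 1, \ldots, 6$ in a laminate with piecewise-constant density. That (29) has exactly this form is an assumption based on the stated index range and integration limits; the function name and the data are hypothetical.

```python
def mass_inertias(z, rho, order=6):
    """Return [I_0, ..., I_order] with I_i = sum_k rho_k * (z_{k+1}^{i+1} - z_k^{i+1}) / (i+1).

    z   : layer interface coordinates, from bottom (-h/2) to top (h/2)
    rho : density of each layer (len(rho) == len(z) - 1)
    """
    if len(rho) != len(z) - 1:
        raise ValueError("need one density per layer")
    return [
        sum(r * (zt ** (i + 1) - zb ** (i + 1)) / (i + 1)
            for r, zb, zt in zip(rho, z[:-1], z[1:]))
        for i in range(order + 1)
    ]


# Hypothetical single-layer plate of thickness 2 and density 2.
I = mass_inertias([-1.0, 1.0], [2.0])
print(I)
```

For a laminate that is symmetric about the midplane the odd-order inertias vanish, which removes several coupling terms from the equations of motion.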

Substituting for $\delta\Phi$, $\delta V$, and $\delta K$ from (24)-(26) into (23), noting that the virtual
strains can be written in terms of the generalized displacements $(u_0, v_0, w_0, \varphi_x, \varphi_y, \psi_I)$
using (22), integrating by parts to relieve the virtual generalized displacements in $\Omega_0$ of
any differentiation, and using the fundamental lemma of calculus of variations (Reddy
1984), we obtain the following Euler-Lagrange equations:


Qtx Qa C 2 M a p — M a p C 1 K (1 p, (37) 

where $\alpha$ and $\beta$ take on the letters $x$ and $y$. It must be recognized that the charge $p^I$ can
only be applied at a conductor-dielectric interface. Therefore, unless node $I$ corresponds
to such an interface, $p^I = 0$. In most cases, $\psi_I$ (nodal value of the electric potential, $\psi$) will
be specified at such interfaces, and the corresponding equation will not appear. In
these cases, the surface charge density is calculated a posteriori.

The natural boundary conditions of the theory are: 

(6u n ,du s ):N„„-N nn = 0, N ns -N ns = 0, 

(& + c o^r)=0- c oM„„-M„„ = 0, 

(Sq>„,S(p s ): M m - (M„„ - c x KJ = 0, M„ s - ( M ns - ( 

Wi-P'n-P' n = 0, 









where 


Q„ — Co(Qx n x + Qy n y) + C 0~T~ + ^( w o) 


dM„ dM y 


I l ii 0 -I 2 <p x n x 


f [ dM dM dw 0 \ — .. 

+ L c# l“ar + "aT + 2 lyJ _ ,Bo- aV ’J" 

( dwn. <9w n \ / dw n , r <3w n \ 

^( w o) = + Nxy ~d7j n * + \ xy ~dx + yy ~dy j W; 

IM \ —nn n n n 2 —n 2 ] — y>l (’ 

t ivi ns J L * J- x y J V/ 


nl 2n v .n 


x \ K 


lK.J~L-.-s -.-s 

P',sP' x n, + P‘ y n r (41) 

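Thus the primary variables (i.e., generalized displacements) and secondary variables
(i.e., generalized forces) of the theory are:

$$\text{primary variables:} \quad u_n,\; u_s,\; w_0,\; \frac{\partial w_0}{\partial n},\; \varphi_n,\; \varphi_s,\; \psi_I, \tag{42a}$$

$$\text{secondary variables:} \quad N_{nn},\; N_{ns},\; \bar{Q}_n,\; M_{nn},\; \bar{M}_{nn},\; \bar{M}_{ns},\; P_n^I. \tag{42b}$$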

Note that the boundary term involving the tangential derivative of w 0 was integrated 
by parts to obtain the boundary term 

$$-\oint_\Gamma M_{ns}\,\frac{\partial\,\delta w_0}{\partial s}\,d\Gamma
= \oint_\Gamma \frac{\partial M_{ns}}{\partial s}\,\delta w_0\,d\Gamma - \left[ M_{ns}\,\delta w_0 \right]_\Gamma .$$

The term in the square bracket is zero since the end points of a closed curve coincide.
The remaining term is then added to $Q_n$ (because it is a coefficient of $\delta w_0$). This boundary
condition, $Q_n + (\partial M_{ns}/\partial s)$, is known as the Kirchhoff free-edge condition (see Reddy
1984, 1995).

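The initial conditions of the theory involve specifying the values of the displacements
and their first derivatives with respect to time at $t = 0$:

$$u_n = u_n^0, \quad u_s = u_s^0, \quad w_0 = w_0^0, \quad \varphi_n = \varphi_n^0, \quad \varphi_s = \varphi_s^0,$$

$$\dot{u}_n = \dot{u}_n^0, \quad \dot{u}_s = \dot{u}_s^0, \quad \dot{w}_0 = \dot{w}_0^0, \quad \dot{\varphi}_n = \dot{\varphi}_n^0, \quad \dot{\varphi}_s = \dot{\varphi}_s^0, \tag{43}$$

for all points in $\Omega_0$.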


3.4 Special cases 


The equations of motion of the third-order theory ($c_0 = 1$ and $c_1 \neq 0$) contain, as special
cases, the equations of motion of the classical and first-order laminate theories. These
are outlined below.
The classical laminate theory

By setting $\varphi_x = 0$, $\varphi_y = 0$, $c_0 = 1$, $c_1 = 0$, and $c_2 = 0$ in (30)-(44), we obtain the equations
of motion and the boundary conditions of the classical laminate theory. The equations
of motion are:

$$\delta u_0: \quad \frac{\partial N_{xx}}{\partial x} + \frac{\partial N_{xy}}{\partial y}
= I_0\,\frac{\partial^2 u_0}{\partial t^2} - I_1\,\frac{\partial^3 w_0}{\partial x\,\partial t^2}, \tag{44}$$

5v 0 \ 
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dP 1 dP l 

ii + t~ Pi=p ‘- (47) 

The first-order laminate theory 

By setting c 0 = 0,c x — 0, and c 2 = 0 (hence, c 0 = 1 ,T i — I i = If in (30)—(37), we obtain 
the equations of motion of the first-order laminate theory: 
wherein $\varphi_x$ and $\varphi_y$ take the meaning of total rotations [they denote only shear rotations
in (30)-(36)].


3.5 Constitutive relations 


The theory presented here is valid for laminates with arbitrary lamination scheme. 
Since the laminate is made of several orthotropic layers, with their material axes 
oriented arbitrarily with respect to the laminate coordinates, the constitutive equations 
of each layer must be transformed to the laminate coordinates. The linear constitutive
relations for the $k$th orthotropic lamina are

where $Q_{ij}^{(k)}$, $e_{ij}^{(k)}$, and $\epsilon_{ij}^{(k)}$ are the plane stress-reduced stiffnesses, piezoelectric moduli,
and dielectric moduli, respectively, for the $k$th lamina in its material coordinate system,
$(\sigma_i, \varepsilon_i, E_i, D_i)$ are the stress, strain, electric field, and electric displacement components,
respectively, referred to the material coordinate system $(x_1, x_2, x_3)$, $\alpha_1$ and $\alpha_2$ are the
coefficients of thermal expansion along the $x_1$ and $x_2$ directions, respectively, and $\Delta T$
is the temperature increment from a reference state, $\Delta T = T - T_0$.

The constitutive equations (54), when transformed to the laminate coordinates 
(x, y, z ), take the form 
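where $\bar{Q}_{ij}^{(k)}$ are the plane stress-reduced elastic stiffnesses, referred to the laminate
coordinates

$$\begin{aligned}
\bar{Q}_{11} &= Q_{11}\cos^4\theta + 2(Q_{12} + 2Q_{66})\sin^2\theta\cos^2\theta + Q_{22}\sin^4\theta, \\
\bar{Q}_{12} &= (Q_{11} + Q_{22} - 4Q_{66})\sin^2\theta\cos^2\theta + Q_{12}(\cos^4\theta + \sin^4\theta), \\
\bar{Q}_{22} &= Q_{11}\sin^4\theta + 2(Q_{12} + 2Q_{66})\sin^2\theta\cos^2\theta + Q_{22}\cos^4\theta, \\
\bar{Q}_{16} &= (Q_{11} - Q_{12} - 2Q_{66})\sin\theta\cos^3\theta + (Q_{12} - Q_{22} + 2Q_{66})\sin^3\theta\cos\theta, \\
\bar{Q}_{26} &= (Q_{11} - Q_{12} - 2Q_{66})\sin^3\theta\cos\theta + (Q_{12} - Q_{22} + 2Q_{66})\sin\theta\cos^3\theta, \\
\bar{Q}_{66} &= (Q_{11} + Q_{22} - 2Q_{12})\sin^2\theta\cos^2\theta + Q_{66}(\sin^2\theta - \cos^2\theta)^2, \\
\bar{Q}_{44} &= Q_{44}\cos^2\theta + Q_{55}\sin^2\theta, \\
\bar{Q}_{45} &= (Q_{55} - Q_{44})\cos\theta\sin\theta, \\
\bar{Q}_{55} &= Q_{44}\sin^2\theta + Q_{55}\cos^2\theta,
\end{aligned} \tag{56a}$$

$\alpha_x$, $\alpha_y$ and $\alpha_{xy}$ are the transformed thermal coefficients of expansion,

$$\begin{aligned}
\alpha_x &= \alpha_1\cos^2\theta + \alpha_2\sin^2\theta, \\
\alpha_y &= \alpha_1\sin^2\theta + \alpha_2\cos^2\theta, \\
2\alpha_{xy} &= 2(\alpha_1 - \alpha_2)\sin\theta\cos\theta,
\end{aligned} \tag{56b}$$

$\bar{e}_{ij}$ are the transformed piezoelectric moduli, and $\epsilon_{xx}$, $\epsilon_{yy}$ and $\epsilon_{xy}$ are transformed
dielectric coefficients

$$\begin{aligned}
\bar{e}_{31} &= e_{31}\cos^2\theta + e_{32}\sin^2\theta, \quad
\bar{e}_{32} = e_{31}\sin^2\theta + e_{32}\cos^2\theta, \quad
\bar{e}_{33} = e_{33}, \\
\bar{e}_{36} &= (e_{31} - e_{32})\sin\theta\cos\theta, \quad
\bar{e}_{14} = (e_{15} - e_{24})\sin\theta\cos\theta, \\
\bar{e}_{24} &= e_{24}\cos^2\theta + e_{15}\sin^2\theta, \quad
\bar{e}_{15} = e_{15}\cos^2\theta + e_{24}\sin^2\theta, \\
\bar{e}_{25} &= (e_{15} - e_{24})\sin\theta\cos\theta, \quad
\epsilon_{xx} = \epsilon_{11}\cos^2\theta + \epsilon_{22}\sin^2\theta, \\
\epsilon_{yy} &= \epsilon_{11}\sin^2\theta + \epsilon_{22}\cos^2\theta, \quad
\epsilon_{xy} = (\epsilon_{11} - \epsilon_{22})\sin\theta\cos\theta.
\end{aligned} \tag{56c}$$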
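
The in-plane stiffness transformation is easy to check numerically. The sketch below implements the standard transformation of the plane stress-reduced stiffnesses of an orthotropic lamina through a fibre angle $\theta$, written in the form $\bar{Q}_{12} = (Q_{11} + Q_{22} - 4Q_{66})\sin^2\theta\cos^2\theta + Q_{12}(\sin^4\theta + \cos^4\theta)$, and verifies two properties that any correct implementation should satisfy: $\theta = 0$ returns the material-axis stiffnesses, and $\bar{Q}_{11} + \bar{Q}_{22} + 2\bar{Q}_{12}$ is invariant under rotation. The function name and the stiffness values are hypothetical.

```python
import math


def transform_stiffness(Q11, Q12, Q22, Q66, theta):
    """Transformed in-plane stiffnesses of an orthotropic lamina (theta in radians)."""
    c, s = math.cos(theta), math.sin(theta)
    s2c2 = s * s * c * c
    return {
        "Q11": Q11 * c**4 + 2.0 * (Q12 + 2.0 * Q66) * s2c2 + Q22 * s**4,
        "Q12": (Q11 + Q22 - 4.0 * Q66) * s2c2 + Q12 * (s**4 + c**4),
        "Q22": Q11 * s**4 + 2.0 * (Q12 + 2.0 * Q66) * s2c2 + Q22 * c**4,
        "Q16": (Q11 - Q12 - 2.0 * Q66) * s * c**3 + (Q12 - Q22 + 2.0 * Q66) * s**3 * c,
        "Q26": (Q11 - Q12 - 2.0 * Q66) * s**3 * c + (Q12 - Q22 + 2.0 * Q66) * s * c**3,
        "Q66": (Q11 + Q22 - 2.0 * Q12) * s2c2 + Q66 * (s * s - c * c) ** 2,
    }


# Hypothetical lamina stiffnesses (GPa) in material coordinates.
Q = dict(Q11=140.0, Q12=3.0, Q22=10.0, Q66=5.0)
Qbar = transform_stiffness(theta=math.radians(30.0), **Q)
invariant = Qbar["Q11"] + Qbar["Q22"] + 2.0 * Qbar["Q12"]
print(Qbar, invariant)
```

A 90 degree rotation simply interchanges the roles of the two in-plane material directions, so $\bar{Q}_{11}$ and $\bar{Q}_{22}$ swap and the shear-extension coupling terms vanish.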

3.6 Laminate constitutive relations 


The stress resultants in (27) and (28) can be expressed in terms of the strains using the 
lamina constitutive equations. We have 
where $\{N^T\}$, $\{M^T\}$, and $\{K^T\}$ are the thermal force resultants.

This completes the development of the third-order theory with piezoelectric laminae.


4. Layerwise theory 

4.1 Displacements and strains 

In contrast to the equivalent-single-layer theories, the layerwise theories are developed
by assuming that the displacement field is $C^0$-continuous through the thickness of
each layer of the laminate (see Reddy 1987, 1995; Robbins & Reddy 1993). This
amounts to using ESL displacement fields in each (physical or numerical) layer of the
laminate. Thus the displacement components are continuous through the laminate
thickness but the transverse derivatives of the displacements may be discontinuous at
various points through the thickness, thereby allowing for the possibility of continuous
transverse stresses at interfaces separating dissimilar material layers. A layerwise
displacement field provides a kinematically correct representation of the moderate to
severe cross-sectional warping associated with the deformation of moderately thick to
very thick laminates.

In the layerwise theory of Reddy (1995), the total displacement field is represented as

$$\begin{aligned}
u(x,y,z,t) &= \sum_{I=1}^{N} U_I(x,y,t)\,\Phi^I(z), \\
v(x,y,z,t) &= \sum_{I=1}^{N} V_I(x,y,t)\,\Phi^I(z), \\
w(x,y,z,t) &= \sum_{I=1}^{M} W_I(x,y,t)\,\Psi^I(z),
\end{aligned} \tag{68a}$$

where $(U_I, V_I, W_I)$ denote the values of $(u, v, w)$ at the $I$th interface through the
thickness, $N$ is the number of nodes through the thickness, and $\Phi^I$ are the global
interpolation functions for the discretization of the inplane displacements through the
thickness (see figure 1); $M$ is the number of nodes and $\Psi^I$ are the global interpolation
functions for discretization of the transverse displacement $w$ through the thickness. Note
that independent approximations for the inplane and transverse displacements are
assumed ($\Phi^I \neq \Psi^I$). Consequently, inextensibility of transverse normals (i.e.,
$w = w(x,y)$) can be imposed, if desired, by simply setting $M = 1$ and $\Psi^1 = 1$ for all $z$.

The electric potential is also represented using a layerwise approximation

$$\psi(x,y,z,t) = \sum_{I=1}^{N} \psi_I(x,y,t)\,\Phi^I(z), \tag{68b}$$

where $\psi_I(x,y,t)$ denote the nodal values of $\psi$, $N$ is the number of nodes through the
thickness, and $\Phi^I$ are the global Lagrange interpolation functions for the discretization
of $\psi$ through the thickness.

For linear variation through each numerical layer, the functions $\Phi^I$ are given by

$$\begin{aligned}
\Phi^1(z) &= \psi_1^{(1)}(z), \quad z_1 \le z \le z_2, \\
\Phi^I(z) &= \begin{cases} \psi_2^{(I-1)}(z), & z_{I-1} \le z \le z_I, \\ \psi_1^{(I)}(z), & z_I \le z \le z_{I+1}, \end{cases} \quad (I = 2, 3, \ldots, N-1), \\
\Phi^N(z) &= \psi_2^{(N-1)}(z), \quad z_{N-1} \le z \le z_N,
\end{aligned} \tag{69a}$$

$$\psi_1^{(k)} = 1 - \bar{z}/h_k, \qquad \psi_2^{(k)} = \bar{z}/h_k, \qquad 0 \le \bar{z} \le h_k, \tag{69b}$$

where $N = n + 1$, $n$ is the number of numerical subdivisions through the thickness, $h_k$ is
the thickness of the $k$th layer, $\bar{z} = z - z_b^k$, and $z_b^k$ denotes the $z$-coordinate of the bottom of the
$k$th numerical layer. Any desired degree of displacement variation through the
thickness is easily obtained by either adding more linear finite element sub-divisions
through the thickness or using higher-order elements through the thickness. The
layerwise concept is very general in that the number of subdivisions through the
thickness can be greater than, equal to, or less than the number of material layers
through the thickness. Note that the sublaminate concept can be used (i.e. the number
of thickness subdivisions is less than the number of material layers). The layerwise
displacement field (68a) contains, as special cases, the displacement fields of the ESL
theories described in §3.

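The von Karman nonlinear strains associated with the displacement field (68a) are:

$$\begin{aligned}
\varepsilon_{xx} &= \sum_{I=1}^{N} \frac{\partial U_I}{\partial x}\,\Phi^I
+ \frac{1}{2}\left(\sum_{I=1}^{M} \frac{\partial W_I}{\partial x}\,\Psi^I\right)\left(\sum_{J=1}^{M} \frac{\partial W_J}{\partial x}\,\Psi^J\right), \\
\varepsilon_{yy} &= \sum_{I=1}^{N} \frac{\partial V_I}{\partial y}\,\Phi^I
+ \frac{1}{2}\left(\sum_{I=1}^{M} \frac{\partial W_I}{\partial y}\,\Psi^I\right)\left(\sum_{J=1}^{M} \frac{\partial W_J}{\partial y}\,\Psi^J\right), \\
\varepsilon_{zz} &= \sum_{I=1}^{M} W_I\,\frac{d\Psi^I}{dz}, \\
\gamma_{xy} &= \sum_{I=1}^{N} \left(\frac{\partial U_I}{\partial y} + \frac{\partial V_I}{\partial x}\right)\Phi^I
+ \left(\sum_{I=1}^{M} \frac{\partial W_I}{\partial x}\,\Psi^I\right)\left(\sum_{J=1}^{M} \frac{\partial W_J}{\partial y}\,\Psi^J\right), \\
\gamma_{xz} &= \sum_{I=1}^{N} U_I\,\frac{d\Phi^I}{dz} + \sum_{I=1}^{M} \frac{\partial W_I}{\partial x}\,\Psi^I, \\
\gamma_{yz} &= \sum_{I=1}^{N} V_I\,\frac{d\Phi^I}{dz} + \sum_{I=1}^{M} \frac{\partial W_I}{\partial y}\,\Psi^I.
\end{aligned} \tag{70}$$

Note that the transverse normal strain is nonzero, and that the strains are discontinuous
at the layer interfaces because of the layerwise definition of the functions $\Phi^I$ and $\Psi^I$.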

4.2 Equations of motion 


The governing equations of motion for the present layerwise theory can be derived
using Hamilton's principle. The virtual Gibbs energy $\delta\Phi$, virtual work done by applied
forces $\delta V$, and the virtual kinetic energy $\delta K$ are given by

$$\delta\Phi = \int_{\Omega_0} \int_{-h/2}^{h/2} \Big[ \sigma_{xx}\,\delta\varepsilon_{xx} + \sigma_{yy}\,\delta\varepsilon_{yy} + \sigma_{zz}\,\delta\varepsilon_{zz}
+ \sigma_{xy}\,\delta\gamma_{xy} + \sigma_{xz}\,\delta\gamma_{xz} + \sigma_{yz}\,\delta\gamma_{yz}
- (D_x\,\delta E_x + D_y\,\delta E_y + D_z\,\delta E_z) \Big]\,dz\,dx\,dy, \tag{71}$$

$$\delta V = -\int_{\Omega_0} \Big[ q_b(x,y)\,\delta w(x,y,-\tfrac{h}{2}) + q_t(x,y)\,\delta w(x,y,\tfrac{h}{2})
+ p_b(x,y)\,\delta\psi(x,y,-\tfrac{h}{2}) + p_t(x,y)\,\delta\psi(x,y,\tfrac{h}{2}) \Big]\,dx\,dy
- \oint_\Gamma \int_{-h/2}^{h/2} \big[ \hat{\sigma}_{nn}\,\delta u_n + \hat{\sigma}_{ns}\,\delta u_s + \hat{\sigma}_{nz}\,\delta w \big]\,dz\,d\Gamma, \tag{72}$$

$$\delta K = \int_{\Omega_0} \int_{-h/2}^{h/2} \rho_0\,(\dot{u}\,\delta\dot{u} + \dot{v}\,\delta\dot{v} + \dot{w}\,\delta\dot{w})\,dz\,dx\,dy
= \int_{\Omega_0} \Big[ \sum_{I,J=1}^{N} I^{IJ}\,(\dot{U}_I\,\delta\dot{U}_J + \dot{V}_I\,\delta\dot{V}_J)
+ \sum_{I,J=1}^{M} \hat{I}^{IJ}\,\dot{W}_I\,\delta\dot{W}_J \Big]\,dx\,dy, \tag{73}$$

where

$$I^{IJ} = \int_{-h/2}^{h/2} \rho_0\,\Phi^I\,\Phi^J\,dz, \qquad
\hat{I}^{IJ} = \int_{-h/2}^{h/2} \rho_0\,\Psi^I\,\Psi^J\,dz.$$

The Euler-Lagrange equations of the layerwise theory are 


AN 1 dN 1 N d 2 TI 

1 dx dy dt 2 ’ 

5Vr £g, + ^_ e;= 

ew - f - + f -- a ++ **» + ^ 1 '"5- 


(74) 

(75) 

(76a) 

(76b) 

(77) 

(78) 

(79) 

(80) 
(81) 
The natural (force) boundary conditions of the theory are:


= K-fi'ns = 0, (e! + ^)-£ = 0, 

Pi-Pi- o, 

where $(N_{nn}^I, N_{ns}^I)$ have meaning similar to $(N_{nn}, N_{ns})$ introduced in § 3:

$$Q_n^I = Q_x^I n_x + Q_y^I n_y, \qquad U_n^I = U_I n_x + V_I n_y, \qquad U_s^I = -U_I n_y + V_I n_x,$$

V^)n + N"™± 

*> cy ) * V *’ dx ” s y 


M 

#'= I 

J -1 


** ax 


(82a) 

(82b) 

(83a) 

(83b) 


Thus, the primary variables (displacements) and secondary variables (forces) of the 
layerwise theory are: 

Primary variables: U ", U], W I} i/q, (84a) 

Secondary variables: N { nn , N* s , Q[ + , P\. (84b) 


4.3 Laminate constitutive equations 


The stresses in the kth layer may be computed from the 3-D stress-strain equations. 
For the kth (orthotropic) lamina we have 
where the $\bar{C}_{ij}^{(k)}$, $\bar{e}_{ij}^{(k)}$, and $\bar{\epsilon}_{ij}^{(k)}$ are the transformed material coefficients in the $(x,y,z)$
system. If inextensibility of transverse normals is assumed, one may use the plane
stress-reduced stiffness in place of the 3-D stiffnesses. Note that the strains at a layer
interface depend on the layer, i.e., $\{\varepsilon\}_P^{k} \neq \{\varepsilon\}_P^{k+1}$ at a point $P$ on the interface of layers
$k$ and $k + 1$.

The laminate constitutive equations for the layerwise theory can be derived using
the lamina constitutive equations (56) and (57) and the definitions for the stress
resultants in (75) and (76). In the layerwise theory a distinction must be made between
the number of physical layers ($N_e$) and the number of numerical layers ($n$) in the
laminate. The integrals in the definition of the resultants must be evaluated only over
the thickness interval over which the global interpolation functions, $\Phi^I$ and $\Psi^I$, are
defined.

The relations between the stress resultants and the strains can be expressed as 
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and the thermal stress resultants are defined as 
In the above equations, $N_e$ denotes the number of physical layers in the laminate, $z^k_b$ and
$z^k_t$ are the coordinates of the bottom and top of the kth layer, and the laminate stiffnesses are
defined as follows:
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Note that the sum over the number of layers in the definition of the laminate stiffnesses 
is limited to the numerical layers over which the functions $\Phi^I$ and $\Psi^I$ are defined. The
functions $\Phi^I$ and $\Psi^I$ are defined at most over two adjacent numerical layers. When
the von Karman nonlinearity is not included, all laminate stiffnesses with three and 
four superscripts will not enter the governing equations. 


5. Closure 

In this paper two theories are presented for extensional and flexural motion of
laminated plates with piezoelectric laminae. The first one is based on a third-order
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expansion of the displacement field and the second one is based on a layerwise 
expansion of the displacement field through the laminate thickness. The first one 
contains, as special cases, the classical and first-order laminate theories. In both 
theories, the scalar electric potential is approximated through the laminate thickness to 
allow independent and higher-order prescription of the electric field. Linear constitutive
equations of thermoelectro-elasticity and the von Karman nonlinear strains in the
laminate are used in the development. The theories developed herein can be used to 
study the response of composite laminates with either surface mounted or embedded 
piezoelectric laminae. 

The model based on the third-order laminate theory is computationally simple 
because it contains only five dependent variables. It is adequate for predicting global 
response characteristics or preliminary design of laminated structures. However, the 
theory does not predict interlaminar stresses accurately in thick composite structures. 
The model based on the layerwise theory is essentially a three-dimensional model, 
hence computationally more expensive. However, the model can be used to determine 
the local stress fields accurately. In practice, a combination of the two models (the ESL
model in global regions of 2-D stress state and the layerwise model in local regions of
3-D stress state) will prove most appropriate. Such hierarchical (or variable kinematic)
models with the global-local strategies (Reddy & Robbins 1994) provide the most 
robust computational methodology for the analysis of practical structures. 

The theories presented here can be extended to the case where the plates are 
subjected to large electric fields (Tiersten 1993). This allows modelling of a larger class
of phenomena, including electrostriction and higher-order permeability effects. For quadratic
nonlinear constitutive behavior of the electric field, the enthalpy function is assumed to 
be a quadratic function of strains and a cubic function of the electric field, 

®o( e y’ ^ = 2^ i Jkl £ ij & kl ~ e ijk E ij^k —^) i j kl £ i jE k E l — PijSijO 


so that 


{hfifii ~^w E .EA ~\%E,E,8 - 9 2 . ( 108 ) 


°ij ~ Cijkl £ kl e ijk^k ~ b iJkl E k Ei — fiijO, 


(109) 


■Ofc ^ijk £ ij + £ jk "b « ^ j jk E; Ej + C[j k E k 0 , 


(110)


n = Pij £ ij + \<lijEiEj + jrB, ( 111 ) 

where $b_{ijkl}$ denote the effective electrostrictive constants, $\chi_{ijk}$ are the electric susceptibility
constants, and $q_{ij}$ are the pyroelectric coefficients.
Abstract. There are no definitive solutions available for the order estimation
of the auto-regressive process. First, the performance of the three
criteria, namely FPE, AIC and MDL, is illustrated. Next, it is indicated that
there are possibilities of their performance being improved. The algorithm
tri proposed here utilizes three minimum values of any of the conventional
loss functions in the FPE, AIC and MDL methods. It also uses three
statistics derived from these three minimum values. The estimated order is
a rounded weighted average of these six statistics. The algorithm is found to
do better in a qualified sense of yielding a more sharply peaked distribution of the
estimated orders when tested for 1000 synthetic models of orders 3, 5 and
7 each. The conclusion drawn is that there are open possibilities of
improving upon the conventional order estimators for auto-regressive
processes. This means that until axiologically sounder estimators are available,
one should be ready to use the heuristic estimators proposed here.

Keywords. Algorithm tri ; auto-regressive process; order estimation; plural 
criteria; order statistics. 


1. Introduction 

Consider a time series x(n), n = 0, 1, ..., N - 1 of finite length N. If

$$x(n) = \sum_{i=1}^{r} a_{r,i}\, x(n-i) + u(n), \qquad (1)$$

where u(n) is an uncorrelated innovations process, then x(n) is said to be an auto-regressive
time-series (ARTS) of order r (Box & Jenkins 1976). Coefficients
$a_{r,i}$, i = 1, 2, ..., r are called auto-regressive or linear prediction coefficients (LPCs). As
the true order r would be unknown and would have to be estimated, and as it
determines the number of LPCs to be estimated, it is useful to make r an explicit part of
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the notation for LPCs as done above. It also permits the possibility of referring to 
r LPCs under different assumed orders r, pending the estimation of r. The innovations 
process u(n) is assumed to be zero-mean, white and Gaussian.
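
As an illustration of the model in (1), the following minimal Python sketch generates one ARTS realization. The function name `simulate_ar`, the zero initial conditions and the option of supplying the innovations explicitly are assumptions made for this sketch; they are not taken from the original work.

```python
import random


def simulate_ar(coeffs, n, innovations=None, seed=0):
    """Generate x(0), ..., x(n-1) from x(t) = sum_i a_{r,i} x(t-i) + u(t).

    coeffs[i-1] holds a_{r,i}. Samples before t = 0 are taken as zero
    (an assumption of this sketch, so the first few samples carry a transient).
    """
    if innovations is None:
        rng = random.Random(seed)
        # Zero-mean, white, Gaussian innovations with unit variance.
        innovations = [rng.gauss(0.0, 1.0) for _ in range(n)]
    x = []
    for t in range(n):
        # Weighted sum of the available past samples.
        past = sum(a * x[t - i] for i, a in enumerate(coeffs, start=1) if t - i >= 0)
        x.append(past + innovations[t])
    return x
```

Passing a unit impulse as the innovations, for example `simulate_ar([0.5], 4, innovations=[1.0, 0.0, 0.0, 0.0])`, returns the impulse response of the first-order model, which is a quick way to check the recursion by hand.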

There are many methods to estimate the order r of ARTS. Three famous ones are 
final prediction error (FPE) (Akaike 1969, 1970), asymptotic information criterion
(AIC) (Akaike 1973, 1974) and minimum description length (MDL) (Rissanen 1983). At
times the ‘A’ in AIC is taken to be Akaike’s rather than asymptotic to give the acronym 
an eponymic advantage. The basic structure of the three methods is similar. They use 
a loss function which is minimized. The three loss functions are: 


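$$\Lambda_{\mathrm{FPE}}(N, r) = \hat{\sigma}^2\,(N + r + 1)/(N - r - 1), \qquad (2)$$

$$\Lambda_{\mathrm{AIC}}(N, r) = \ln \hat{\sigma}^2 + 2r/N, \qquad (3)$$

$$\Lambda_{\mathrm{MDL}}(N, r) = N \ln \hat{\sigma}^2 + r \ln N. \qquad (4)$$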


Each loss function depends on $\hat{\sigma}^2$, which is the estimated variance of the linear
prediction error. This is expected to decrease monotonically as the assumed or fitted
order r is increased. Then there is a modifier dependent on r, the assumed order, and N,
the length of the time series available. The purpose of the modifier is to arrange for
a minimum in the loss function, possibly at the correct order. The modifier could be
multiplicative as in the case of FPE or additive as in the case of AIC and MDL, and
a monotone transformation could be used on the variance of the prediction error before
a modifier is applied as in the latter two cases.
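
The three loss functions (2)-(4) and their minimization over the assumed order can be sketched as follows. The function names and the convention that `sigma2_by_order[r]` holds the prediction error variance for assumed order r are assumptions of this illustration.

```python
import math


def fpe(sigma2, n, r):
    """Final prediction error, equation (2): multiplicative modifier."""
    return sigma2 * (n + r + 1) / (n - r - 1)


def aic(sigma2, n, r):
    """AIC loss, equation (3): additive modifier on the log variance."""
    return math.log(sigma2) + 2.0 * r / n


def mdl(sigma2, n, r):
    """MDL loss, equation (4): additive modifier on N times the log variance."""
    return n * math.log(sigma2) + r * math.log(n)


def argmin_order(sigma2_by_order, n, loss):
    """Return the assumed order r that minimizes the given loss function."""
    losses = [loss(s2, n, r) for r, s2 in enumerate(sigma2_by_order)]
    return min(range(len(losses)), key=losses.__getitem__)
```

With a variance sequence that drops sharply up to order 2 and only marginally afterwards, all three criteria place the minimum at order 2, because the modifier outweighs the small further decrease in variance.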

Most users assume that the problem of estimating the order of the auto-regressive 
process is solved satisfactorily and use one of the standard methods available for the 
purpose. One of the aims of this paper is to suggest that the problem above is still an 
open one. Thus, initially the performance of the FPE, AIC and MDL methods is 
assessed ab initio. This has not been done adequately in the literature. For example, one
study (Shibata 1976) reports the frequency distribution of the order estimated by the
AIC method on the basis of 500 realizations when the correct order is unity. For N = 50
and $a_{1,1} = \pm 0.8$ the order is estimated as unity about 74% of the time. When $a_{1,1}$ is weaker
the percentage of correct order estimation is even lower. Another more recent study
(Broersen & Wensink 1993) used 10,000 realizations of ARTS of order two and N = 25
to obtain the frequency distribution of the estimated order by 10 different methods. The
percentage of correct order estimation ranged between 35.6 and 74.4. There are many
applications wherein ARTS models of higher orders would be necessary (Makhoul
1975; Kay & Marple 1981). Therefore, true orders 3, 5 and 7 are used here to generate
1000 realizations each of ARTS and, for N = 50, the frequency distribution of the
estimated orders has been studied. The inference drawn is that the three conventional
methods do not perform very satisfactorily, making their axiological elegance only of
restricted practical utility.

The problem posed here is whether the performance of the conventional order 
estimators can be improved. It is a real problem, particularly in the context that it is not 
even widely known that the performance of the conventional order estimators is as bad 
as is brought out here, for values of r between 3 and 7. 

The approach here is heuristic and is substantiated by extensive numerical simulation,
as better axiology was not available for more than two decades. To keep the
computational efforts within manageable limits, the effects of sample size N have not
been studied. N is chosen to be 50, which is the larger of the values used in the two earlier
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studies (Shibata 1976; Broersen & Wensink 1993) referred to above. The number of 
realizations of ARTS used to base the inference on is between the values used in the 
earlier two studies. The number of methods investigated here also lies in between. 
Taking these three considerations together, it may be said that the computational
determinants of effort here form roughly the geometric mean of those of the two earlier
studies, given that three values of the correct order rather than just one, and larger
values at that, have been studied. Further, the computational effort put in is of the same order as
for the latter of the two studies.

Motivation for the present work came through watching the loss functions of the 
three methods of ARTS order estimation. Unlike the formal expectation from equations
(2)-(4), the loss functions do not show a sharp single minimum. Their shapes are at
times very irregular and depend on the parameters (LPCs) of the ARTS realization. 
Therefore, it was believed that more information could be extracted from the loss 
functions to improve upon the conventional order estimators. The work reported 
establishes such a hope without claiming that the results here cannot be improved upon 
in a variety of ways. 

2. Assessment of the conventional estimators 

One thousand ARTS models each of orders 3, 5 and 7 have been generated. The basic idea
was to make them representative of all the possibilities that could arise. This obviously
needs proper definition. ARTS models are amenable to at least three different characterizations.
They are in terms of their auto-correlation coefficients, linear prediction
coefficients and reflection coefficients. The three characterizations are interconvertible.
But it does not mean that they are equivalent for all purposes. For example, it has been
shown that when ARTS models are to be ordered on the basis of their proximity to
a prescribed model, the results may be different when the three characterizations above
are used (Moharir 1993). This obviously means that the characterization to be relied
upon must be chosen a priori. The characterization chosen here is in terms of the
reflection coefficients. The reflection coefficient (RC)

$$k_j = a_{j,j} \qquad (5)$$

is the last LPC when an ARTS model of order j is imposed on the time series. There are 
some good reasons why the RCs have been chosen for a proper characterization of the 
ARTS models. When the Levinson-Durbin algorithm (LDA) (Kay & Marple 1981; 
Moharir 1985; Alexander 1986) is used for estimating the LPCs, reflection coefficients 
turn out to be important intermediate results (Makhoul 1975). But even more 
pertinently, the prediction on the basis of the ARTS model can be arranged entirely in 
terms of the RCs (Makhoul 1975; Kay & Marple 1981) with some advantages. Further, 
the Schur-Cohn stability condition (Proakis 1989) for the ARTS is that 

$$|k_j| < 1. \qquad (6)$$

Thus, the appropriate RC space is a hypercube. The stability conditions are more 
involved in terms of the other two characterizations of ARTS. Filling a hypercubical 
region uniformly with uncorrelated random numbers is possible and easy but it has 
been suggested (Hoel 1966) that for regions with other boundaries it may not even be 
possible in principle. Therefore, the definition of a set of models representative of all 
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Table 1. The assessment of the performance of the three methods, FPE, AIC and MDL, for the 
estimation of the order of the ARTS models. 

The numbers of models which gave the estimated order are tabulated against the latter. 
Thousand synthetic models of true ARTS orders 3,5 and 7 each were synthesized. See text for the 
non-numerical entries in the column ‘estimated order’. Entries against them summarize the 
properties of the distribution of the estimated orders. 


Estimated     Correct order = 3     Correct order = 5     Correct order = 7
order           FPE   AIC   MDL       FPE   AIC   MDL       FPE   AIC   MDL

0                38    38    66        12    12    31         4     4    21
1                73    73    94        28    28    49        13    12    44
2               186   184   224        45    43    74        31    31    54
3               607   609   582       132   132   157        61    61    93
4                67    67    29       169   168   203        71    71   101
5                24    23     5       465   465   415       108   106   133
6                 3     4              80    82    37       197   195   192
7                 3     2              58    59    31       340   340   273
8                                       8     8     2       107   110    76
9                                       1     1     1        56    58    11
10                                      1     1              19     9     2
11                                      1     1               1

C               607   609   582       465   465   415       340   340   273
L               297   295   384       386   383   514       485   480   638
H                97    96    34       149   152    71       183   180    89
N_s             860   860   835       714   715   655       644   645   541
N_O             793   793   806       634   633   618       537   535   465

possibilities was taken to mean a set in which RCs are chosen to spread uniformly and 
randomly over every axis of the hypercube. 
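
A minimal sketch of this model-generation step is given below: reflection coefficients are drawn independently and uniformly inside the hypercube of (6) and converted to LPCs by the step-up recursion implied by (5). The function names and the use of Python's `random.Random` generator are assumptions of this illustration, not details reported in the original work.

```python
import random


def sample_reflection_coefficients(order, rng):
    """Draw k_1, ..., k_r independently and uniformly from [-1, 1).

    The endpoint -1 has negligible probability, so condition (6)
    holds for all practical purposes.
    """
    return [rng.uniform(-1.0, 1.0) for _ in range(order)]


def step_up(refl):
    """Convert RCs k_1..k_r into LPCs a_{r,1}..a_{r,r}, with k_j = a_{j,j}."""
    a = []
    for j, k in enumerate(refl, start=1):
        # a_{j,i} = a_{j-1,i} - k_j * a_{j-1,j-i} for i < j, and a_{j,j} = k_j.
        a = [a[i] - k * a[j - 2 - i] for i in range(j - 1)] + [k]
    return a
```

For example, the reflection coefficients (0.5, 0.2) give the second-order LPCs (0.4, 0.2). Because every |k_j| is below one, the resulting model is stable by construction, which is the practical advantage of sampling in RC space.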

For each of the 3000 ARTS models chosen, LDA was used to evaluate the RCs. 
A time-series sample of length N = 50 was used for this purpose. Then predictions were 
performed under various assumed orders and were compared against the extension of 
50 values of the same ARTS to calculate $\hat{\sigma}^2$ of (2)-(4). The FPE, AIC and MDL
estimates of ARTS order were obtained by minimizing the respective loss functions. 
The results are shown in table 1. 
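
The conventional estimation loop can be sketched as below. Note one substitution: the text computes the prediction error variance from predictions over a 50-value extension of the series, whereas this sketch uses the error variance produced by the Levinson-Durbin recursion itself as a stand-in. The function names, the biased autocovariance estimate and the choice of AIC in the example are likewise assumptions of the illustration.

```python
import math


def autocovariance(x, max_lag):
    """Biased sample autocovariances c(0), ..., c(max_lag) of a zero-mean series."""
    n = len(x)
    return [sum(x[t] * x[t - m] for t in range(m, n)) / n for m in range(max_lag + 1)]


def levinson_durbin(acov, max_order):
    """Return (reflection coefficients, error variances for orders 0..max_order)."""
    a, err = [], acov[0]
    refl, errors = [], [err]
    for j in range(1, max_order + 1):
        # Part of c(j) not explained by the order j-1 predictor.
        acc = acov[j] - sum(a[i] * acov[j - 1 - i] for i in range(j - 1))
        k = acc / err
        a = [a[i] - k * a[j - 2 - i] for i in range(j - 1)] + [k]
        err *= 1.0 - k * k  # error variance shrinks by (1 - k_j^2)
        refl.append(k)
        errors.append(err)
    return refl, errors


def aic(sigma2, n, r):
    """AIC loss of equation (3)."""
    return math.log(sigma2) + 2.0 * r / n


def estimate_order(errors, n, loss):
    """Order at which the loss, evaluated on the error variances, is smallest."""
    losses = [loss(s2, n, r) for r, s2 in enumerate(errors)]
    return min(range(len(losses)), key=losses.__getitem__)
```

For the exact autocovariances of a first-order model, `[1.0, 0.5, 0.25, 0.125]`, the recursion returns the reflection coefficients (0.5, 0, 0), and the AIC minimum falls at order 1.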

In the column for the estimated order in table 1, there are five non-numerical entries
at the end. C refers to the correct order. Order 3 is correctly estimated slightly less than 61%
of the time by the FPE and AIC methods and about 58% of the time by the MDL method. For
orders 5 and 7 the percentage of correct order estimation is even lower for all the
methods. The difference between the performance of the MDL method and the other
two is also seen to grow as the correct order increases. For the correct order of 7, the
percentage of correct order estimation is about 34% for the FPE and AIC methods and
just about 27% for the MDL method. L stands for the ‘orders lower than the correct
one’ and H for the ‘orders higher than the correct one’. As the correct order increases,
for all the three methods the percentage of underestimation or overestimation of the
order increases, underestimation being more probable than overestimation. The
MDL method underestimates more and overestimates less than the other two.
$N_s$ stands for the ‘symmetric neighbourhood of the correct order’ comprising the
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correct order and immediately adjacent orders, one on each side. When the order is 
incorrectly estimated, an underestimate may be preferred to an overestimate by the
same amount because, erroneous as both the estimates are, the former subsequently
requires less effort or cost to predict. Further, simpler models have more philosophical
appeal for those who adopt Occam’s razor (Anderson 1963; Bernal 1969; Constable
1987; Moharir 1991). In this spirit $N_O$ stands for the ‘Occamic neighbourhood of the
correct order’ and includes the correct order and the next lower order. On the criteria of
the estimated order being in the neighbourhood of the correct order, the three methods
do much better than on the criterion of the correct order estimation. The major part of the
improvement comes from going to the Occamic neighbourhood from the correct order,
the further improvement obtained by enlarging the neighbourhood symmetrically
being relatively smaller. The first improvement is larger for the MDL method than for
the other two and the second one is relatively small. For the correct order of 3, the MDL
method even does marginally better than the other two on the criterion of $N_O$. For all
the methods studied, the distribution of the estimated order is asymmetrical with
respect to the correct order.
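
The five summary entries can be computed from a frequency distribution of estimated orders as in the following sketch. The function name `summarize` and the dictionary layout are assumptions of this illustration; the example data are the MDL counts for correct order 3 from table 1.

```python
def summarize(counts, true_order):
    """Summarize a distribution of estimated orders.

    counts maps each estimated order to the number of models that gave it.
    """
    correct = counts.get(true_order, 0)
    lower = sum(v for r, v in counts.items() if r < true_order)
    higher = sum(v for r, v in counts.items() if r > true_order)
    below = counts.get(true_order - 1, 0)  # next lower order
    above = counts.get(true_order + 1, 0)  # next higher order
    return {
        "C": correct,                    # correct order
        "L": lower,                      # all orders lower than the correct one
        "H": higher,                     # all orders higher than the correct one
        "N_s": below + correct + above,  # symmetric neighbourhood
        "N_O": below + correct,          # Occamic neighbourhood
    }


mdl_order_3 = {0: 66, 1: 94, 2: 224, 3: 582, 4: 29, 5: 5}
summary = summarize(mdl_order_3, 3)
```

Here `summary` reproduces the MDL entries of table 1 for correct order 3: C = 582, L = 384, H = 34, N_s = 835 and N_O = 806.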

Information summarized in table 1 and some preliminary numerical experiments 
suggested that the conventional order estimators may possibly be improved upon. One 
possible way of obtaining the improvement and the attendant difficulties are discussed 
in the next section. 

3. Algorithm tri 

The loss functions used by the FPE, AIC and MDL methods are motivated by the 
usefulness of deriving a measure which takes the least value for the correct order. But 
the above results indicate that this does not happen frequently enough. Further, it is
believed to be a common experience of those who inspect plots of the loss functions,
rather than only the order the algorithm reports from them, that these loss
functions are not necessarily uni-minimal. Therefore, it is surmised that there is more
information in the loss functions about the order of ARTS than is contained in their 
lowest minima. The question then was whether it could be extracted. The results 
presented below should be regarded rather as one such possibility than the final answer. 

Table 2. One possibility in which three opinions
about the ARTS order can be consensually corrected.
The actual correction strategy used is a modification
of this.


Ranking of the         Possible corrected values
three opinions         of the three opinions

r_1 < r_2 < r_3        r_1        r_2 - 1    r_3 - 2
r_1 < r_3 < r_2        r_1        r_3 - 1    r_2 - 2
r_2 < r_1 < r_3        r_2 + 1    r_1        r_3 - 1
r_2 < r_3 < r_1        r_2 + 2    r_3 + 1    r_1
r_3 < r_1 < r_2        r_3 + 1    r_1        r_2 - 1
r_3 < r_2 < r_1        r_3 + 2    r_2 + 1    r_1
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From the loss function graphs three order values were chosen which gave the three 
lowest values of the loss function. Let these orders, viewed as three opinions, be called $r_1$,
$r_2$ and $r_3$. Then $r_1$ would be the conventional order estimate. These opinions can be
ordered in six different ways as shown on the left of table 2. If the opinions are to be 
corrected, as they refer to the same correct order, one possible way of correcting, giving 
maximum credence to the first opinion, is shown on the right of table 2. The corrections 
applied are the least possible, given that the three opinions are different. This can be 
expressed in a single formula as follows: 

$$P_j = r_j - [R(r_j) - R(r_1)], \quad j = 1, 2, 3, \qquad (7)$$

where R(·) is the rank when the three opinions are ranked in the ascending order and
$P_j$ is the corrected value of the opinion $r_j$. Next, it is possible to modify (7) by making the
correction depend on the spread of the three opinions. That is, if $r_{\max}$ and $r_{\min}$ are the
maximum and minimum values of the order among the three opinions one could
choose

$$P_j = r_j - \beta_j (r_{\max} - r_{\min}) [R(r_j) - R(r_1)], \qquad (8)$$

where $\beta_j$ is called the degree of correction of $r_j$. Further, if $P_j$ turns out to be negative, it
could rather be set to zero. The values of $\beta_2$ and $\beta_3$ were chosen as 0.2. Corrections
applied amount to editing the opinions about the quantity to be estimated for
achieving robustness and are in the spirit of Winsorization, trimming and jackknifing
(Barnett & Lewis 1978; Diaconis & Efron 1983; Moharir 1988).
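
The corrections (7) and (8) can be sketched in a few lines. The function name `corrected_opinions` and the convention that `beta=None` selects the unit correction of (7) are assumptions of this illustration. Since the correction of the first opinion is always zero, the value passed for its degree of correction has no effect.

```python
def corrected_opinions(r, beta=None):
    """Correct three distinct opinions r = (r1, r2, r3), r1 being the conventional estimate.

    beta=None applies equation (7); otherwise beta = (b1, b2, b3) holds the
    degrees of correction of equation (8).
    """
    rank = {value: i + 1 for i, value in enumerate(sorted(r))}  # ascending ranks R(.)
    spread = max(r) - min(r)
    corrected = []
    for j, rj in enumerate(r):
        shift = rank[rj] - rank[r[0]]  # R(r_j) - R(r_1)
        scale = 1.0 if beta is None else beta[j] * spread
        # A negative corrected value is set to zero, as suggested in the text.
        corrected.append(max(0.0, rj - scale * shift))
    return corrected
```

For the opinions (3, 5, 4), which fall under the ranking r_1 < r_3 < r_2, equation (7) gives (3, 3, 3), in agreement with the second row of table 2. With degrees of correction of 0.2 the same opinions become (3, 4.2, 3.6).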

In addition to the three corrected opinions, three statistics are derived from the three 
opinions. They are: 

$$P_4 = r_{\mathrm{mid}}, \qquad (9)$$

( 10 ) 

JWw 3 ) 1/3 , (ii) 

where $r_{\mathrm{mid}}$ is the middle value among $r_1$, $r_2$ and $r_3$. The averages of these three statistics
are presumed to represent the central tendency, spread and third cumulant of the ARTS
order distribution. Therefore, use of these three statistics is in the nature of stochastic
feedback up to the third order moments or cumulants. The rounded value of the
weighted average of the $P_j$, j = 1, 2, ..., 6 is taken to be the estimate $r_{\mathrm{est}}$ of the ARTS order.
That is,

$$r_{\mathrm{est}} = \sum_{j=1}^{6} w_j P_j \Big/ \sum_{j=1}^{6} w_j, \qquad (12)$$

where for convenience the weight factors $w_j$ are taken to be nonnegative multiples of 0.25.
It can be readily seen that care has been taken to ensure that the quantities averaged are
dimensionally compatible. In fact, the particular measures of central tendency, spread
and skewness have been specifically chosen to satisfy this requirement.
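
The final step, the rounded weighted average of (12), can be sketched as follows. The six statistics are taken as given inputs here, the function name `tri_estimate` is hypothetical, and rounding halves upward is an assumption, since the text states only that the average is rounded.

```python
import math


def tri_estimate(p, w):
    """Rounded weighted average of the statistics p with weight factors w."""
    if len(p) != len(w):
        raise ValueError("one weight factor is needed per statistic")
    total = sum(w)
    if total <= 0:
        raise ValueError("at least one weight factor must be positive")
    average = sum(wj * pj for wj, pj in zip(w, p)) / total
    return int(math.floor(average + 0.5))  # round to nearest, halves upward
```

A weight of zero simply removes the corresponding statistic from the estimate, which is how a choice of weight factors can leave out the conventional order estimate altogether.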

It was found that the weight factors cannot be chosen uniquely to achieve that, for 
a given true order, the order estimates most frequently match the correct order. It was 
also found that a choice of weight factors which achieves this for a particular true ARTS 
order is not necessarily likely to achieve it simultaneously for other true orders. Therefore, 
the numerical experiment was performed as follows. For the true order three, a large 
number of combinations of weight factors which gave largest or near-largest overall 
Table 3. Summary of the performance of tri-modification for ARTS order estimation, operating
on the three conventional methods, viz. FPE, AIC and MDL. For the non-numerical code for
estimated order, see text.

Weight factors (a): [0.25, 0.75, 0.25, 4.25, 2, 1]
               (b): [0.25, 0.5, 0.5, 4.25, 2, 1]
               (c): [0, 0.75, 0.5, 4, 2, 1]
               (d): [0.5, 0.5, 0.5, 4, 2, 1]

Symbols for methods: tri-modified FPE = TFPE, etc.


(1) Correct order = 3

Estimated    Weight factors (a)    Weight factors (b)
order code     TFPE  TAIC  TMDL      TFPE  TAIC  TMDL

C               750   749   750       722   721   730
L               161   161   233       161   161   233
H                89    90    17       117   118    37
N_s             945   944   917       945   945   917
N_O             868   867   902       840   839   882

Estimated    Weight factors (c)    Weight factors (d)
order code     TFPE  TAIC  TMDL      TFPE  TAIC  TMDL

C               750   749   750       722   721   730
L               161   161   233       161   161   233
H                89    90    17       117   118    37
N_s             945   944   917       945   944   919
N_O             868   867   902       840   839   884


(2) Correct order = 5

Estimated    Weight factors (a)    Weight factors (b)
order code     TFPE  TAIC  TMDL      TFPE  TAIC  TMDL

C               403   402   327       408   407   333
L               502   499   615       497   494   643
H                95    99    23        95    99    24
N_s             767   768   695       780   782   707
N_O             703   702   679       718   716   690

Estimated    Weight factors (c)    Weight factors (d)
order code     TFPE  TAIC  TMDL      TFPE  TAIC  TMDL

C               408   407   323       406   405   332
L               497   494   654       499   496   644
H                95    99    23        95    99    24
N_s             765   768   694       777   779   703
N_O             703   702   678       715   713   686


(3) Correct order = 7

Estimated    Weight factors (a)    Weight factors (b)
order code     TFPE  TAIC  TMDL      TFPE  TAIC  TMDL

C               358   366   211       368   372   212
L               592   589   780       592   579   779
H                50    55     9        40    49     9
N_s             654   654   489       654   656   488
N_O             612   612   480       622   618   479

Estimated    Weight factors (c)    Weight factors (d)
order code     TFPE  TAIC  TMDL      TFPE  TAIC  TMDL

C               369   373   219       368   374   214
L               592   579   780       592   579   779
H                39    48     9        40    47     7
N_s             652   654   488       652   654   488
N_O             621   617   479       620   620   481
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percentage of correct order estimation on the basis of the FPE, AIC and MDL loss 
functions of (2)-(4) were found. This was done on the basis of only one-fifth of the 1000 
ARTS models of order 3 to save computational effort. Some hopeful combinations 
among them were tested for simultaneous good performance for true order 7 and were 
selected on that basis. Again the test was restricted to only one-seventh of the 1000 
ARTS models generated. The results for the true order 5 were then taken as they came. 

The numerical comparison is offered in table 3. The comparison is made for four 
different choices of weight factors. In one of them, one of the weights is zero. That means 
only five determinants of order are used. More particularly, the one that is not used is 
the conventional order estimate, but it does not seem to matter much! This is a very
direct suggestion that the loss functions of equations (2)-(4) have more information
than, or parallel modes of information to, the one utilised by the conventional order
estimation methods.

What is provided in table 3 is basically the summary in terms of the non-numerical
symbols regarding estimated order, introduced in the context of table 1. Comparing the
relevant part of table 3 with that of table 1, it can be seen that for the correct ARTS
order of 3, the tri algorithm with all the four weight factors yields results better than the
conventional algorithm on the criterion C, though weight factors (b) and (d) do not do
as well as (a) and (c). For the correct ARTS order 5, no weight factor yields tri results
superior to the conventional methods on the criterion C. For the correct ARTS order 7,
all the weight factors yield tri results better than the FPE and AIC methods. For correct
order 7, the tri modification does better on the criterion $N_O$ but not on $N_s$ for the MDL
method. This too is understandable because, of the three conventional methods, MDL
has the strongest tendency to underestimate the order. On the criteria $N_s$ or $N_O$ for the
correct ARTS order 5, all the weight factors yield tri results better than those of the
conventional methods. Thus, taking C as the first criterion and $N_s$ and/or $N_O$ as
a supplementary criterion of considerable importance, it is possible to uniformly
improve the performance of the conventional order estimators at least for the range of
correct ARTS order considered. This uniformly better performance may not be the best
at individual values of the correct order but that is just as well because, if the correct
order is known, there is no order estimation problem; therefore order estimation
algorithms should work over the range of true orders expected.

It should be pointed out that the results reported here are not the best possible
because they are not based on proper optimization but only on an incomplete search. Also
the search for improvement has not been individualized for either the conventional
criteria of FPE, AIC and MDL or for criteria C, $N_s$ or $N_O$. Therefore, the results only
hint at the possible inadmissibility of the conventional methods, a method being
defined as inadmissible (Judge & Bock 1978) if it can be uniformly improved upon
without any claims about the optimality of what is proposed. The tri algorithm also
separates the order estimation problem from that of the estimation of ARTS coefficients.
Once the former is solved, the latter also can be reapproached using unconventional
techniques (Judge & Bock 1978; Vinod & Ullah 1981). Results along these lines
will be reported separately.


4. Conclusion 

Conventional methods of ARTS order estimation, viz. FPE, AIC and MDL were 
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assessed on the basis of 1000 synthetic models for true orders 3,5 and 7 each. A heuristic 
order estimation procedure is suggested which may do uniformly better than these,
though as a compromise by not doing the best possible at individual orders. The
procedure suggested uses three opinions from a conventional criterion and three
statistics based on them to derive the order estimate. The objectives of showing that

(a) the problem of ARTS order estimation is still an open problem and that there exist 
better possibilities than the conventional methods, 

(b) the distribution of the estimated order can be improved upon, and 

(c) heuristics may do better than the available axiological insights, 

are regarded as served. 


The authors are grateful to Dr H K Gupta of the National Geophysical Research 
Institute for encouragement. 
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industrial vision system 
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Abstract. In this paper we have presented some geometric techniques to 
characterize and parametrize surfaces of industrial parts in range images. 
The surfaces are characterized to one of plane, sphere, cylinder and cone, 
because they form the majority of object surfaces in man-made industrial 
parts. The problem has been studied for two different situations. In the first 
case, a priori knowledge about the surface shape is assumed. In such 
a situation the problem of surface characterization reduces to that of 
surface parameter estimation. The standard deviations of the estimated 
parameters give a measure of uncertainty of characterizing a surface patch 
to one of the four surface types. In the second case, no a priori information 
regarding the shape of a surface is available. This includes partially visible 
surfaces also. To deal with such a situation, a fuzzy classifier is designed 
using the uncertainty values. The fuzzy classifier classifies the unknown 
surface patch (including partially visible surfaces) to one of the four surface 
types. Experimental results with synthetic range images are presented to 
highlight the distinctive features of our technique. 

Keywords. Range image; geometric surface fitting; parameter estimation; 
fuzzy classification; occlusion. 

1. Introduction 

The major aim of any computer vision system is to recover useful information about the 
three dimensional world from huge image arrays of sensed values. In the past these 
image arrays were digital intensity images. At each point in an intensity image the 
brightness value encodes information about surface geometry, surface reflectance 
characteristics, surface texture, scene illumination, the distance from the camera to an 
object surface, the characteristics of the intervening medium and the camera characteristics.
However, such images do not contain explicit information about depth.
In recent years digital range images (also known as 2½-D images) have become
available owing to the development of various active and passive
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techniques. Range data are produced in the form of an array of numbers, where the 
numbers quantify the distance from the sensor focal plane to the object surface within 
the field of view along rays emanating from points on a regularly spaced grid. Thus the 
range images provide direct geometrical information about the shape of visible 
surfaces. Therefore, the process of recognising objects by their shape is less difficult in
range images than in intensity images. Such images find wide application in industrial 
automation such as assembly automation, automated inspection of industrial parts etc. 
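
As a small illustration of how such an array carries explicit geometry, the sketch below converts a range image into a list of 3-D points. It assumes parallel rays perpendicular to the focal plane and a uniform grid spacing, and it uses `None` to mark pixels without a range value; these conventions and the function name are assumptions of the sketch, not details of any particular sensor.

```python
def range_image_to_points(range_image, spacing=1.0):
    """Convert a range image (rows of depth values) into (x, y, z) points.

    The pixel at (row, col) is placed at x = col * spacing, y = row * spacing,
    and its range value is used directly as z. Pixels marked None are skipped.
    """
    points = []
    for row, values in enumerate(range_image):
        for col, depth in enumerate(values):
            if depth is not None:
                points.append((col * spacing, row * spacing, depth))
    return points
```

A sensor with diverging rays would need the ray direction of each pixel instead of a fixed grid offset, but the principle is the same: every valid pixel yields one surface point without any further image interpretation.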

Surface characterization and parameter estimation are two of the major problems of 
computer vision utilising range images in an industrial environment. The estimated
surface parameters can be used efficiently by the matching process for object recognition
and pose determination even in the presence of many undefined or occluded surface
regions. A number of approaches for surface characterization and parameter extraction
have been reported in the literature. The optimization and eigenvalue analysis
(Faugeras 1984) approach employs global analysis of 3-D or 2½-D (range) data. This
approach is more general in the sense that it is applicable to all sorts of quadrics and 
planar surfaces. Characterization by quadric invariants (e.g. sign of curvatures) (Besl 
& Jain 1984,1988; Brady et al 1985) employs computation of differential properties of 
the surface at each surface point. This technique requires spatial ordering of surface 
points. Surfaces are characterized according to the signs of mean and Gaussian 
curvatures. 

Though eigenvalues or quadric invariants are good discriminators in a mathematical
sense, they are often very sensitive to noise and quantization errors (Han et al 1987).
Moreover in man-made industrial parts, the majority of object surfaces belong to 
planar, spherical, cylindrical and conical surfaces (Bolle & Cooper 1984). Hence, for 
such applications it is not necessary to work with general quadric surfaces but rather to 
deal with a narrow domain of surface types, such as plane, sphere, cylinder and cone. 
Hough transform (Muller & Mohr 1984) or modified Hough transform (Lin & Wee 
1985) techniques have been used in such cases for surface characterization purposes. 

Parametric estimation technique by Han et al (1987) calculates the geometric 
properties of a sufficient set of shapes, namely plane, sphere and cylinder, based upon 
an analysis of the surface normals. This approach uses a mixture of histogramming and 
parameter-fitting techniques to estimate surface parameters. 

We have presented in this paper some new geometric techniques which use transformation
invariant properties of surface normals and surface tangents to obtain the
description of an unknown surface in terms of plane, sphere, cylinder or cone. This work 
presumes surfaces in range images are already segmented. Range image segmentation is 
not addressed here. A number of techniques for range image segmentation have been 
discussed by Biswas et al (1990). Unlike Han et al (1987) who use histogram analysis of 
surface normals to extract surface regions of a particular type, our technique uses mean 
and standard deviation of parameters characterizing different surfaces. The standard 
deviation of a parameter represents the measure of uncertainty of fitting a particular 
type of surface with respect to that parameter. The problem has been attempted for two 
different situations. In the first case, it has been assumed that a priori knowledge about 
the shape of the surface is available. In such situations, a surface of proper type is fitted 
to the set of points representing the surface patch and the parameters of the fitted 
surface are extracted. This strategy of selective surface fitting reduces the amount of 
computation. In the second case, when no a priori information regarding the shape of 
the unknown surface is available, the characterization of the surface is obtained in the
form of a fuzzy membership vector. A fuzzy classifier is designed using the uncertainty
measures which classifies an unknown surface to one of the surface types. 

Geometric surface fitting and parameter estimation is discussed in §2. Section 
3 describes the fuzzy classifier to classify an unknown surface. Occluded surface 
characterization is discussed in §4. 


2. Surface characterization through geometric method 

The geometric method described in this section is guided by a priori information 
regarding the shape of a surface patch to fit an appropriate surface and extract the 
parameters of the fitted surface. The parameters extracted for each of planar, spherical, 
cylindrical and conical surfaces are the surface normal and the perpendicular distance 
from the origin for a plane, the centre coordinates and the radius for a sphere, the axis 
and the radius for a cylinder and the apex, the apex angle and the axis for a cone. The 
geometric method of surface fitting for these surfaces is facilitated by the existence of
certain unique and transformation invariant properties of their surface normals and 
surface tangents as listed below: 

Plane: All surface normals are parallel (to the normal to the plane). 

Sphere: All surface normals are concurrent (at the centre of the sphere). 

Cylinder: All surface normals are perpendicular to the axis of the cylinder. 

Cone: All tangential planes to the surface intersect at a point (at the apex of the cone).

In the following discussion the input is taken to be a set of $M$ points in three-dimensional
space. The $i$th point is referred to as $X_i = (x_i, y_i, z_i)$ or equivalently by $\mathbf{X}_i$, the vector
joining the origin $(0,0,0)$ to $(x_i, y_i, z_i)$. The set of corresponding normals is taken to be
$\{\mathbf{N}_i \mid i = 1, 2, \ldots, M\}$. The normal $\mathbf{N}_i$ at point $X_i$ is computed by fitting a plane to the
$5 \times 5$ neighbourhood of $X_i$ using the least square error method (Taylor et al 1989).
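The normal estimation step can be illustrated with a minimal sketch. It assumes, as an illustration only, that the plane is modelled as z = a*x + b*y + c on a unit-spaced grid centred on the point; the fitting details of Taylor et al (1989) are not reproduced here, and the function name `estimate_normal` is hypothetical.

```python
import math

def estimate_normal(patch):
    """Unit normal at the centre of a square (odd-sized) range-image patch.

    patch[r][c] is the range value z at grid position (x = c, y = r).
    Assumption: least-squares fit of z = a*x + b*y + c on a unit grid.
    """
    size = len(patch)
    half = size // 2
    coords = [k - half for k in range(size)]      # e.g. -2..2 for a 5x5 patch
    # With centred coordinates the sums of x, y and x*y vanish, so the
    # least-squares slopes a and b decouple from each other and from c.
    sum_sq = size * sum(k * k for k in coords)    # sum of x^2 (or y^2) over the patch
    a = sum(coords[c] * patch[r][c]
            for r in range(size) for c in range(size)) / sum_sq
    b = sum(coords[r] * patch[r][c]
            for r in range(size) for c in range(size)) / sum_sq
    # The plane z - a*x - b*y - c = 0 has gradient (-a, -b, 1).
    length = math.sqrt(a * a + b * b + 1.0)
    return (-a / length, -b / length, 1.0 / length)
```

The decoupling of the slopes holds only because the neighbourhood is symmetric about its centre; an irregular neighbourhood would need the full normal equations.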


2.1 Characterization of a plane 


A plane is characterized by the direction of the normal to the plane and the perpendicular
distance of the plane from the origin (refer figure 1). The estimate of the normal to
the plane approximating the set of input points is given by

$$\mathbf{N}_{av} = \sum_{i=1}^{M} \mathbf{N}_i \Big/ \left| \sum_{i=1}^{M} \mathbf{N}_i \right|, \tag{1}$$


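where $\mathbf{N}_i$ is the unit normal vector at point $X_i$ and ‘$\Sigma$’ in (1) represents vector addition.
The uncertainty in the estimation of the surface normal is given by the standard
deviation $\sigma_N$ of $\mathbf{N}_{av}$ where

$$\sigma_N^2 = \frac{1}{M} \sum_{i=1}^{M} |\mathbf{N}_i - \mathbf{N}_{av}|^2, \tag{2}$$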


where $|\cdot|$ represents the magnitude of a vector. The smaller the value of $\sigma_N$, the more
reliable is the normal estimate. The perpendicular distance $d$ of the plane from the
origin and the standard deviation $\sigma_d$ of $d$ are given by

$$d = \frac{1}{M} \sum_{i=1}^{M} \mathbf{N}_{av} \cdot \mathbf{X}_i, \tag{3}$$

$$\sigma_d^2 = \frac{1}{M} \sum_{i=1}^{M} (\mathbf{N}_{av} \cdot \mathbf{X}_i - d)^2. \tag{4}$$

The values of $\sigma_N$ and $\sigma_d$ together reflect the uncertainty of fitting a plane to the set of
points.

Figure 1. Direction of normal to a plane and the perpendicular distance of the
plane from the origin.
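The plane characterization can be sketched in a few lines. The sketch assumes that both standard deviations are root-mean-square deviations about the mean (an assumption, since the printed formulas are only partly legible); the function name `fit_plane` is hypothetical.

```python
import math

def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def fit_plane(points, normals):
    """Return (n_av, sigma_n, d, sigma_d) for points with unit normals.

    Assumption: sigma_n and sigma_d are RMS deviations about the mean.
    """
    m = len(points)
    # Average normal: vector sum of the unit normals, renormalised.
    total = [sum(n[k] for n in normals) for k in range(3)]
    length = math.sqrt(_dot(total, total))
    n_av = tuple(c / length for c in total)
    # Spread of the individual normals about the average normal.
    sigma_n = math.sqrt(
        sum(sum((n[k] - n_av[k]) ** 2 for k in range(3)) for n in normals) / m
    )
    # Perpendicular distance: mean projection of the points on n_av.
    projections = [_dot(n_av, p) for p in points]
    d = sum(projections) / m
    sigma_d = math.sqrt(sum((q - d) ** 2 for q in projections) / m)
    return n_av, sigma_n, d, sigma_d
```

For noise-free points on a plane both standard deviations are zero, so any non-zero value directly reflects noise or a poor planar fit.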


2.2 Characterization of a sphere 


Ideally, all the surface normals pass through the centre of a sphere. But in practice, 
owing to the presence of noise in the range data and to computational round-off 
errors, such a unique point cannot be determined. Rather, the majority of surface 
normals pass through a small neighbourhood of the centre of the sphere. So, for 
every pair of nonparallel normals $\mathbf{N}_i$ and $\mathbf{N}_j$ (refer figure 2), the position of the line
perpendicular to both of them is determined (Shanti Narayan 1985). The length of
the line segment intercepted between the two normal vectors also gives the shortest
distance between them. The points of intersection $p_i$ and $p_j$ of this common perpendicular
with the two normals are taken as two estimates of the centre of the sphere. The mean
position of the centre $\mathbf{X}_c$ and the standard deviation $\sigma_X$ are computed from these
estimates. 

Once the position of the centre is found, the radius $R$ can be estimated as

$$R = \frac{1}{M} \sum_{i=1}^{M} |\mathbf{X}_i - \mathbf{X}_c|. \tag{5}$$

The standard deviations of the centre $\sigma_X$ and the radius $\sigma_R$ together give a measure of
uncertainty in fitting the sphere to the set of points.

Figure 2. Estimation of the centre of a sphere.
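The centre estimation from pairs of normals can be sketched as follows. Each normal is treated as a line through its surface point, and the feet of the common perpendicular of two such lines are found in closed form. The names `common_perpendicular_feet` and `fit_sphere`, the tolerance `eps`, and the use of RMS deviations are assumptions made for this illustration.

```python
import math
from itertools import combinations

def _sub(a, b):
    return tuple(x - y for x, y in zip(a, b))

def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def common_perpendicular_feet(p1, d1, p2, d2, eps=1e-9):
    """Closest points of the lines p1 + s*d1 and p2 + t*d2 (None if parallel)."""
    w = _sub(p1, p2)
    a, b, c = _dot(d1, d1), _dot(d1, d2), _dot(d2, d2)
    d, e = _dot(d1, w), _dot(d2, w)
    denom = a * c - b * b                 # zero when the lines are parallel
    if denom < eps:
        return None
    s = (b * e - c * d) / denom
    t = (a * e - b * d) / denom
    foot1 = tuple(p1[k] + s * d1[k] for k in range(3))
    foot2 = tuple(p2[k] + t * d2[k] for k in range(3))
    return foot1, foot2

def fit_sphere(points, normals):
    """Return (centre, sigma_x, radius, sigma_r); deviations are RMS values."""
    estimates = []
    for (pi, ni), (pj, nj) in combinations(list(zip(points, normals)), 2):
        feet = common_perpendicular_feet(pi, ni, pj, nj)
        if feet is not None:
            estimates.extend(feet)        # two centre estimates per pair
    if not estimates:
        raise ValueError("no pair of nonparallel normals found")
    k = len(estimates)
    centre = tuple(sum(e[a] for e in estimates) / k for a in range(3))
    sigma_x = math.sqrt(
        sum(_dot(_sub(e, centre), _sub(e, centre)) for e in estimates) / k
    )
    radii = [math.sqrt(_dot(_sub(p, centre), _sub(p, centre))) for p in points]
    radius = sum(radii) / len(radii)
    sigma_r = math.sqrt(sum((r - radius) ** 2 for r in radii) / len(radii))
    return centre, sigma_x, radius, sigma_r
```

The number of pairs grows quadratically with the number of sample points, which is one reason to compute normals only at a subset of points.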

2.3 Characterization of a cylinder 

The parameters of interest in this case are the orientation and position of the axis and 
the radius of the cylinder. Every normal on the surface of a cylinder is perpendicular to 
the direction of the axis. Thus the cross-product of any two nonparallel surface normals 
gives an estimate of the orientation of the cylinder axis. The cross-products are 
calculated for all possible pairs of nonparallel surface normals. The average orientation 
of all these cross-products determines the candidate axis direction. Thus, $\forall\, i, j$ where $\mathbf{N}_i$
and $\mathbf{N}_j$ are nonparallel, the $k$th estimate of axis orientation is given by the vector $\boldsymbol{\tau}_k$ where

$$\boldsymbol{\tau}_k = (\mathbf{N}_i \times \mathbf{N}_j) / |\mathbf{N}_i \times \mathbf{N}_j|. \tag{6}$$

From these estimates the average orientation $\boldsymbol{\tau}_{av}$ and the standard deviation $\sigma_\tau$ are
computed. The position of the axis is determined by finding a point on the axis, which is
done in the following manner.

At each point $X_i$, the plane on which $\mathbf{X}_i$, $\mathbf{N}_i$ and $\boldsymbol{\tau}_{av}$ lie is determined. The intersection
of two such nonparallel planes (Shanti Narayan 1985) gives an estimate of the axis
position (refer figure 3). From these estimates the average estimate $\mathbf{X}_{av}$ and the standard
deviation $\sigma_X$ are computed.

A point to be noted is that to get an estimate of the same point on the axis in all cases, 
the estimates are considered for the point at which the cylinder axis intersects the x = 0 
plane if axis is not parallel to that plane, else the y = 0 plane if axis is not parallel to that 
plane, else the z = 0 plane. 

Once the orientation and the position of the axis are fixed, at each point $X_i$ the
radius $R_i$ of the cylinder (refer figure 4) can be estimated as follows.

$$R_i = |\mathbf{X}_i - \mathbf{X}_{av}| \sin\theta = |(\mathbf{X}_i - \mathbf{X}_{av}) \times \boldsymbol{\tau}_{av}|. \tag{7}$$

From the different estimates $R_i$ we compute $R_{av}$ and $\sigma_R$.
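The axis orientation and radius estimates for a cylinder can be sketched as below. Two points are assumptions of this sketch rather than statements from the text: the cross products are flipped to a common sense before averaging (since swapping the two normals reverses the sign of their cross product), and a point on the axis is taken as given. The names `axis_direction` and `radius_estimate` are hypothetical.

```python
import math
from itertools import combinations

def _sub(a, b):
    return tuple(x - y for x, y in zip(a, b))

def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def _norm(a):
    return math.sqrt(_dot(a, a))

def axis_direction(normals, eps=1e-6):
    """Average of the normalised cross products of nonparallel normal pairs."""
    estimates = []
    for ni, nj in combinations(normals, 2):
        c = _cross(ni, nj)
        length = _norm(c)
        if length < eps:                  # skip (nearly) parallel normals
            continue
        t = tuple(x / length for x in c)
        # Assumption: align every estimate with the first one before averaging.
        if estimates and _dot(t, estimates[0]) < 0:
            t = tuple(-x for x in t)
        estimates.append(t)
    if not estimates:
        raise ValueError("no pair of nonparallel normals found")
    total = [sum(t[k] for t in estimates) for k in range(3)]
    length = _norm(total)
    t_av = tuple(x / length for x in total)
    sigma_t = math.sqrt(
        sum(_dot(_sub(t, t_av), _sub(t, t_av)) for t in estimates) / len(estimates)
    )
    return t_av, sigma_t

def radius_estimate(points, axis_point, t_av):
    """R_i = |(X_i - X_av) x t_av|; returns the mean and RMS deviation."""
    radii = [_norm(_cross(_sub(p, axis_point), t_av)) for p in points]
    r_av = sum(radii) / len(radii)
    sigma_r = math.sqrt(sum((r - r_av) ** 2 for r in radii) / len(radii))
    return r_av, sigma_r
```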

The standard deviations of the axis orientation $\sigma_\tau$, axis position $\sigma_X$ and the radius $\sigma_R$
together give a measure of the uncertainty of fit.

However in practice, such a unique point cannot be determined due to the same reasons
as stated earlier. Hence, the apex is estimated in the following manner. For all points $X_i$
with normals $\mathbf{N}_i$, the equation of the plane tangent to the surface at $X_i$ is obtained.
Taking the intersection of three such pairwise nonparallel planes (Shanti Narayan
1985) an estimate $\mathbf{V}_k$ of the apex is obtained. The average of all such estimates gives the
mean apex position $\mathbf{V}_{av}$. The uncertainty in the estimation of the position of the apex is
given by the standard deviation $\sigma_V$ of the apex.

Once the position of the apex ($\mathbf{V}_{av}$) is fixed, the axis of the conical surface is estimated
in the following manner. $\forall\, i$, $1 \le i \le M$, the normal $\mathbf{h}_i$ to the plane containing the points
$\mathbf{V}_{av}$, $\mathbf{X}_i$ and $\mathbf{N}_i$ (normal to the surface at $X_i$) is determined as follows (refer figure 5).

$$\mathbf{h}_i = (\mathbf{X}_i - \mathbf{V}_{av}) \times \mathbf{N}_i. \tag{8}$$

The orientation of the axis of the cone is given by the cross product of two such
nonparallel vectors $\mathbf{h}_i$ and $\mathbf{h}_j$, where $1 \le i, j \le M$ and $i \ne j$. Thus $\forall\, i, j$, where $\mathbf{h}_i$ and $\mathbf{h}_j$ are
not parallel, the $k$th estimate of the axis orientation is given by the vector $\boldsymbol{\tau}_k$ where

$$\boldsymbol{\tau}_k = (\mathbf{h}_i \times \mathbf{h}_j) / |\mathbf{h}_i \times \mathbf{h}_j|. \tag{9}$$

From these estimates the expected orientation $\boldsymbol{\tau}_{av}$ and the standard deviation $\sigma_\tau$ are
computed.

Once the apex $\mathbf{V}_{av}$ and the axis orientation are available, for each point $X_i$ an estimate
of the semi apex angle $\gamma_i$ is obtained from

$$\sin\gamma_i = |(\mathbf{X}_i - \mathbf{V}_{av}) \times \boldsymbol{\tau}_{av}| / |\mathbf{X}_i - \mathbf{V}_{av}|. \tag{10}$$

From the different estimates of $\gamma_i$ we can compute the average semi apex angle $\gamma_{av}$ and the standard
deviation $\sigma_\gamma$. The standard deviations of the apex, the axis and the semi apex angle
together give a measure of uncertainty in fitting the conical surface to the set of points.
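The apex and semi apex angle estimates can be sketched as follows. Each tangent plane is written as N_i . P = N_i . X_i, and three such planes are intersected by Cramer's rule; the choice of Cramer's rule, the tolerance `eps`, the RMS form of the standard deviation and all function names are assumptions of this sketch. The angle is measured from the apex, i.e. the denominator is the distance of the point from the apex.

```python
import math
from itertools import combinations

def _sub(a, b):
    return tuple(x - y for x, y in zip(a, b))

def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def _norm(a):
    return math.sqrt(_dot(a, a))

def _det3(m):
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def intersect_three_planes(planes, eps=1e-9):
    """planes: three (normal, point) pairs. Returns the common point or None."""
    a = [list(n) for n, _ in planes]
    rhs = [_dot(n, p) for n, p in planes]
    det = _det3(a)
    if abs(det) < eps:                    # planes do not meet in a single point
        return None
    solution = []
    for col in range(3):                  # Cramer's rule, one column at a time
        m = [row[:] for row in a]
        for r in range(3):
            m[r][col] = rhs[r]
        solution.append(_det3(m) / det)
    return tuple(solution)

def estimate_apex(points, normals):
    """Mean and RMS spread of the apex estimates from all plane triples."""
    estimates = []
    for triple in combinations(list(zip(normals, points)), 3):
        v = intersect_three_planes(triple)
        if v is not None:
            estimates.append(v)
    if not estimates:
        raise ValueError("no usable triple of tangent planes found")
    k = len(estimates)
    v_av = tuple(sum(e[a] for e in estimates) / k for a in range(3))
    sigma_v = math.sqrt(
        sum(_dot(_sub(e, v_av), _sub(e, v_av)) for e in estimates) / k
    )
    return v_av, sigma_v

def semi_apex_angles(points, apex, axis):
    """Angle (radians) between each apex-to-point vector and the unit axis."""
    angles = []
    for p in points:
        v = _sub(p, apex)
        s = _norm(_cross(v, axis)) / _norm(v)
        angles.append(math.asin(min(1.0, s)))
    return angles
```

For a nearly planar patch almost every determinant falls below the tolerance, which mirrors the observation that cone fitting tends to fail at the apex stage for planes.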


Figure 5. Estimation of the axis of a cone.


Table 1. Geometric surface fitting, 
a) Plane samples.

2.5 Experimental results

Experimental results on synthetic range images of size 50 x 50 are presented. To reduce 
the amount of computation, surface normals are computed at some sample points, 
distributed uniformly over the surface patch. At each sample point, surface normal is 
estimated by fitting a plane to a 5 x 5 neighbourhood of the point, using least square 
error method. A set of five images from each of the classes plane, sphere, cylinder and 
cone are considered. For each sample the corresponding surface is fitted geometrically. 
The results obtained are tabulated in table 1. 

It is observed from table 1 that the percentage errors in the estimated parameter
values are quite low for plane and sphere fitting (< 3.5%). For the cylinder samples the
percentage errors in the estimations of axis orientation are low (< 3.6%) and also in the
estimations of radius (< 3.3%). Estimation of a point on the axis has given a large error
in most cases (10.6 to 24%). This is due to the fact that even a slight error in estimating the
axis orientation can significantly change the estimated point position. In the case of
cone samples, the error in estimation of axis orientation is quite low (1.5 to 11.5%). Angle
estimation also has low percentage error (1.5 to 16.1%). The error in estimation of apex
position has shown wide variation (2.9% to 32.2%). From the results on synthetic
range images it can be said that errors in an orientation estimation (e.g., axis, normal) or
a distance estimation (e.g., radius) have been less as compared to estimation of a point
position (e.g., apex, centre).


3. Fuzzy characterization of surfaces 

In the previous section on geometric surface fitting it has been shown how an 
appropriate surface may be fitted when the input image is of planar, spherical, 
cylindrical or conical types. The implicit assumption has been that a priori information 
regarding the surface nature is available. But even in the absence of a priori information 
the geometric surface fitting proves to be useful. In such cases all four surfaces are fitted 
to the set of points and in each case the uncertainties, as given by the standard 
deviations of the parameters, are noted. These measures of uncertainty in parameter 
estimation are used to classify the surface as a planar or spherical or cylindrical or 
conical surface. The details of the method involved are discussed in this section. 

The simplest strategy to design the classifier is to assign the input image to that class 
whose parameters have been estimated with the least amount of uncertainty. However, the 
solution is not so straightforward because parameters for different types of surfaces are 
obtained with different methods of estimation. Also the numbers of parameters for different 
types of surfaces are different. This has led to the development of a fuzzy classifier, where 
two thresholds are set for the standard deviation of each parameter of a particular 
surface. The lower threshold denotes the maximum uncertainty expected when the surface 
patch in the input image is of the same type as the surface fitted. The upper threshold 
denotes the least amount of uncertainty expected when the input is from any class other 
than that of the surface fitted. The input is assigned a membership to the class by 
comparing the standard deviation values obtained for a parameter of the fitted surface 
with the corresponding thresholds. Fuzziness in the classification is introduced by 
grading the membership if a standard deviation value falls between the two thresholds. 
Finally, the membership values obtained with each parameter of a fitted surface type are
combined to get an overall membership. This procedure is repeated for each of the four
surface types namely, planar, spherical, cylindrical and conical surfaces. The result is a 
membership vector giving the membership of the input surface with respect to the four 
different classes. The vector itself can be taken as the characterization of the surface. 
Alternatively, the sample is classified into the class with the maximum membership value 
(the dominant class) and characterized by the corresponding parameters. Designing the 
classifier involves finding the thresholds, the details of which are discussed below. 

3.1 Classifier design 

A set of images, henceforth referred to as training images or samples, are taken from 
each of the four classes. All four surfaces are fitted geometrically to each of the samples. 
In each case the parameters and their standard deviations are noted. The fact that for 
any particular type of fitted surface, samples from the same class should yield lower 
standard deviation values as compared to samples from other classes, is used to obtain 
the lower thresholds. If $\theta$ is a parameter of a class denoted by class($\theta$), then the lower
threshold for $\theta$ is defined as

$$lo_\theta = \mathrm{mean}(\sigma_{\theta_i}) + \mathrm{standard\ deviation}(\sigma_{\theta_i}), \tag{11}$$

where

$i = 1..M$ is the set of sample images from class($\theta$),

$\theta_i$ = value of parameter $\theta$ for the $i$th sample,

$\sigma_{\theta_i}$ = standard deviation in estimating $\theta_i$.

The upper threshold for parameter $\theta$ can be set by considering the standard deviations
obtained with sample images from all classes other than class($\theta$). Thus the upper
threshold is defined as

$$hi_\theta = \min_k(\sigma_{\theta_k}), \tag{12}$$

for all $k$, where $k = 1..N$ is the set of samples not belonging to class($\theta$) and $\sigma_{\theta_k} > lo_\theta$.

The value of $lo_\theta$ has been selected considering the distribution of standard deviation
values of the samples from the same class as that of the surface fitted. The value of $hi_\theta$
has been selected in order to reduce the misclassification of samples from other classes
into the class of the surface fitted.

The membership function for class($\theta$) with respect to the parameter $\theta$ can be defined
as $\mu_\theta$ where

$$\mu_\theta = \begin{cases} 1, & \text{if } \sigma_\theta \le lo_\theta, \\ (hi_\theta - \sigma_\theta)/(hi_\theta - lo_\theta), & \text{if } lo_\theta < \sigma_\theta < hi_\theta, \\ 0, & \text{if } hi_\theta \le \sigma_\theta. \end{cases} \tag{13}$$

The membership function $\mu_\theta$ is graphically shown in figure 6.

Overall membership of a sample in a class can now be defined in terms of the
membership with respect to individual parameters of the class. For a class $\omega$, if
$\theta_d$, $d = 1..D_\omega$, are the parameters which characterize the class, the overall class membership
$\mu_\omega$ for a sample is given by

$$\mu_\omega = \frac{1}{D_\omega} \sum_{d=1}^{D_\omega} \mu_{\theta_d}, \tag{14}$$

where the class $\omega$ represents one of {plane, sphere, cylinder, cone} and $\mu_{\theta_d}$ is the
membership obtained for parameter $\theta_d$ as per (13).

Figure 6. Fuzzy membership function ($\mu_\theta$).
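The threshold and membership definitions can be sketched directly. The sketch assumes that the "standard deviation" in the lower threshold is the population form (divisor M); the function names and the sample standard deviations in the usage lines are hypothetical, while the thresholds in the usage lines are the cone thresholds of table 3.

```python
import math

def lower_threshold(sigmas_same_class):
    """lo = mean + standard deviation of the s.d. values of same-class samples."""
    m = len(sigmas_same_class)
    mean = sum(sigmas_same_class) / m
    # Assumption: population standard deviation (divide by m, not m - 1).
    sd = math.sqrt(sum((s - mean) ** 2 for s in sigmas_same_class) / m)
    return mean + sd

def upper_threshold(sigmas_other_classes, lo):
    """hi = smallest s.d. value from other-class samples that exceeds lo."""
    return min(s for s in sigmas_other_classes if s > lo)

def membership(sigma, lo, hi):
    """Graded membership for one parameter: 1 below lo, 0 above hi."""
    if sigma <= lo:
        return 1.0
    if sigma >= hi:
        return 0.0
    return (hi - sigma) / (hi - lo)

def overall_membership(sigmas, thresholds):
    """Average of the per-parameter memberships of one fitted surface type."""
    values = [membership(s, lo, hi) for s, (lo, hi) in zip(sigmas, thresholds)]
    return sum(values) / len(values)

# Hypothetical s.d. values for a cone fit (apex, axis, angle): the first two
# fall below their lower thresholds, the angle exceeds its upper threshold.
cone_thresholds = [(53.15, 64.87), (0.39, 0.46), (0.07, 0.30)]
print(round(overall_membership([40.0, 0.30, 0.50], cone_thresholds), 2))  # prints 0.67
```

Averaging keeps the overall membership in [0, 1] regardless of how many parameters a surface type has, which makes the four class memberships comparable.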


3.2 Experimental results and discussions 

To design the classifier five training samples from each of the four classes are taken. The 
standard deviations of different parameters obtained when all the four surfaces are 
fitted to each training sample are listed in table 2. The lower and upper thresholds for 
each parameter are tabulated in table 3. 

Once the classifier is designed, classification results for other images from the four 
classes (henceforth referred to as test samples) are presented. For each test sample all 
the four surfaces were fitted and the standard deviations of the various parameters were 
noted. From the standard deviations, the membership values with respect to the
different classes were derived using the corresponding membership functions ((13) and
(14)). The results are tabulated in table 4. The table also shows the dominant class
displayed by each of these test samples. It may be seen that only one sample out of 20 
has been misclassified. Another point of interest is the nature of the membership vectors 
for test samples from various classes. Some planar test samples have shown conical 
behaviour as well (although not dominant). When a cone is to be fitted to a surface, each 
apex estimate is formed from three tangential planes where the line of intersection of 
two planes is not parallel to the third. Ideally, for a planar surface, all surface normals 
are parallel. Hence, fitting a cone fails at the stage of finding the apex itself, since no 
suitable combinations of three pairwise nonparallel tangential planes can be found. 
This conical behaviour is seen only in cases where the effects of quantization noise in 
the input image are so severe that it has been possible to find combinations of three 
normals to find apex estimates. In all these cases the semi apex angle is close to 90° and 
axis of the cone is normal to the plane. Thus conical behaviour exhibited by the planar 
samples is acceptable since any plane can be considered as a cone with semi apex angle 
of 90°. Sphere samples also have exhibited conical behaviour. Sphere samples actually 
represent the visible portion of a sphere which may be a hemisphere or a part of it. In all

Table 2. Standard deviations of the parameters obtained when all the four surface types are
fitted to the training samples.

a) Planar training samples.

-: unable to fit surface; plane: σ_N s.d. of the normal direction, σ_d s.d. of perpendicular distance from
origin; sphere: σ_X s.d. of the centre, σ_R s.d. of the radius; cylinder: σ_τ s.d. of axis orientation, σ_X s.d.
of a point on the axis, σ_R s.d. of the radius; cone: σ_V s.d. of the apex, σ_τ s.d. of the axis orientation,
σ_θ s.d. of the semi apex angle.

                          Type of surface fitted

         Plane          Sphere           Cylinder                  Cone
Sample   σ_N    σ_d     σ_X      σ_R     σ_τ    σ_X       σ_R      σ_V     σ_τ    σ_θ
PL-1     0.00   0.00    -        -       -      -         -        -       -      -
PL-2     0.02   0.27    406.42   6.98    0.54   1248.51   8.32     -       -      -
PL-3     0.01   0.22    -        -       0.40   642.61    3.74     -       -      -
PL-4     0.00   0.14    -        -       0.44   826.92    4.62     -       -      -
PL-5     0.01   0.22    -        -       0.81   653.74    3.89     81.42   0.98   0.92

b) Spherical training samples.

-: unable to fit surface; plane: σ_N s.d. of the normal direction, σ_d s.d. of perpendicular distance from
origin; sphere: σ_X s.d. of the centre, σ_R s.d. of the radius; cylinder: σ_τ s.d. of axis orientation, σ_X s.d.
of a point on the axis, σ_R s.d. of the radius; cone: σ_V s.d. of the apex, σ_τ s.d. of the axis orientation,
σ_θ s.d. of the semi apex angle.

                          Type of surface fitted

         Plane          Sphere         Cylinder                Cone
Sample   σ_N    σ_d     σ_X    σ_R     σ_τ    σ_X     σ_R      σ_V      σ_τ    σ_θ
SP-1     0.74   3.78    3.11   0.28    0.84   494.2   2.27     64.87    0.72   0.41
SP-2     0.67   5.77    7.21   0.35    0.81   227.1   3.38     126.77   0.78   0.73
SP-3     0.78   2.89    1.76   0.17    0.85   197.8   1.90     98.39    0.76   0.68
SP-4     0.73   4.77    3.02   0.20    0.83   522.2   3.08     95.25    0.74   0.4
SP-5     0.49   3.42    2.95   0.25    0.79   301.0   2.40     86.12    0.97   0.44

c) Cylinder training samples.

-: unable to fit surface; plane: σ_N s.d. of the normal direction, σ_d s.d. of perpendicular distance from
origin; sphere: σ_X s.d. of the centre, σ_R s.d. of the radius; cylinder: σ_τ s.d. of axis orientation, σ_X s.d.
of a point on the axis, σ_R s.d. of the radius; cone: σ_V s.d. of the apex, σ_τ s.d. of the axis orientation,
σ_θ s.d. of the semi apex angle.

                          Type of surface fitted

         Plane          Sphere           Cylinder               Cone
Sample   σ_N    σ_d     σ_X     σ_R      σ_τ    σ_X     σ_R     σ_V      σ_τ    σ_θ
CL-1     0.57   3.27    19.20   5.14     0.00   10.57   0.72    -        -      -
CL-2     0.44   3.97    39.51   3.56     0.30   45.98   0.30    295.45   0.46
CL-3     0.59   2.16    56.66   9.02     0.33   62.00   0.58    129.59   0.80
CL-4     0.57   5.01    55.09   12.37    0.25   48.59   0.41    463.24   0.77
CL-5     0.36   3.57    85.91   39.63    0.18   96.45   0.19    629.80   0.82

Table 2. (Continued)

d) Conical training samples.

-: unable to fit surface; plane: σ_N s.d. of the normal direction, σ_d s.d. of perpendicular distance from
origin; sphere: σ_X s.d. of the centre, σ_R s.d. of the radius; cylinder: σ_τ s.d. of axis orientation, σ_X s.d.
of a point on the axis, σ_R s.d. of the radius; cone: σ_V s.d. of the apex, σ_τ s.d. of the axis orientation,
σ_θ s.d. of the semi apex angle.

                          Type of surface fitted

         Plane          Sphere           Cylinder                 Cone
Sample   σ_N    σ_d     σ_X     σ_R      σ_τ    σ_X      σ_R      σ_V     σ_τ    σ_θ
CN-1     0.30   9.5     66.42   3.15     0.64   158.34   1.24     66.67   0.42   0.06
CN-2     0.74   3.1     45.31   94.11    0.44   164.12   21.73    20.41   0.12   0.01
CN-3     0.43   2.15    32.8    18.84    0.73   330.0    4.21     24.37   0.31
CN-4     0.50   3.29    35.30   2.86     0.63   784.8    7.93     20.46   0.36
CN-5     0.48   3.64    47.79   4.57     0.74   105.29   2.47     43.88   0.24

these cases, the axis of the fitted cone is perpendicular to the plane of the image, i.e., 
along the z-axis. The standard deviations of the apex and the axis are low enough to fall 
below the corresponding lower threshold. These two parameters thus contribute 
a membership value of 1 each. The third parameter, the apex angle, has given a high 
standard deviation above the higher threshold. Thus contribution of this parameter to 
overall membership function is 0. Thus an overall membership of 0.67 has resulted for
these samples when a cone is fitted. The 0.33 membership with respect to the cylindrical
class shown by sphere samples can be explained similarly. The point estimate has
given a low standard deviation below the lower threshold in all the cases while the other 
two parameters have crossed the higher threshold. Cylinder samples have shown no 
other behaviour significantly apart from cylindrical behaviour. Two cone samples have 
shown some amount of planar behaviour out of which one cone sample has been 
misclassified as planar because in that case value of the semiapex angle was very high. 


Table 3. Lower and upper thresholds of different parameters for different surface classes.

PL = planar surface; SP = spherical surface; CL = cylindrical surface; CN = conical surface.

Surface                    Lower        Upper
type      Parameters       threshold    threshold
PL        Normal N         0.015        0.30
          Distance d       0.28         2.15
SP        Centre X_c       5.48         19.2
          Radius R         0.31         2.86
CL        Axis τ           0.33         0.40
          Point X          80.42        105.29
          Radius R         0.63         1.24
CN        Apex V           53.15        64.87
          Axis τ           0.39         0.46
          Angle θ          0.07         0.30

Table 4. Fuzzy classification of plane, sphere, cylinder and cone samples.

           Membership vector            Dominant   Correctly classified
Sample     PL     SP     CL     CN      class      (Y/N)
PL-6       0.93   0.06   0.00   0.67    PL         Y
PL-7       1.00   0.00   0.00   0.67    PL         Y
PL-8       0.98   0.38   0.00   0.88    PL         Y
PL-9       0.94   0.00   0.00   0.67    PL         Y
PL-10      1.00   0.00   0.00   0.00    PL         Y
SP-6       0.00   1.00   0.33   0.67    SP         Y
SP-7       0.00   1.00   0.33   0.67    SP         Y
SP-8       0.00   1.00   0.33   0.67    SP         Y
SP-9       0.00   1.00   0.33   0.67    SP         Y
SP-10      0.10   1.00   0.33   0.67    SP         Y
CL-6       0.00   0.00   1.00   0.00    CL         Y
CL-7       0.00   0.00   1.00   0.00    CL         Y
CL-8       0.07   0.00   1.00   0.30    CL         Y
CL-9       0.00   0.00   1.00   0.00    CL         Y
CL-10      0.00   0.00   0.54   0.22    CL         Y
CN-6       0.00   0.00   0.03   0.33    CN         Y
CN-7       0.00   0.00   0.00   0.52    CN         Y
CN-8       0.00   0.09   0.00   0.41    CN         Y
CN-9       0.53   0.00   0.00   1.00    CN         Y
CN-10      0.59   0.00   0.00   0.22    PL         N

Abbreviations as in table 3.


Classification results for general quadratic surfaces not belonging to any of the four 
classes, plane, sphere, cylinder and cone, are also presented. The aim is to see if a general 
pattern of membership vectors is derivable for a set of samples from the same quadratic 
class. This may be useful in characterising that quadratic class. The results of this 
experimentation are given in table 5. The classification based on the dominant class 
behaviour is also indicated in the table. It is interesting to note that samples from the 
same quadratic class show the same dominant class behaviour. For example cylindrical 
behaviour is predominantly shown by samples from elliptical and hyperbolic cylinder 
classes, while ellipsoids are predominantly spherical, hyperboloids are predominantly 
conical. 


4. Occluded surface characterization 

In a range image with more than one object, usually some objects may be partially 
occluded by the others. This makes the problem of understanding range images 
more difficult since certain regions in the image representing surfaces of objects 
may be missing or partially occluded. Since the object surfaces are only partially visible
owing to occlusion, local surface properties should be used to characterize them. The
geometric method for surface characterization, discussed in §2, obtains surface
descriptions through surface normal analysis. Since the normal to a surface at a point
is the local property of the surface at the point with respect to the point's neighbourhood,
the geometric method also forms a reliable tool for characterizing partially visible
surfaces.

Table 5. Fuzzy classification of general quadric surface samples.

                                   Membership vector            Dominant
Surface type             Sample    PL     SP     CL     CN      class
Elliptic cylinder        1         0.00   0.19   0.67   0.00    CL
                         2         0.40   0.00   1.00   0.33    CL
                         3         0.00   0.19   0.67   0.00    CL
                         4         0.43   0.15   1.00   0.00    CL
                         5         0.00   0.00   0.33   0.00    CL
Hyperbolic cylinder      1         0.00   0.00   0.33   0.00    CL
                         2         0.07   0.28   1.00   0.00    CL
                         3         0.00   0.00   0.67   0.00    CL
                         4         0.00   0.09   0.78   0.00    CL
                         5         0.00   0.00   0.33   0.00    CL
Ellipsoid                1         0.00   0.40   0.33   0.33    SP
                         2         0.00   0.53   0.33   0.25    SP
                         3         0.11   0.93   0.33   0.33    SP
                         4         0.16   0.93   0.33   0.36    SP
                         5         0.22   0.99   0.46   0.67    SP
Elliptic cone            1         0.00   0.00   0.00   1.00    CN
                         2         0.27   0.55   0.00   1.00    CN
                         3         0.00   0.00   0.11   1.00    CN
                         4         0.00   0.00   0.00   0.67    CN
                         5         0.00   0.00   0.00   0.33    CN
Hyperboloid              1         0.00   0.00   0.00   0.33    CN
                         2         0.00   0.00   0.00   0.33    CN
                         3         0.00   0.00   0.00   0.33    CN
                         4         0.00   0.00   0.31   0.33    CN
                         5         0.00   0.00   0.00   0.33    CN
Hyperbolic paraboloid    1         0.00   0.00   0.00   0.33    CN
                         2         0.00   0.00   0.00   0.27    CN
                         3         0.00   0.00   0.00   0.33    CN
                         4         0.00   0.00   0.00   0.33    CN
                         5         0.00   0.00   0.00   0.33    CN

Abbreviations as in table 3.

The shape of a surface (i.e., planar, spherical, cylindrical or conical) is uniquely 
determined by the global parameters of the surface. For example, a spherical surface is 
uniquely characterized by its centre and radius. The geometric method estimates the 
global surface parameters from the locally calculated surface normals. Since the 
estimation of global parameters at different regions of a surface should be very close to 
each other, it is also possible to determine if two or more partially visible surface 
patches are parts of the same surface. This is achieved in the following manner. 

Let a surface class $\omega$, $\omega \in$ {plane, sphere, cylinder, cone}, be characterized by a set
of parameters $\{\theta_d \mid 1 \le d \le D_\omega\}$, where $D_\omega$ is a constant whose value is different for
different surface classes. This set of parameters defines a $D_\omega$-dimensional space, henceforth
referred to as parameter space, such that the mean values of the parameters
describing a surface $S_i$ of class $\omega$ represent a point $p_i$ in the parameter space. Because
of the uncertainty in the measurement of $\theta_d$, different estimates of $\theta_d$ may occur
anywhere in the region $(\theta_d - hi_{\theta_d})$ to $(\theta_d + hi_{\theta_d})$ where $hi_{\theta_d}$ is the upper threshold used in
the fuzzy classifier and is given in (12). Thus different estimates of the point $p_i$ may occur
anywhere within a hyper-rectangular parallelepiped bounded within the region
$[\theta_d - hi_{\theta_d}, \theta_d + hi_{\theta_d}]$ for $1 \le d \le D_\omega$. This region is referred to as the region of occurrence
of $S_i$. Let this region for surface $S_i$ be denoted by $R_i$. Now two partially visible surface
patches $S_i$ and $S_j$, both of the same surface type $\omega$, are assumed to be possible parts of the
same surface if $R_i \cap R_j \ne \emptyset$. If this condition is satisfied then surfaces are fitted over the
region $S_i \cup S_j$. If the fitted surface also shows a dominant class $\omega$ then $S_i$ and $S_j$ are
identified as parts of the same surface and the parameters of the fitted surface are
extracted. If the fitted surface shows a dominant class other than $\omega$ then $S_i$ and $S_j$ are
identified as different surfaces.
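The region-of-occurrence test amounts to an interval overlap check along every parameter axis. The sketch below assumes that each parameter is a scalar and, purely for illustration, treats the three centre coordinates of a sphere as separate parameters sharing the centre threshold; the function names are hypothetical.

```python
def region_of_occurrence(means, upper_thresholds):
    """One interval [theta_d - hi_d, theta_d + hi_d] per parameter."""
    return [(m - h, m + h) for m, h in zip(means, upper_thresholds)]

def regions_intersect(region_a, region_b):
    """Two hyper-rectangles intersect only if they overlap on every axis."""
    return all(lo_a <= hi_b and lo_b <= hi_a
               for (lo_a, hi_a), (lo_b, hi_b) in zip(region_a, region_b))

# Sphere parameters (cx, cy, cz, R) with the sphere thresholds of table 3.
# Assumption: the centre threshold is applied to each coordinate separately.
sphere_hi = [19.2, 19.2, 19.2, 2.86]
outer = region_of_occurrence([63.0, 63.0, 150.0, 55.0], sphere_hi)
inner = region_of_occurrence([63.0, 63.0, 150.0, 30.0], sphere_hi)
print(regions_intersect(outer, inner))  # prints False
```

With a common centre and radii of 55 and 30 the radius intervals are disjoint, so the two patches are not candidates for a single surface. An overlap is only a necessary condition: the combined patch must still be refitted and show the same dominant class.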

4.1 Experimental results 

Experimental results on six synthetic range images, each containing occluded surfaces, 
are presented. All the synthetic images are of size 128 x 128 and are shown in 
figures 7a-7f. These images are referred to as image-1, image-2, image-3, image-4, 
image-5 and image-6 respectively. Boundaries of the surface patches in these images are 
shown in figures 8a-f respectively, where each surface region is marked with an integer. 
Each of these images contains one or more occluded surface patches. The fuzzy
classification results are shown in table 6. The table shows the original class and the 
original parameters of the surfaces, the dominant class of the fitted surfaces and the 
estimated parameters when surfaces of respective dominant classes are fitted. 

From table 6 it is seen that the occluded surfaces, i.e., the cone in image-1 (figure 7a), 
the cylinder in image-2 (figure 7b) and the sphere in image-3 (figure 7c), are correctly 
classified and the estimated parameters of the fitted surfaces closely approximate the 
original parameters. Image-4 (figure 7d) contains two partially visible hemispherical 
surfaces (surface patches 2 and 4 in figure 8d), one with radius 55 and the other with 
radius 30. Both of them have the same centre (63,63,150). Both these hemispherical 
surfaces are correctly classified and parametrized individually. However, since the region of
occurrence of one hemisphere does not intersect that of the other, they are not
considered as parts of a single surface. In image-5 (figure 7e), by contrast, the
regions of occurrence of two partially occluded hemispheres (surface 2 with radius 50
and surface 4 with radius 55 in figure 8e) do intersect, but the combined surface shows conical
behaviour although each patch is individually dominantly spherical. Hence they also cannot
comprise a single surface. Two partially visible conical surface patches (2 and 4 in
figure 8f) in image-6 (figure 7f) together construct a single conical surface whose 
estimated parameters closely approximate the estimated parameters of the individual 
surfaces and also those of the original surface. The cylindrical surfaces in all these three 
images are correctly classified and parametrized. 


5. Conclusion 

In this paper various geometric methods for surface fitting and surface characterization
have been considered. Geometric surface fitting techniques give best results when the
input is from the restricted domain of plane, sphere, cylinder and cone. The problem
has been attempted for two different situations. In the first case, it has been assumed
that a priori knowledge about the shape of the surface is available. In such situations,
a surface of the proper type is fitted to the set of points representing the surface patch
and the parameters of the fitted surface are extracted. This strategy of selective surface
fitting reduces the amount of computation.

Figure 7. Synthetic range images containing occluded surfaces: (a) image-1,
(b) image-2, (c) image-3, (d) image-4, (e) image-5, and (f) image-6.

In the second case, when no a priori information regarding the shape of the unknown 
surface is available, the characterization of the surface is obtained in the form of a fuzzy 
membership vector. The unknown surface is taken as belonging to the class with 
maximum membership value. In this case the burden of fitting all four classes of 
surfaces cannot be avoided. The design of the fuzzy classifier plays a crucial role in 
determining the performance of the system. The performance of the classification 
scheme depends upon how correctly it can classify unknown surfaces. Thus the choice 



Figure 8. Boundaries of surface patches in synthetic range images: (a) image-1,
(b) image-2, (c) image-3, (d) image-4, (e) image-5, and (f) image-6.


High-temperature effects in hypersonic flows

DINESH K PRABHU

space Laboratories, Bangalore 560017, India

Abstract. The hypersonic (M = 25) flow past a 10° right circular cone at 0°
angle of attack has been computed using an implicit, noniterative, finite-difference
scheme for the Parabolized Navier-Stokes (PNS) equations.
Three different gas models have been considered in the study: (1) ideal gas
($\gamma$ = constant), (2) equilibrium air, and (3) five-species (N$_2$, O$_2$, NO, N, O)
chemical nonequilibrium air. For the nonequilibrium air case, three different
models have been considered: (a) one in which the constituent species
have no internal structure (i.e., species are ideal gases), (b) one in which the
constituent species have internal structure (i.e., species are thermally perfect
gases), and (c) one in which the species have internal structure and their
thermodynamic properties have been obtained from quantum mechanical
and spectroscopic calculations.

Keywords. Hypersonic flows; chemical equilibrium/nonequilibrium 
flows; planar/axisymmetric flows; finite-difference formulation. 


1. Introduction 

Research in the area of hypersonic aerothermodynamics in India has received a great 
impetus from the success of the Agni re-entry vehicle (especially from the point of 
designing its heat shield). In addition to the Agni programme, there are also plans for 
a one-stage-to-orbit Hyperplane whose design will require the integration of the
aerothermodynamic and propulsive (supersonic combustion of H$_2$ and air) aspects of
flight. The development, design and testing of such vehicles and perhaps future space
vehicles will require ground-based experimental facilities capable of simulating the 
extreme conditions normally encountered at hypersonic speeds. Ground-based test 
facilities are expensive from an operational point of view and the ones currently 
available in India are inadequate. This handicap can be overcome to a large extent 
through the use of high-speed digital computers that are widely available today. 

It is conventional to classify any flow whose Mach number exceeds 5 as hypersonic 
flow. However, this is merely a rule of thumb and is insufficient (Anderson 1989). 
Hypersonic flow is markedly different from the usual supersonic flow. For instance, the
elevated temperatures (due to high enthalpies) in the hypersonic viscous shock layer
result in the occurrence of several complex physical phenomena such as vibrational
relaxation, chemical reactions, ionisation, radiation, etc. which are not present in
conventional supersonic flows. These phenomena are generally termed “real-gas
effects” (“high-temperature effects” is a more appropriate term) and are usually rate
processes (mostly due to atomic, molecular, electronic or photonic collisions) which
could be either fast or slow compared to the characteristic flow time. On the one hand, if
the rate processes are infinitely fast, especially in the denser reaches of the atmosphere,
then one has a local thermodynamic equilibrium situation. For this case, the local
thermodynamic state is defined uniquely by any two state variables such as the internal
energy and density (two commonly used dependent variables in CFD). The usual
Navier-Stokes equations expressing the conservation of mass, linear momentum, and 
energy with suitable modifications to the equation of state are enough to model the 
flow. On the other hand, if the rate processes occur at finite rates, especially in the rarer 
reaches of the atmosphere, then one has a nonequilibrium situation. For this case, apart 
from the usual Navier-Stokes equations, one must consider the equations governing 
the rate processes (Anderson 1989; Park 1990). For instance, if chemical reactions occur 
at finite rates, then one has to take into consideration the conservation of mass of each 
of the species that makes up the gas mixture. This implies that the governing equation 
set is expanded (considerably if one has a large number of species). In addition to these 
phenomena, one must also take into account the internal structures of the species that 
compose the flow stream. As a first approximation, one can assume the atomic species 
to be point masses and the molecular species to be rigid rotors. For a more rigorous 
analysis, however, one must account for the realistic structure of the atomic and 
molecular species. 

Once the flow has been modelled mathematically, the equations governing the flow 
must be solved subject to some boundary conditions. The usual approach is to solve the 
equations numerically since they are intractable to analytical solution. There are
several possible approaches (Anderson et al 1982): (1) solve the time-dependent
form of the equations iteratively until a steady-state solution (usually the one of interest)
is reached, (2) solve the time-independent form of the equations by means of space
marching along the dominant flow direction, or (3) use a combination of the two. The first
approach is usually resource intensive for the three-dimensional case, both in terms of 
computer memory and time. The second approach reduces the dimensionality of the 
problem by one, i.e., some initial solution surface is marched in the dominant flow 
direction (time-like axis). This approach is subject to some restrictions (for example, the 
inviscid flow must be everywhere supersonic), but in the cases where such an approach 
is viable, it provides solutions of accuracy comparable to that obtained from the first 
approach and is also substantially less resource intensive. Examples of the
space-marching method include the Viscous Shock Layer (VSL) method and the
Parabolised Navier-Stokes (PNS) method (Rubin & Tannehill 1992).

The PNS equations are obtained from the time-dependent Navier-Stokes equations 
by dropping the time terms and the viscous terms in the streamwise direction. The 
equations so obtained are actually hyperbolic-elliptic due to the presence of the 
subsonic layer near the wall. This layer permits pressure disturbances to travel 
upstream (upstream influence) resulting in exponentially growing solutions (departure 
solutions). Consequently, the equations are not well posed for a space-marching 
method. There are several ways of overcoming this difficulty; the ones due to Vigneron
et al (1978) or Schiff & Steger (1979) are commonly used. In the Schiff and Steger
method, the pressure at the sonic point within the boundary layer is imposed in the 
direction normal to the body throughout the subsonic zone. In the technique due to 
Vigneron et al (1978), only a fraction of the streamwise pressure gradient is retained in 
the subsonic boundary layer. This fraction, which is a function of the local, streamwise 
Mach number, is obtained from an eigenvalue analysis of the governing equations. The 
eigenvalue analysis also reveals that the PNS equations cannot be used in flows where 
the inviscid Mach number is less than 1, e.g., the nose of a blunt body which has a large 
embedded subsonic zone in the shock layer. In conjunction with time-dependent
Navier-Stokes equations, PNS methods have been successfully used by many
researchers for a variety of problems ranging from three-dimensional ideal supersonic
flow to three-dimensional reacting hypersonic flow (Rakich et al 1984; Prabhu &
Tannehill 1986; Prabhu 1987, 1993; Prabhu et al 1988, 1993).
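
The text does not reproduce the expression for the retained fraction of the streamwise pressure gradient. A form commonly quoted in the PNS literature for Vigneron's technique is $\omega = \min[1, \sigma\gamma M_\xi^2/(1 + (\gamma - 1)M_\xi^2)]$, where $M_\xi$ is the local streamwise Mach number and $\sigma$ is a safety factor. The sketch below assumes that form; the function name and the default values of $\gamma$ and $\sigma$ are illustrative choices, not values taken from this paper.

```python
def vigneron_fraction(mach_xi, gamma=1.4, sigma=0.75):
    """Fraction of the streamwise pressure gradient retained at a grid point.

    Assumed form: omega = sigma*gamma*M^2 / (1 + (gamma - 1)*M^2), limited to 1.
    """
    m2 = mach_xi * mach_xi
    omega = sigma * gamma * m2 / (1.0 + (gamma - 1.0) * m2)
    return min(1.0, omega)


# Deep in the subsonic layer only a small part of the gradient is kept;
# in the supersonic part of the shock layer the full gradient is retained.
subsonic_fraction = vigneron_fraction(0.5)
supersonic_fraction = vigneron_fraction(3.0)
```

With this form the fraction vanishes at the wall, where the streamwise Mach number is zero, and grows monotonically until the limit of 1 is reached.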

After the choice of time or space-marching method has been made, one has a choice 
of several numerical algorithms - (i) explicit methods, (ii) implicit methods, or (iii) 
hybrid explicit-implicit methods (Anderson et al 1982). Explicit methods are usually 
slow since they are limited by the “effective time scale” of the problem in addition to the 
CFL limit. Implicit methods, however, do not normally have CFL limitations and can 
resolve multiple scales within the problem but require a lot more computer resources 
and time. The choice of method is actually dictated by the problem. For instance, 
unsteady nonequilibrium flows require the use of explicit methods to capture both the 
flow time scale and the reaction time scale (Cambier et al 1992). For a steady
equilibrating flow, where the reaction time scale is much less than the flow time scale, an 
implicit method is more appropriate (Cambier & Prabhu 1992). 

Other considerations in choosing the numerical scheme are the choice of finite- 
difference or finite-volume method and the representation of the numerical flux. Until 
recently, the finite-difference method was used with a symmetric representation of 
fluxes, i.e., central differences. With advances in numerical analysis, the choice has since 
shifted to finite-volume methods with biased representation of numerical fluxes, i.e., 
upwind differencing. 

In the present work, a finite-difference method with central-differencing of fluxes is
chosen to solve the PNS equations for steady, reacting/nonreacting, laminar,
hypersonic flow past a simple geometry such as a right circular cone. The primary objectives
of the study are:

• To bring out the salient features of hypersonic viscous shock layers through the study
  of high-temperature effects such as chemical reactions.
• To study the influence of the grid on the computed solution.
• To study the influence of various gas models (ideal, equilibrium, and nonequilibrium
  air) on the computed solution.
• To provide a comprehensive set of “benchmark” results for code validation.

With these objectives in mind, a two-dimensional planar/axisymmetric PNS code
has been developed. The code is based on an implicit, finite-difference method with
central differencing of inviscid/viscous fluxes. Three types of gas models have been
included in the code: (1) ideal gas ($\gamma = 1.4$), (2) equilibrium air whose thermodynamic
and transport properties are based on piecewise continuous curve fits developed by
Srinivasan et al (1987a, b), and (3) nonequilibrium air consisting of five species: N$_2$,
O$_2$, NO, N, and O. For the last case, three further sub-cases are considered: (a) the
constituent species have no internal structure, (b) the constituent species have internal
structure and their thermodynamic properties are obtained from Blottner et al (1971),
and (c) the constituent gases possess internal structure and their thermodynamic
properties are obtained from accurate spectroscopic constants and potentials (Liu
& Vinokur 1989).

Figure 1. Coordinate system used in the present study. $(\xi, \zeta)$ are generalised
coordinates (not necessarily orthogonal). L is the characteristic length. The outer
boundary is placed far enough from the body to be able to capture the outer shock.

2. Governing equations and boundary conditions 

2.1 Ideal/Equilibrium gas

The equations governing the steady, two-dimensional planar/axisymmetric, laminar
flow of an ideal/equilibrium gas can be written for a general curvilinear coordinate
system $(\xi, \zeta)$ shown in figure 1 as:

$$\frac{\partial}{\partial \xi}(E^i) + \frac{\partial}{\partial \zeta}(G^i - G^v) = j(S^i - S^v), \qquad (1)$$

where $j = 0$ for planar flow, and $j = 1$ for axisymmetric flow. The superscripts $i$ and
$v$ represent the inviscid and viscous quantities, respectively. The coordinate $\xi$ is
identified as the primary or streamwise coordinate and $\zeta$ as the normal coordinate.
The inviscid flux vectors, $E^i$ and $G^i$, are given by

$$E^i = \frac{1}{J}\begin{bmatrix} \rho U \\ \rho u U + \xi_x p \\ \rho w U + \xi_z p \\ (\rho e_t + p)U \end{bmatrix}, \qquad
G^i = \frac{1}{J}\begin{bmatrix} \rho W \\ \rho u W + \zeta_x p \\ \rho w W + \zeta_z p \\ (\rho e_t + p)W \end{bmatrix}, \qquad (2)$$
where $\rho$ (kg/m$^3$) is the mass density of the gas, $p$ (Pa) is the static pressure, $u$, $w$ (m/s)
are the Cartesian components of velocity, and $e_t$ (J/kg) is the specific total energy
given by:

$$e_t = e + \tfrac{1}{2}(u^2 + w^2), \qquad (3)$$

where $e$ (J/kg) is the specific internal energy of the gas.
The viscous flux vector, $G^v$, is given by

$$G^v = \frac{1}{J}\begin{bmatrix} 0 \\ \mu(l_1 u_\zeta + l_3 w_\zeta) \\ \mu(l_3 u_\zeta + l_2 w_\zeta) \\ \mu(l_1 u + l_3 w)u_\zeta + \mu(l_3 u + l_2 w)w_\zeta + \kappa l_4 T_\zeta \end{bmatrix}, \qquad (4)$$

where $\mu$ (Pa·s) is the coefficient of viscosity and $\kappa$ (W/m·K) is the coefficient of thermal
conductivity. The subscript $\zeta$ indicates partial differentiation w.r.t. the coordinate $\zeta$.
The coefficients $l_1, \ldots, l_4$ are given by

$$l_1 = \tfrac{4}{3}\zeta_x^2 + \zeta_z^2, \quad l_2 = \zeta_x^2 + \tfrac{4}{3}\zeta_z^2, \quad l_3 = \tfrac{1}{3}\zeta_x\zeta_z, \quad l_4 = \zeta_x^2 + \zeta_z^2. \qquad (5)$$

$\xi_x, \xi_z, \zeta_x, \zeta_z$ are the metrics and $J$ is the Jacobian of the coordinate transformation
$[(x, z) \to (\xi, \zeta)]$. $U$ and $W$ are the contravariant components of velocity and are given by

$$U = \xi_x u + \xi_z w, \qquad W = \zeta_x u + \zeta_z w. \qquad (6)$$


Note that the viscous flux contains terms in the $\zeta$ direction only, i.e., the streamwise
($\xi$) terms are assumed to be much smaller than those in the normal ($\zeta$) direction. The
equations are thus said to be “parabolized”. The governing equations, however, are still
hyperbolic-elliptic and hence improperly posed for space-marching solutions. The
conditions for well-posedness are elaborated on in a later section.

The vectors $S^i$ and $S^v$ are source terms for the axisymmetric formulation and are not
given here.

For the case of the ideal/equilibrium gas, the vector of conservation variables Q is 
chosen as 


$$Q = \{\rho, \rho u, \rho w, \rho e_t\}^T. \qquad (7)$$
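
As an illustration of how the conservation variables feed the inviscid fluxes, the following sketch evaluates the two flux vectors at a single grid point. It assumes that both vectors carry a factor $1/J$ and that the pressure has already been obtained from the equation of state; the function names and argument layout are hypothetical.

```python
def contravariant_velocities(u, w, xi_x, xi_z, zeta_x, zeta_z):
    """U and W: velocity components along the xi and zeta coordinate directions."""
    return xi_x * u + xi_z * w, zeta_x * u + zeta_z * w


def inviscid_fluxes(Q, p, metrics, J):
    """Return (E, G) for Q = [rho, rho*u, rho*w, rho*e_t] at one grid point.

    metrics = (xi_x, xi_z, zeta_x, zeta_z); a 1/J scaling is assumed.
    """
    rho, rho_u, rho_w, rho_et = Q
    u, w = rho_u / rho, rho_w / rho
    xi_x, xi_z, zeta_x, zeta_z = metrics
    U, W = contravariant_velocities(u, w, xi_x, xi_z, zeta_x, zeta_z)
    E = [rho * U, rho_u * U + xi_x * p, rho_w * U + xi_z * p, (rho_et + p) * U]
    G = [rho * W, rho_u * W + zeta_x * p, rho_w * W + zeta_z * p, (rho_et + p) * W]
    return [e / J for e in E], [g / J for g in G]


# Cartesian check: with identity metrics and J = 1 the familiar fluxes are recovered.
E_cart, G_cart = inviscid_fluxes([1.0, 2.0, 3.0, 10.0], 5.0, (1.0, 0.0, 0.0, 1.0), 1.0)
```

With identity metrics the contravariant velocities reduce to $u$ and $w$, which gives a quick sanity check on any implementation of the transformed fluxes.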

2.2 Nonequilibrium gas 

The chemically reacting gas is actually a mixture of gases that react with one another. 
The rates of these reactions are finite as compared to the equilibrium gas where the 
reaction rates are infinitely large. Therefore, in addition to the conservation of mass of 
the gas mixture as a whole, one has to satisfy the law of conservation of mass for each 
constituent species. 

The equations governing the steady, two-dimensional planar/axisymmetric, laminar
flow of a chemically reacting gas consisting of $n$ constituents can be written for a general
curvilinear coordinate system $(\xi, \zeta)$ shown in figure 1 as:

$$\frac{\partial}{\partial \xi}(E^i) + \frac{\partial}{\partial \zeta}(G^i - G^v) = j(S^i - S^v) + W^c, \qquad (8)$$
where the superscript $c$ represents the chemical source terms. The other symbols retain
the same meanings as before. The inviscid flux vectors, $E^i$ and $G^i$, are given by

$$E^i = \frac{1}{J}\begin{bmatrix} \rho U \\ \rho u U + \xi_x p \\ \rho w U + \xi_z p \\ \rho H U \\ \rho c_1 U \\ \rho c_2 U \\ \vdots \\ \rho c_{n-1} U \end{bmatrix}, \qquad
G^i = \frac{1}{J}\begin{bmatrix} \rho W \\ \rho u W + \zeta_x p \\ \rho w W + \zeta_z p \\ \rho H W \\ \rho c_1 W \\ \rho c_2 W \\ \vdots \\ \rho c_{n-1} W \end{bmatrix}, \qquad (9)$$

where $c_1, c_2, \ldots, c_{n-1}$ are the mass fractions of the species and $H$ (J/kg) is the specific
total enthalpy of the mixture. This is given by

$$H = h + \tfrac{1}{2}(u^2 + w^2), \qquad (10)$$


where h(J/kg) is the specific static enthalpy of the mixture. 

Note that only $n - 1$ equations are considered for the conservation of species mass.
The $n$th equation has been replaced by the global mass conservation equation and the
mass fraction of the $n$th species is obtained from the algebraic relation

$$c_n = 1 - \sum_{s=1}^{n-1} c_s. \qquad (11)$$

Compare the flux vectors given by (2) with those given in (9) - for the nonequilibrium 
case the flux vectors have n + 3 components while for the ideal/equilibrium gas case the 
vectors have 4 components. Therefore, the equation set for a nonequilibrium case can 
be very large depending on the value of n. 
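
The bookkeeping implied by this closure can be sketched as follows: the state vector carries only $n - 1$ species densities, and the last mass fraction is recovered algebraically. The function names and the ordering of the species within the vector are assumptions made for this example.

```python
def last_mass_fraction(transported_fractions):
    """Mass fraction of the n-th species from c_n = 1 - sum of the other n - 1."""
    return 1.0 - sum(transported_fractions)


def unpack_nonequilibrium_state(Q):
    """Split Q = [rho, rho*u, rho*w, rho*H, rho*c_1, ..., rho*c_(n-1)].

    Returns rho, u, w, H and the full list of n mass fractions.
    """
    rho = Q[0]
    u, w, H = Q[1] / rho, Q[2] / rho, Q[3] / rho
    fractions = [rho_c / rho for rho_c in Q[4:]]
    fractions.append(last_mass_fraction(fractions))
    return rho, u, w, H, fractions


# Five species give n + 3 = 8 unknowns per grid point (illustrative numbers).
state = [2.0, 4.0, 0.0, 6.0, 1.4, 0.4, 0.0, 0.0]
rho, u, w, H, c = unpack_nonequilibrium_state(state)
```

For the five-species air model this gives eight unknowns per grid point, twice the number needed for the ideal or equilibrium gas.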

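The viscous flux vector, $G^v$, is given by

$$G^v = \frac{\mu}{J}\begin{bmatrix} 0 \\ l_1 u_\zeta + l_3 w_\zeta \\ l_3 u_\zeta + l_2 w_\zeta \\ [(l_1 u + l_3 w)u_\zeta + (l_3 u + l_2 w)w_\zeta] + l_4\left[\dfrac{\kappa}{\mu}T_\zeta + \dfrac{\mathcal{D}}{\nu}\sum_s h_s c_{s,\zeta}\right] \\ l_4\dfrac{\mathcal{D}}{\nu}c_{1,\zeta} \\ \vdots \\ l_4\dfrac{\mathcal{D}}{\nu}c_{n-1,\zeta} \end{bmatrix}, \qquad (12)$$

where $\mu$ (Pa·s) is the coefficient of viscosity, $\nu = \mu/\rho$ is the kinematic viscosity,
$\kappa$ (W/m·K) is the coefficient of thermal conductivity, and $\mathcal{D}$ (m$^2$/s) is the coefficient of
binary diffusion. The coefficients $l_1, \ldots, l_4$ are given by (5).

The chemical source vector, $W^c$, is

$$W^c = \{0, 0, 0, 0, \dot{w}_1, \dot{w}_2, \ldots, \dot{w}_{n-1}\}^T, \qquad (13)$$

where $\dot{w}_s$ (kg/m$^3$·s) represents the mass production rate of species $s$.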

The vectors $S^i$ and $S^v$ are source terms for the axisymmetric formulation. They are
not given here and can be found elsewhere (Prabhu et al 1988).

For the case of the nonequilibrium gas, the vector of conservation variables Q is 
chosen as 


$$Q = \{\rho, \rho u, \rho w, \rho H, \rho c_1, \rho c_2, \ldots, \rho c_{n-1}\}^T. \qquad (14)$$


2.3 Boundary conditions 


The domain of interest is bounded by two boundaries - (a) the far field boundary and 
(b) the inner or wall boundary. 

At the outer boundary freestream conditions are applied. This means that any 
discontinuities that arise in the flowfield are “captured” as part of the solution. This is 
possible due to the strong conservation-law form of the equations. 

Since the flow is viscous, “no-slip” boundary conditions are applied at the wall, i.e.,

$$u = w = 0. \qquad (15)$$

The wall is assumed to be isothermal with a constant wall temperature, i.e.,

$$T_w = \text{constant}. \qquad (16)$$

Further, the pressure gradient in the normal direction ($\zeta$) is assumed to be zero at the
wall. Strictly speaking, one has to solve the normal momentum equation at the wall. In
practice, however, this approximation from boundary-layer arguments works well.
This boundary condition is expressed mathematically as:

$$\frac{\partial p}{\partial \zeta} = 0. \qquad (17)$$

In addition to these boundary conditions, conditions have to be imposed on the
species mass densities (or equivalently mass fractions) for the chemical nonequilibrium
case. We will assume the wall to be noncatalytic, i.e., the species density gradients
normal to the wall are zero. This is expressed as:

$$\frac{\partial c_s}{\partial \zeta} = 0. \qquad (18)$$


3. Gas models 

In order to close the system of governing equations, relations between the
thermodynamic quantities and expressions for the transport properties of the gas or gas
mixture are required. The relations and expressions depend on the type of gas
considered and are discussed in what follows.







3.1 Thermodynamic and transport properties: Ideal gas 

The ideal gas obeys the following equations of state (thermal and caloric):

$$p = \rho R T, \qquad (19)$$

$$p = (\gamma - 1)\rho e, \qquad (20)$$

where $p$ (Pa) is the static pressure, $\rho$ (kg/m$^3$) is the mass density, $R$ (J/kg·K) is the
specific gas constant, $T$ (K) is the static temperature, $e$ (J/kg) the specific internal
energy and $\gamma$ (= a constant) is the ratio of specific heats. The speed of sound $a$ (m/s) is
defined as

$$a = (\gamma p/\rho)^{1/2}. \qquad (21)$$

The coefficient of viscosity (Pa·s) is obtained from Sutherland’s formula

$$\mu = \mu_{ref}\,T^{3/2}/(T + C), \qquad (22)$$

where $\mu_{ref}$ is a reference viscosity and $C$ (= 110.4 K) is Sutherland’s constant. The
coefficient of thermal conductivity (W/m·K) is obtained from the definition of the
Prandtl number

$$Pr = \mu C_p/\kappa = \text{constant} \Rightarrow \kappa = \mu C_p/Pr, \qquad (23)$$

where the specific heat $C_p$ (J/kg·K) at constant pressure is given by

$$C_p = \gamma R/(\gamma - 1) = \text{constant}. \qquad (24)$$
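
The ideal-gas closure can be collected into a few small functions. This is a sketch under stated assumptions: the gas constant of air, the reference viscosity in Sutherland's formula and the Prandtl number are typical textbook values for air and are not taken from the paper; only $\gamma = 1.4$ and $C = 110.4$ K appear in the text.

```python
import math

GAMMA = 1.4            # ratio of specific heats (from the text)
SUTHERLAND_C = 110.4   # K, Sutherland's constant (from the text)
R_AIR = 287.06         # J/(kg K), assumed specific gas constant of air
MU_REF = 1.458e-6      # kg/(m s K^0.5), assumed reference viscosity for air
PRANDTL = 0.72         # assumed constant Prandtl number


def pressure(rho, e):
    """Caloric equation of state: p = (gamma - 1) * rho * e."""
    return (GAMMA - 1.0) * rho * e


def temperature(rho, p):
    """Thermal equation of state solved for T: T = p / (rho * R)."""
    return p / (rho * R_AIR)


def sound_speed(rho, p):
    """Speed of sound: a = sqrt(gamma * p / rho)."""
    return math.sqrt(GAMMA * p / rho)


def viscosity(T):
    """Sutherland's formula: mu = mu_ref * T**1.5 / (T + C)."""
    return MU_REF * T ** 1.5 / (T + SUTHERLAND_C)


def conductivity(T):
    """Thermal conductivity from a constant Prandtl number and constant C_p."""
    cp = GAMMA * R_AIR / (GAMMA - 1.0)
    return viscosity(T) * cp / PRANDTL
```

For example, `viscosity(288.15)` evaluates to roughly 1.79e-5 Pa·s with these assumed constants, close to the usual sea-level value for air.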

3.2 Thermodynamic and transport properties: Equilibrium gas 

The equilibrium gas is actually a mixture of gases in which we have local
thermodynamic equilibrium. For this case, given any two state variables (for instance $\rho$ and
$e$) we can determine the chemical composition of the gas using Gibbs free-energy
minimisation (Gordon & McBride 1976), or the equilibrium constants method (Bailey
1967). Such iterative methods are quite time consuming because the gas composition
has to be determined at every grid point at every spatial step. Fortunately, for the 
special case of air, gas compositions over a wide range of densities and temperatures 
have already been computed and curve fits for the thermodynamic and transport 
properties have been obtained (Srinivasan et al 1987a, b). These piecewise continuous 
curve fits are of adequate accuracy and are briefly described in what follows. 

Recall that the primary variables used in the ideal/equilibrium gas formulation are
$\rho$ and $e$. For this choice of state variables the curve fits for the thermodynamic
properties $p$, $T$ and $a$ are in functional form (Srinivasan et al 1987a)

$$p = p(\rho, e); \quad a = a(\rho, e); \quad T = T(\rho, e). \qquad (25)$$

We also require the following curve fits for the implementation of boundary conditions
(this is explained in a subsequent section):

$$h = h(p, \rho); \quad T = T(p, \rho). \qquad (26)$$

In addition to these curve fits, we need a link between the fluid dynamics and the
equilibrium chemistry. This link is provided by two parameters $\tilde{\gamma}$ and $\beta$ which are
defined as

$$\tilde{\gamma} = h/e, \qquad (27)$$

$$\beta = p/\rho R T. \qquad (28)$$

The first parameter is analogous to the usual isentropic exponent $\gamma$ which is the ratio of
specific heats. For the case of an ideal gas $\tilde{\gamma}$ and $\gamma$ are identical. The second parameter is the
compressibility of the gas and measures the departure of the gas from its ideal
behaviour. For the case of an ideal gas $\beta$ is identically 1.

The first correlation given in (25) is actually obtained from a curve fit of $\tilde{\gamma}$,

$$\tilde{\gamma} = \tilde{\gamma}(\rho, e), \qquad (29)$$

in conjunction with

$$p = (\tilde{\gamma} - 1)\rho e. \qquad (30)$$

Using (30) with the definition of compressibility the temperature can be rewritten as

$$T = [(\tilde{\gamma} - 1)/\beta](e/R). \qquad (31)$$

Through the use of (30) and (31), the formulation for the case of equilibrium air is the
same as that for ideal gas with $\gamma$ being replaced by $\tilde{\gamma}$.
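
A short sketch shows how the two link parameters close the system once they are known. The values passed in below stand in for the output of the curve fits, which are not reproduced here; the function names, the gas constant and the sample numbers are assumptions for illustration.

```python
R_AIR = 287.06  # J/(kg K), assumed specific gas constant of undissociated air


def equilibrium_pressure(rho, e, gamma_tilde):
    """Pressure from p = (gamma_tilde - 1) * rho * e."""
    return (gamma_tilde - 1.0) * rho * e


def equilibrium_temperature(e, gamma_tilde, beta, R=R_AIR):
    """Temperature from T = (gamma_tilde - 1) * e / (beta * R)."""
    return (gamma_tilde - 1.0) * e / (beta * R)


# Hypothetical curve-fit output at one grid point (illustrative numbers only).
rho, e = 0.01, 5.0e6
gamma_tilde, beta = 1.2, 1.3
p = equilibrium_pressure(rho, e, gamma_tilde)
T = equilibrium_temperature(e, gamma_tilde, beta)

# The pair (p, T) reproduces the compressibility it was built from.
recovered_beta = p / (rho * R_AIR * T)
```

Setting `gamma_tilde = 1.4` and `beta = 1.0` recovers the ideal-gas relations, which is why the same solver can serve both gas models.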

As in the case of the thermodynamic properties, the transport properties are also
available in the form of piecewise continuous curve fits (Srinivasan et al 1987b) with
$\rho$ and $e$ as the primary variables. The curve fits in functional form are:

$$\mu = \mu(\rho, e); \quad \kappa = \kappa(\rho, e). \qquad (32)$$

3.3 Thermodynamic and transport properties: Chemical nonequilibrium gas 

In the present context, chemical nonequilibrium gas refers specifically to air consisting
of five species: N$_2$, O$_2$, NO, N and O. These species react with one another and the
reactions are assumed to be far from equilibrium, i.e., occur at a finite rate.

We have to consider the properties of the individual species as well as those of the gas 
mixture. The properties of the individual species depend on the kind of structure the 
species possess. For instance, we can assume the atomic species to be point masses or 
take into account their actual quantum structure. These assumptions lead to different 
values and behaviour of the thermodynamic properties. 

The component gases obey the thermal equation of state,

$$p_s = \rho_s \mathcal{R} T/\mathcal{M}_s, \qquad (33)$$

where $p_s$ (Pa) is the partial pressure of species $s$, $\mathcal{R}$ is the universal gas constant
(8314.34 J/kmol·K), $\mathcal{M}_s$ (kg/kmol) is the molecular mass of species $s$. The caloric
equation of state depends on the nature of the constituent species. Three models are 
considered in the present study. These models differ in the kind of structure possessed 
by the constituent species of the gas mixture. In the first gas model, the species are 
assumed to have no internal structure - each atomic species is assumed to be a point 
mass with three translational degrees of freedom and each diatomic species is assumed 
to be a rigid rotor with three translational and two rotational degrees of freedom. In 
other words, the species that constitute the gas mixture are ideal gases. In the other two
models, each species has an internal structure: each atomic species has quantised
energy levels and each diatomic species has quantised electronic states which further
contain vibrational and rotational levels. These models are schematically represented
in figure 2.

Figure 2. Schematic representation of atoms and molecules in the ideal and real
senses. Atoms are idealised as point masses (a) while diatomic molecules are
idealised as rigid rotors (b). In reality the internal structure is quantised: atoms
have discrete energy levels and molecules have electronic, vibrational, and
rotational states. The specific heats of the species are constants for the ideal case and
functions of temperature for the real case. [Panel labels: IDEAL, 3 translational d.o.f.
($f = 3$) for atoms, 3 translational and 2 rotational d.o.f. ($f = 5$) for diatomic molecules,
$C_p = \frac{f+2}{2}\frac{\mathcal{R}}{\mathcal{M}}$ = constant; REAL, vibrational states, $C_p = C_p(T)$.]

3.3a Mixture of ideal gases: In this model, the atoms are assumed to be point masses
with 3 translational degrees of freedom and the diatomic molecules are assumed to be
rigid rotors with 3 translational and 2 rotational degrees of freedom (vibrational degrees
of freedom being neglected). The specific heat and enthalpy of each individual species
are given by

$$C_{p,s} = [(f + 2)/2](\mathcal{R}/\mathcal{M}_s) = \text{constant}, \qquad (34)$$

$$h_s = C_{p,s}T + h_s^0, \qquad (35)$$

where $h_s^0$ (J/kg) is the heat of formation of species $s$ and $f$ the number of degrees of freedom.
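
For the five species of the air model, the constant specific heats follow directly from the number of degrees of freedom. In the sketch below the molecular masses are standard values rather than numbers quoted in the paper, and the heat of formation is left as an input because the paper's values are not listed in this section; the names are illustrative.

```python
R_UNIVERSAL = 8314.34  # J/(kmol K), value quoted in the text

# Standard molecular masses in kg/kmol (assumed, not taken from the paper).
MOLECULAR_MASS = {"N2": 28.0134, "O2": 31.9988, "NO": 30.0061,
                  "N": 14.0067, "O": 15.9994}

# Point-mass atoms have f = 3; rigid-rotor diatomic molecules have f = 5.
DEGREES_OF_FREEDOM = {"N2": 5, "O2": 5, "NO": 5, "N": 3, "O": 3}


def species_cp(name):
    """Constant specific heat C_p = ((f + 2) / 2) * R / M in J/(kg K)."""
    f = DEGREES_OF_FREEDOM[name]
    return 0.5 * (f + 2) * R_UNIVERSAL / MOLECULAR_MASS[name]


def species_enthalpy(name, T, heat_of_formation):
    """Enthalpy h = C_p * T + h0, with h0 supplied by the caller in J/kg."""
    return species_cp(name) * T + heat_of_formation


cp_table = {name: species_cp(name) for name in MOLECULAR_MASS}
```

With these masses the diatomic species have $C_p$ of about 3.5 times their specific gas constant and the atoms 2.5 times, as expected for rigid rotors and point masses.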

3.3b Mixture of perfect gases - Blottner model: In this model, the internal structure
of the atoms and molecules is taken into account. Consequently, the specific heats are
no longer constant, and both the specific heats and the enthalpies depend on the
temperature. This is expressed as

$$C_{p,s} = A_{1,s}(T), \qquad (36)$$

$$h_s = T A_{2,s}(T) + h_s^0, \qquad (37)$$

where $A_{1,s}$ and $A_{2,s}$ are tabulated functions of temperature (Blottner et al 1971). In the
present study, cubic spline interpolation is used to determine the intermediate values of
$A_{1,s}$ and $A_{2,s}$.


3.3c Mixture of perfect gases - Liu-Vinokur model: As in the previous model, the
internal structure of the atoms and molecules is taken into account. However, more
exact methods utilizing the latest available spectroscopic constants are used in
determining the thermodynamic properties (Liu & Vinokur 1989). The properties are
then curve fit using cubic splines and the spline coefficients tabulated as functions of
temperature. This is expressed as

$$C_{p,s} = C_{1,s} + C_{2,s}(z + T) + C_{3,s}(z^2 + 2zT) + C_{4,s}(z^3 + 3z^2T), \qquad (38)$$

$$h_s = T(C_{1,s} + C_{2,s}z + C_{3,s}z^2 + C_{4,s}z^3) + h_s^0, \qquad (39)$$

where $C_{1,s}$, $C_{2,s}$, $C_{3,s}$ and $C_{4,s}$ are tabulated values of the spline coefficients over
a temperature range 250-25,000 K (in steps of 250 K). The parameter $z$ is defined as

$$z = T - \text{int}(T/\Delta T)\Delta T. \qquad (40)$$

In the above equation int(·) indicates the integer part of the argument.
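
The specific-heat expression is the temperature derivative of the enthalpy expression within one spline interval, since $dz/dT = 1$ there. The sketch below evaluates both and checks this numerically. The coefficient values are made up for the example; in practice a separate set would be looked up for each 250 K interval and each species.

```python
DELTA_T = 250.0  # K, tabulation step quoted in the text


def local_offset(T, delta_t=DELTA_T):
    """z = T - int(T / dT) * dT, the distance from the start of the interval."""
    return T - int(T / delta_t) * delta_t


def spline_enthalpy(T, coeffs, heat_of_formation=0.0):
    """h = T * (C1 + C2*z + C3*z**2 + C4*z**3) + h0."""
    c1, c2, c3, c4 = coeffs
    z = local_offset(T)
    return T * (c1 + c2 * z + c3 * z ** 2 + c4 * z ** 3) + heat_of_formation


def spline_specific_heat(T, coeffs):
    """C_p = C1 + C2*(z + T) + C3*(z**2 + 2*z*T) + C4*(z**3 + 3*z**2*T)."""
    c1, c2, c3, c4 = coeffs
    z = local_offset(T)
    return (c1 + c2 * (z + T) + c3 * (z ** 2 + 2.0 * z * T)
            + c4 * (z ** 3 + 3.0 * z ** 2 * T))


# Hypothetical coefficients for the interval 1000-1250 K (illustration only).
coeffs = (900.0, 0.1, 1.0e-4, -1.0e-7)
T, dT = 1100.0, 0.01
numerical_cp = (spline_enthalpy(T + dT, coeffs) - spline_enthalpy(T - dT, coeffs)) / (2.0 * dT)
analytic_cp = spline_specific_heat(T, coeffs)
```

The two values agree to within the accuracy of the finite difference, which is a convenient check when coding the tabulated fits.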

For each of the gas models considered for the nonequilibrium case, the static
pressure of the mixture is obtained from Dalton’s law of partial pressures, i.e.,

$$p = \rho\mathcal{R}T\sum_s (c_s/\mathcal{M}_s) = \rho\mathcal{R}T/\mathcal{M}. \qquad (41)$$

The specific heat and enthalpy of the mixture are computed as weighted sums (the
weights being the mass fractions of the species) of the species properties, i.e.,

$$C_p = \sum_s c_s C_{p,s}, \qquad (42)$$

$$h = \sum_s c_s h_s. \qquad (43)$$

The frozen speed of sound $a_f$ for the nonequilibrium case is given by

$$a_f = [\mathcal{R}C_{p_f}T/(\mathcal{M}C_{p_f} - \mathcal{R})]^{1/2}. \qquad (44)$$
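
The mixture relations can be sketched as below. The sketch assumes that the frozen specific heat in the speed-of-sound expression is the mass-fraction-weighted sum of the species specific heats and that the mixture molecular mass follows from Dalton's law; the function names and the sample composition are illustrative.

```python
import math

R_UNIVERSAL = 8314.34  # J/(kmol K), value quoted in the text


def mixture_molecular_mass(c, M):
    """Mixture molecular mass from 1/M = sum over species of c_s / M_s."""
    return 1.0 / sum(c[s] / M[s] for s in c)


def mixture_pressure(rho, T, c, M):
    """Dalton's law written for the mixture: p = rho * R * T / M_mix."""
    return rho * R_UNIVERSAL * T / mixture_molecular_mass(c, M)


def mass_weighted(c, species_property):
    """Mass-fraction-weighted sum, used for the mixture C_p and enthalpy."""
    return sum(c[s] * species_property[s] for s in c)


def frozen_sound_speed(T, cp_frozen, M_mix):
    """a_f = sqrt(R * C_p * T / (M_mix * C_p - R)) with composition held fixed."""
    return math.sqrt(R_UNIVERSAL * cp_frozen * T / (M_mix * cp_frozen - R_UNIVERSAL))


# Illustrative cold-air composition by mass (assumed values).
M = {"N2": 28.0134, "O2": 31.9988}
c = {"N2": 0.767, "O2": 0.233}
cp = {s: 3.5 * R_UNIVERSAL / M[s] for s in M}  # rigid-rotor specific heats
M_mix = mixture_molecular_mass(c, M)
a_f = frozen_sound_speed(300.0, mass_weighted(c, cp), M_mix)
```

For a single rigid-rotor species the expression reduces to the ideal-gas speed of sound with $\gamma = 1.4$, which is a useful limiting check.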

The viscosity (Pa·s) of each individual species is obtained from curve fits due to
Blottner et al (1971). These curve fits are of the form

$$\mu_s = 0.1\exp[(A_s \ln T + B_s)\ln T + C_s], \qquad (45)$$

where $A_s$, $B_s$ and $C_s$ are constants. The thermal conductivity (W/m·K) of species $s$ is
computed using a semi-empirical formula due to Eucken (1913)



The transport properties of the gas mixture are then obtained through the use of mixing
rules. The mixing rule due to Wilke (1950) is considered adequate for non-ionizing
mixtures and is written for the coefficient of viscosity as

M = Z(V S 


!+ It 


where 


r %X s J/ s + J/ r \3A* 


5 J_ 


O , 


<D = 


,8 (Jf, + Jf r ) 


1/2 


F„+ { — ( —^ 

Mr 


1/2 ■) 1/2 




The values of the constants A* B sr and F. r are 


^,* 3 . ^=1-0, B sr — 10. 


(47) 

(48) 

(49) 


The same mixing rule is used for the coefficient of thermal conductivity with $\mu_s$ being
replaced by $\kappa_s$ in (47).

In the present study, the mass diffusion is assumed to be binary. The justification for this
is provided by the fact that air consisting of 5 species can be treated as a binary gas with
the “heavy” particles N$_2$, O$_2$ and NO (nearly the same molecular weight) and “light”
particles N and O (nearly the same atomic weight). The binary diffusion coefficient is
obtained from the definition of the Lewis number which is assumed to be a constant.

$$Le = \rho C_p\mathcal{D}/\kappa \Rightarrow \mathcal{D} = \kappa Le/\rho C_p. \qquad (50)$$
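
In code, the diffusion coefficient is a one-line consequence of the constant Lewis number. The default value of 1.4 used below is a figure often adopted for air and is an assumption here, since the section does not state the value used; the function name is illustrative.

```python
def binary_diffusion_coefficient(kappa, rho, cp, lewis=1.4):
    """D = kappa * Le / (rho * C_p) in m^2/s; the default Lewis number is assumed."""
    return kappa * lewis / (rho * cp)


# Illustrative mixture properties: kappa in W/(m K), rho in kg/m^3, cp in J/(kg K).
D = binary_diffusion_coefficient(kappa=0.05, rho=0.1, cp=1000.0)
```

Because the mixture conductivity, density and specific heat vary through the shock layer, the diffusion coefficient varies with them even though the Lewis number is held fixed.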


3.4 Reaction model: Chemical nonequilibrium gas 


Consider a multicomponent system of $n$ species undergoing $m$ simultaneous
elementary reactions. These reactions can be represented symbolically as

$$\sum_{s=1}^{n}\nu'_{l,s}A_s \rightleftharpoons \sum_{s=1}^{n}\nu''_{l,s}A_s, \qquad l = 1, 2, \ldots, m, \qquad (51)$$

where vj s , v" s are stoichiometric coefficients and /4 S is chemical symbol of species s. 
Using the law of mass action, the rate of production of species s is 


w = 


1=1 


z (Ks - Ks) n - Ki(T) n [pvr] vi: ' b < 52 > 


where y r is the mole-mass ratio of species r and k fl and k b l are, respectively, the forward 
and backward reaction rates of reaction l. These reaction rates are expressed in 
modified Arrhenius form (Blottner et al 1971) as: 


k H = A H T ‘"™V{-Ef.,lkn k bJ = A bJ T"-‘exp(-E bl /kT), ( 53 ) 

where A f ,, A bl , n f l , n b l , E^, and E bJ are constants for a particular reaction / and k is 
the Boltzmann constant. 
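
The modified Arrhenius form is simple to evaluate. The following is a minimal sketch, assuming the activation energy is supplied in joules per particle so that it can be divided by kT directly; the function name and the sample constants are illustrative only and are not the Blottner et al (1971) rate data.

```python
import math

BOLTZMANN = 1.380649e-23  # J/K


def arrhenius_rate(a, n, e_act, temperature):
    """Modified Arrhenius rate k = A * T**n * exp(-E / (k_B * T))."""
    return a * temperature ** n * math.exp(-e_act / (BOLTZMANN * temperature))


# Illustrative call: an activation energy of about 8.2e-19 J per particle
# (a made-up round number) evaluated at two temperatures.
k_cold = arrhenius_rate(1.0, 0.0, 8.2e-19, 4000.0)
k_hot = arrhenius_rate(1.0, 0.0, 8.2e-19, 8000.0)
```

With a positive activation energy and n = 0 the rate grows monotonically with temperature, so `k_hot` exceeds `k_cold`.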

In the present work, the following reactions between the various species have been 
considered: 


(1) O₂ + M₁ ⇌ 2O + M₁

(2) N₂ + M₂ ⇌ 2N + M₂

(3) NO + M₃ ⇌ N + O + M₃

(4) NO + O ⇌ O₂ + N

(5) N₂ + O ⇌ NO + N

Figure 3. Temperature variation of the specific heat (C_v) of
molecular oxygen (O₂).


In reactions 1-3 (also called three-body reactions), M₁, M₂ and M₃ represent the
catalytic third body, which undergoes no change during the reaction and could be any
one of the species N₂, O₂, NO, N, or O. Reactions 4 and 5 are called “shuffle” or
exchange reactions. With five possible third bodies for each of reactions 1-3, there are
therefore 17 reactions in the above set.
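
The count of 17 follows from expanding each three-body reaction over the five possible third bodies and adding the two exchange reactions. A small sketch of this bookkeeping is given below; the data layout and helper names are hypothetical and serve only to check the count and the atom balance.

```python
SPECIES = ["N2", "O2", "NO", "N", "O"]

# (number of N atoms, number of O atoms) in each species
ATOMS = {"N2": (2, 0), "O2": (0, 2), "NO": (1, 1), "N": (1, 0), "O": (0, 1)}


def build_reactions():
    """Return the reaction set as (reactants, products) pairs."""
    dissociation = [("O2", ["O", "O"]), ("N2", ["N", "N"]), ("NO", ["N", "O"])]
    exchange = [(["NO", "O"], ["O2", "N"]), (["N2", "O"], ["NO", "N"])]
    reactions = []
    for molecule, products in dissociation:
        for third_body in SPECIES:  # one reaction per catalytic third body
            reactions.append(([molecule, third_body], products + [third_body]))
    reactions.extend(exchange)
    return reactions


def is_balanced(reaction):
    """Check that N and O atoms are conserved across the reaction."""
    reactants, products = reaction
    count = lambda side: tuple(sum(ATOMS[sp][i] for sp in side) for i in range(2))
    return count(reactants) == count(products)


reactions = build_reactions()
```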

3.5 Discussion 

The inclusion of the internal structure of the species changes their thermodynamic
properties. In figures 3-5, the specific heats (at constant volume) of O₂, NO, and O are
plotted as functions of temperature. It is clearly seen that the species with no internal
structure have a constant specific heat while those with internal structure have heat
capacities that increase with increasing temperature (the heat capacity decreases in the
diatomic species when their dissociation limits are reached). In the case of atomic oxygen,
the agreement between the C_v values of Blottner et al (1971) and those of Liu & Vinokur
(1989) is very good for temperatures less than about 8000 K. Beyond this temperature,
the differences are large. For the diatomic species, however, large differences occur
between the models from around 3000 K. The Liu-Vinokur model is to be considered
as more accurate because it uses the latest available data for spectroscopic constants
and intermolecular potentials in conjunction with accurate numerical methods.

Figure 4. Temperature variation of the specific heat (C_v) of
nitric oxide (NO).

Figure 5. Temperature variation of the specific heat (C_v) of
atomic oxygen (O).


4. Analysis 
4.1 Parabolization 

As mentioned earlier, the word “parabolized” in the PNS equations is a misnomer. The
governing equations are actually hyperbolic-elliptic in the ξ direction. The ellipticity is
introduced into the system due to the presence of the subsonic zone in the boundary
layer at the wall. This subsonic zone permits the pressure signals to travel upstream
(upstream influence). Hence, a single-sweep space-marching method, wherein the
spatial evolution (in ξ) of a line of initial data is computed, is not properly posed and
leads to exponentially growing solutions called departure solutions. There are several
ways of getting rid of this ellipticity (Vigneron et al 1978; Schiff & Steger 1979).
Following Vigneron et al (1978), an analysis was carried out for the various gas models
considered in the present study. The details of the analyses are not given here but only
the salient points are mentioned.

An eigenvalue (linear) analysis (Vigneron et al 1978; Prabhu & Tannehill 1986;
Prabhu et al 1988) of the governing equations reveals that the governing equations are
hyperbolic-parabolic, i.e., well-posed for space marching in ξ, if and only if (1) there is no
flow separation in the axial (ξ) direction, (2) the local Mach number is greater than unity in
the inviscid part of the flowfield, and (3) only a fraction ω (0 ≤ ω ≤ 1) of the streamwise
pressure gradient is retained in the subsonic part of the flowfield. The magnitude of
ω depends on the local streamwise Mach number and is given by the expression

$$\omega = \min\{1,\ \sigma\gamma M_\xi^2\,[1 + (\gamma - 1)M_\xi^2]^{-1}\}, \tag{54}$$

where σ (0 ≤ σ ≤ 1; typically a value of 0.85 or less is used) is a factor of safety to account
for nonlinear effects neglected in the analysis. The parameter γ in the above equation is
given by

$$\gamma = \begin{cases} \gamma & \text{(ideal gas)} \\ h/e & \text{(equilibrium air)} \\ \mathcal{M}C_p/(\mathcal{M}C_p - \mathcal{R}) & \text{(nonequilibrium air)} \end{cases} \tag{55}$$

and the local streamwise Mach number is given by

$$M_\xi = U/(a\,|\nabla\xi|), \tag{56}$$

where the speed of sound a is given by (21) for an ideal gas, by the second correlation of
(25) for equilibrium air, or by (44) for chemical nonequilibrium air.

Note that when ω = 0 the streamwise pressure gradient is completely omitted and
when ω = 1 the streamwise pressure gradient is completely retained. In the inviscid part
of the flowfield, since the Mach number must be greater than unity, the streamwise
pressure gradient is completely retained. In the subsonic viscous region close to the wall
only a fraction of ∂p/∂ξ is retained.
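
The behaviour of the retained fraction can be checked with a few lines of code. The sketch below evaluates the Vigneron-type expression ω = min{1, σγM²/[1 + (γ - 1)M²]}; the function name and the default values (γ = 1.4, σ = 0.85) are assumptions made for illustration.

```python
def vigneron_omega(mach_xi, gamma=1.4, sigma=0.85):
    """Fraction of the streamwise pressure gradient retained at a point."""
    m2 = mach_xi * mach_xi
    return min(1.0, sigma * gamma * m2 / (1.0 + (gamma - 1.0) * m2))


# Sample values across the subsonic layer and into the supersonic region.
samples = [(m, vigneron_omega(m)) for m in (0.0, 0.5, 1.0, 1.2, 5.0)]
```

With these defaults the gradient is dropped entirely at the wall (M = 0), a fraction of 0.85 is kept at M = 1, and the full gradient is retained only beyond a streamwise Mach number of about 1.125.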


4.2 Treatment of the streamwise pressure gradient 


From the preceding paragraphs, it is clear that for a single-sweep space-marching
solution to be viable, the streamwise pressure gradient has to be treated specially. The
streamwise inviscid flux E^i is now split into two parts, E^{i*} and P^i, as shown below:

$$\mathbf{E}^i = \mathbf{E}^{i*} + \mathbf{P}^i, \tag{57}$$

where E^{i*} and P^i are

$$\mathbf{E}^{i*} = (1/J)\{\rho U,\ \rho u U + \xi_x \omega p,\ \rho w U + \xi_z \omega p,\ (\rho e_t + p)U\}^T, \tag{58}$$

$$\mathbf{P}^i = (1/J)\{0,\ \xi_x(1-\omega)p,\ \xi_z(1-\omega)p,\ 0\}^T, \tag{59}$$

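for the ideal/equilibrium case, and

$$\mathbf{E}^{i*} = \frac{1}{J}\begin{Bmatrix} \rho U \\ \rho u U + \xi_x \omega p \\ \rho w U + \xi_z \omega p \\ \rho H U \\ \rho c_1 U \\ \rho c_2 U \\ \vdots \\ \rho c_{n-1} U \end{Bmatrix}, \qquad \mathbf{P}^i = \frac{1}{J}\begin{Bmatrix} 0 \\ \xi_x (1-\omega) p \\ \xi_z (1-\omega) p \\ 0 \\ 0 \\ 0 \\ \vdots \\ 0 \end{Bmatrix}, \tag{60}$$

for the nonequilibrium case.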
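
The splitting for the ideal/equilibrium case can be sketched as follows. The function below is an illustration only: its name and argument list are hypothetical, `big_u` stands for the contravariant velocity U, and `jac` for the Jacobian J. It returns the marched part and the omitted pressure part, whose sum recovers the full inviscid flux.

```python
def split_inviscid_flux(rho, u, w, e_t, p, big_u, xi_x, xi_z, jac, omega):
    """Split the streamwise inviscid flux into E* (kept) and P (dropped)."""
    e_star = [
        rho * big_u / jac,
        (rho * u * big_u + xi_x * omega * p) / jac,
        (rho * w * big_u + xi_z * omega * p) / jac,
        (rho * e_t + p) * big_u / jac,
    ]
    p_part = [
        0.0,
        xi_x * (1.0 - omega) * p / jac,
        xi_z * (1.0 - omega) * p / jac,
        0.0,
    ]
    return e_star, p_part
```

Setting `omega = 1.0` makes the pressure part vanish, so the kept part is then the complete flux.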
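The governing equations (1) or (8) can now be written as

$$\frac{\partial \mathbf{E}^{i*}}{\partial \xi} + \frac{\partial}{\partial \zeta}(\mathbf{G}^i - \mathbf{G}^v) = j\,(\mathbf{S}^i - \mathbf{S}^v) + [\dot{\mathbf{W}}] - \frac{\partial \mathbf{P}^i}{\partial \xi}. \tag{61}$$

Normally, the last term of (61) is not included in the computations and the system of
equations that is actually solved is

$$\frac{\partial \mathbf{E}^{i*}}{\partial \xi} + \frac{\partial}{\partial \zeta}(\mathbf{G}^i - \mathbf{G}^v) = j\,(\mathbf{S}^i - \mathbf{S}^v) + [\dot{\mathbf{W}}]. \tag{62}$$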

5. Numerical method 

The PNS equations, (62), cannot be solved analytically and must be solved numerically. 
In the present study, a finite-difference approach has been taken for the numerical 
solution of the governing equations. The method of discretisation of the solution 
domain, the numerical algorithm, and the numerical implementation of boundary 
conditions are discussed in this section. 


5.1 Discretisation and grid generation 


As mentioned earlier, the computational domain consists of two boundaries: (1) an
outer freestream boundary and (2) an inner wall boundary. The outer boundary is
chosen in such a way that the bow shock lies completely within this domain. The
domain is then discretised algebraically for implementation of a finite-difference
procedure. In this procedure, at any given ξ station, a point on the body surface and
a point on the outer boundary are connected by a straight line and grid points are
distributed on this line using the following stretching function:

$$s(\zeta) = 1 - \beta\left\{\frac{[(\beta+1)/(\beta-1)]^{1-\zeta} - 1}{[(\beta+1)/(\beta-1)]^{1-\zeta} + 1}\right\}, \tag{63}$$

$$\zeta = (k-1)/(NK-1), \qquad k = 1, 2, \ldots, NK, \tag{64}$$

where β (β > 1) is the stretching parameter and NK the total number of points on the grid
line. Note that s(0) = 0 and s(1) = 1 and the points are clustered close to the wall for
values of β close to 1. Such clustering is necessary for good resolution of the subsonic
viscous layer.
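
The effect of the stretching parameter can be seen in a short sketch. It assumes the stretching function has the form s = 1 - β(R - 1)/(R + 1) with R = [(β + 1)/(β - 1)]^(1 - ζ), which satisfies the end conditions s(0) = 0 and s(1) = 1 stated above; the function names and the sample values of β are illustrative.

```python
def stretch(zeta, beta):
    """Stretched coordinate s in [0, 1] for a uniform coordinate zeta in [0, 1]."""
    ratio = ((beta + 1.0) / (beta - 1.0)) ** (1.0 - zeta)
    return 1.0 - beta * (ratio - 1.0) / (ratio + 1.0)


def grid_fractions(nk, beta):
    """Fractional distances from the wall for the nk points on a grid line."""
    return [stretch((k - 1) / (nk - 1), beta) for k in range(1, nk + 1)]


fine_wall = grid_fractions(151, 1.01)   # beta close to 1: strong wall clustering
mild_wall = grid_fractions(151, 2.0)    # larger beta: weaker clustering
```

Comparing the first spacing of the two distributions shows that the point nearest the wall moves much closer to it as β approaches 1.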

The coordinates of the grid points on the grid line are obtained from

$$x(\xi, \zeta) = x_w(\xi) + n_1(\xi)\,s(\zeta)\,\delta(\xi), \tag{65}$$

$$z(\xi, \zeta) = z_w(\xi) + n_2(\xi)\,s(\zeta)\,\delta(\xi), \tag{66}$$

where n_1 and n_2 are direction cosines of the unit vector along the grid line and δ is the
linear distance of the outer boundary from the body surface.


5.2 Algorithm 

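An implicit, noniterative finite-difference scheme is used to solve the PNS equations,
(62). This algorithm is an adaptation of one developed by Tannehill et al (1982).
Associating the index i with the streamwise direction ξ and the index k with the normal
direction ζ, this algorithm in delta form is

$$\left\{\mathbf{A}^*_{i,k} + \Delta\xi\,\frac{\partial}{\partial\zeta}(\mathbf{B}_{i,k} - \mathbf{M}_{i,k}) - D_2\right\}\Delta\mathbf{Q}_{i,k} = -\Delta\xi\left\{\frac{\partial}{\partial\zeta}(\mathbf{G}^i_{i,k} - \mathbf{G}^v_{i,k}) - j\,(\mathbf{S}^i_{i,k} - \mathbf{S}^v_{i,k}) - [\dot{\mathbf{W}}_{i,k}]\right\} + D_4\,\mathbf{E}^{i*}_{i,k}, \tag{67}$$

where the change in the vector of dependent variables is

$$\Delta\mathbf{Q}_{i,k} = \mathbf{Q}_{i+1,k} - \mathbf{Q}_{i,k}, \tag{68}$$

and the Jacobian matrices of the various fluxes are

$$\mathbf{A}^* = \frac{\partial \mathbf{E}^{i*}}{\partial \mathbf{Q}}, \qquad \mathbf{B} = \frac{\partial \mathbf{G}^i}{\partial \mathbf{Q}}, \qquad \mathbf{M} = \frac{\partial \mathbf{G}^v}{\partial \mathbf{Q}}. \tag{69}$$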


The derivative ∂/∂ζ is replaced by the conventional 3-point central-difference
operator. D_2 and D_4 are second-order implicit and fourth-order explicit smoothing
operators, respectively. These are required to stabilize the central-difference algorithm.
In the linearization of the viscous flux, the transport properties are assumed to be
locally constant. Note that all the source terms have been lagged in the present
formulation.

The left hand side of the algorithm, (67), corresponds to a block-tridiagonal system of 
equations. The blocks are square matrices of order 4 and order (n + 3) for the 
ideal/equilibrium gas case and the nonequilibrium case, respectively. Specifically, for 
the five-species nonequilibrium air model considered in the present calculations, these 
blocks are square matrices of order 8. 

The code developed in the present study is a unified code for all the three gas models. 
Therefore, the block tridiagonal solver developed here is meant for systems of any 
block order m (m > 1). The penalty paid in using a general solver instead of using one for 
order 4 and one for order 8 is not substantially large. 
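
A block-tridiagonal solver for arbitrary block order can be written compactly with the block form of the Thomas algorithm. The sketch below is a plain-Python illustration of that standard technique and is not the solver used in the present code; the names `solve_dense`, `mat_mul`, `mat_sub` and `block_thomas` are hypothetical.

```python
def solve_dense(a, b):
    """Solve a X = b (a is m x m, b is m x q) by Gaussian elimination with pivoting."""
    m, q = len(a), len(b[0])
    aug = [list(a[i]) + list(b[i]) for i in range(m)]
    for col in range(m):
        piv = max(range(col, m), key=lambda r: abs(aug[r][col]))
        if aug[piv][col] == 0.0:
            raise ZeroDivisionError("singular block")
        aug[col], aug[piv] = aug[piv], aug[col]
        for r in range(col + 1, m):
            f = aug[r][col] / aug[col][col]
            for c in range(col, m + q):
                aug[r][c] -= f * aug[col][c]
    x = [[0.0] * q for _ in range(m)]
    for i in reversed(range(m)):
        for j in range(q):
            s = aug[i][m + j] - sum(aug[i][c] * x[c][j] for c in range(i + 1, m))
            x[i][j] = s / aug[i][i]
    return x


def mat_mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]


def mat_sub(a, b):
    return [[a[i][j] - b[i][j] for j in range(len(a[0]))] for i in range(len(a))]


def block_thomas(lower, diag, upper, rhs):
    """Solve lower[k] x[k-1] + diag[k] x[k] + upper[k] x[k+1] = rhs[k].

    lower[0] and upper[-1] are ignored; blocks may have any order m >= 1.
    """
    n = len(diag)
    c_mod = [None] * n  # modified upper blocks
    d_mod = [None] * n  # modified right-hand sides (stored as columns)
    for k in range(n):  # forward elimination
        d = diag[k]
        r = [[v] for v in rhs[k]]
        if k > 0:
            d = mat_sub(d, mat_mul(lower[k], c_mod[k - 1]))
            r = mat_sub(r, mat_mul(lower[k], d_mod[k - 1]))
        if k < n - 1:
            c_mod[k] = solve_dense(d, upper[k])
        d_mod[k] = solve_dense(d, r)
    x = [None] * n
    x[n - 1] = d_mod[n - 1]
    for k in range(n - 2, -1, -1):  # back substitution
        x[k] = mat_sub(d_mod[k], mat_mul(c_mod[k], x[k + 1]))
    return [[row[0] for row in block] for block in x]
```

Because the block order is read from the data, the same routine serves blocks of order 4 and of order 8; the cost of this generality is only the loop overhead in the small dense solves.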


5.3 Decoding the Q vector 

At the end of the solution step given by (68), we have just the vector Q_{i+1,k}. From this
vector we have to extract all the thermodynamic and transport coefficients and apply
the boundary conditions. The decoding procedure and implementation of the boundary
conditions for each of the gas models are considered in what follows.

5.3a Ideal air: Decoding the Q vector for an ideal gas is rather simple. One can
easily obtain the variables ρ, u, w, and e_t from Q. Knowing the variables u, w, and e_t, the
variable e is obtained from (3). The variables T, p, a, μ and κ follow from (19)-(23),
respectively.
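
The decoding for the ideal gas can be sketched as below. Several assumptions are made here because the referenced equations are not reproduced in this section: the vector is taken as (ρ, ρu, ρw, ρe_t) with the Jacobian scaling already removed, the internal energy is e = e_t - (u² + w²)/2, and the standard perfect-gas relations are used with γ = 1.4 and an assumed gas constant of 287.05 J/(kg K). The transport properties are omitted.

```python
import math

GAMMA = 1.4      # ratio of specific heats (assumed)
R_AIR = 287.05   # specific gas constant of air in J/(kg K) (assumed)


def decode_ideal(q):
    """Recover primitive variables from q = (rho, rho*u, rho*w, rho*e_t)."""
    rho = q[0]
    u = q[1] / rho
    w = q[2] / rho
    e_t = q[3] / rho
    e = e_t - 0.5 * (u * u + w * w)   # internal energy from total energy
    p = (GAMMA - 1.0) * rho * e       # perfect-gas equation of state
    temperature = p / (rho * R_AIR)
    a = math.sqrt(GAMMA * p / rho)    # frozen speed of sound
    return {"rho": rho, "u": u, "w": w, "e": e, "p": p, "T": temperature, "a": a}
```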

5.3b Equilibrium air: Decoding the Q vector for this case is a little more involved. As
before, we can easily obtain the variables ρ, u, w and e_t from Q. Again, using (3) and the
values of u, w and e_t we can compute the value of e. Knowing e and ρ, we can use the
correlations given in (25) to determine the values of p, a and T. Using (27) and (28), we
compute the parameters γ and β. The transport properties are easily determined
from (32).


5.3c Nonequilibrium air: Decoding the Q vector for this case is again quite involved.
We can easily obtain the variables ρ, u, w, H, and c_1, c_2, ..., c_{n-1} from Q. The mass
fraction c_n is computed from (11). Using (10) and the values of u, w, and H we can
compute the value of h. Knowing h and the mass fractions c_1, c_2, ..., c_n, we compute the
temperature T of the mixture iteratively using the Newton-Raphson method:

$$T^{p+1} = T^{p} - \frac{\sum_{s=1}^{n} c_s h_s(T^p) - h}{\sum_{s=1}^{n} c_s C_{p,s}(T^p)}, \tag{70}$$


where p is an iteration index. The Newton-Raphson method usually requires no more 
than 3 to 5 iterations for convergence. Once the temperature has been computed, the 
other thermodynamic and transport properties (for both the species and the mixture) 
are computed using expressions listed in the section on gas models. 
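
The temperature iteration can be sketched as follows. The species enthalpy and specific-heat functions used here are hypothetical polynomials chosen only so that the example runs; they are not the Blottner or Liu-Vinokur curve fits, and the function name `mixture_temperature` is likewise illustrative.

```python
def mixture_temperature(h_target, c, h_funcs, cp_funcs, t_guess=1000.0,
                        tol=1e-8, max_iter=50):
    """Newton-Raphson solution of sum_s c_s h_s(T) = h_target for T."""
    t = t_guess
    for _ in range(max_iter):
        h = sum(cs * hf(t) for cs, hf in zip(c, h_funcs))
        cp = sum(cs * cf(t) for cs, cf in zip(c, cp_funcs))
        dt = (h - h_target) / cp   # residual divided by its temperature derivative
        t -= dt
        if abs(dt) < tol * max(1.0, abs(t)):
            return t
    raise RuntimeError("temperature iteration did not converge")


# Hypothetical two-species data with C_p = a + b*T and h = a*T + b*T**2/2.
CP_FUNCS = [lambda t: 1000.0 + 0.1 * t, lambda t: 900.0 + 0.2 * t]
H_FUNCS = [lambda t: 1000.0 * t + 0.05 * t * t, lambda t: 900.0 * t + 0.1 * t * t]
MASS_FRACTIONS = [0.79, 0.21]
```

Because the mixture specific heat is the exact derivative of the mixture enthalpy with respect to temperature at fixed composition, the iteration converges quadratically once it is close to the root.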


5.4 Numerical boundary conditions 

The implementation of boundary conditions (discussed in the section on governing
equations) is considered here. In the implicit portion of the algorithm (left-hand side of
(67)), boundary conditions are applied to only first-order accuracy. This is done to
preserve the block tridiagonal structure of the matrix. After the integration step, the
boundary conditions are updated explicitly. This is discussed for each of the gas models
in what follows.


5.4a Ideal gas: At the wall, the pressure is assumed to be the same as the pressure
one grid point above it (17). Using this value of pressure and the given wall temperature
T_w, the value of the mass density at the wall can be computed. No-slip conditions are
implemented by setting the values of u and w to zero.

At the freestream boundary, Q at the grid point is updated using the values at the 
grid point just in the interior of the domain, i.e., a zero-gradient condition is imposed 
on Q. 

5.4b Equilibrium air: At the wall, the pressure is assumed to be the same as the
pressure one grid point above it (17). Now the temperature of the wall is specified as T_w.
Using the second correlation of (26), i.e.,

$$T_w = T_w(p_w, \rho_w), \tag{71}$$

the wall density ρ_w is determined iteratively using a secant method (Prabhu & Tannehill
1986). Using the first correlation of (26), the enthalpy at the wall is determined for
given values of p and ρ. The internal energy at the wall is then updated using the
definition of the enthalpy:

$$e_w = h_w - (p_w/\rho_w). \tag{72}$$

This is done in order to ensure consistency in the use of curve fits. No-slip conditions 
are implemented by setting the values of u and w to zero. 
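
The secant iteration for the wall density can be illustrated with a stand-in correlation. In the sketch below the curve fit T(p, ρ) is replaced by the perfect-gas relation T = p/(ρR) purely so that the example is self-contained and has a known answer; the function names, the gas constant and the starting guesses are all assumptions.

```python
R_AIR = 287.05  # J/(kg K), used only by the stand-in correlation


def stand_in_temperature(p, rho):
    """Placeholder for the equilibrium-air curve fit T = T(p, rho)."""
    return p / (rho * R_AIR)


def secant_wall_density(temperature_of, p_wall, t_wall, rho0, rho1,
                        tol=1e-10, max_iter=50):
    """Find rho_w such that temperature_of(p_wall, rho_w) equals t_wall."""
    f0 = temperature_of(p_wall, rho0) - t_wall
    f1 = temperature_of(p_wall, rho1) - t_wall
    for _ in range(max_iter):
        if f1 == f0:
            return rho1  # flat secant: no further progress possible
        rho2 = rho1 - f1 * (rho1 - rho0) / (f1 - f0)
        if abs(rho2 - rho1) <= tol * abs(rho2):
            return rho2
        rho0, f0 = rho1, f1
        rho1 = rho2
        f1 = temperature_of(p_wall, rho1) - t_wall
    raise RuntimeError("secant iteration did not converge")


rho_wall = secant_wall_density(stand_in_temperature, 100.0, 1200.0, 1.0e-4, 2.0e-4)
```

A secant method is attractive here because it needs only evaluations of the correlation and no derivative of the curve fit.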

At the freestream boundary, a zero-gradient condition is imposed on Q.

5.4c Nonequilibrium air: At the wall, the pressure is assumed to be the same as the
pressure one grid point above it (17). Now the temperature of the wall is specified as T_w.
The mass fractions of the species are computed using zeroth-order extrapolation from
the interior. Using these values of mass fractions, temperature, and pressure, the density
of the gas mixture is computed from the equation of state (41). No-slip conditions are
implemented by setting the values of u and w to zero.

At the freestream boundary, a zero-gradient condition is imposed on Q. 


6. Results 

6.1 Test conditions 

The laminar hypersonic flow past a sharp right circular cone has been computed in the
present study. The details of the test conditions are given in table 1. The ambient
conditions correspond to an altitude of approximately 61 km. The Mach and Reynolds
numbers for these conditions are 25 and 0.13 × 10⁶ per m, respectively.

6.2 Preliminaries 

The calculations were started at x = 1 × 10⁻³ m from the tip of the cone. This was done
to avoid the singularity at x = 0 where z = 0. The line of initial data was initialised using
freestream values and this line was marched downstream to a location of x = 1.276 m with
the marching step-size varying linearly from Δx = 5 × 10⁻⁴ m to Δx = 1.2 × 10⁻³ m
over 1500 steps. The spacing at the wall was also varied linearly from Δz_b = 5 × 10⁻⁵ m
to Δz_b = 10⁻⁴ m. The outer boundary height, however, was kept fixed at δ(x) = 0.5 m.
The number of grid points was chosen after detailed grid sensitivity studies were carried 
out. A sampling of the results obtained from the various studies in this work is 
presented in what follows (some more results and details can be found in Prabhu 1993). 
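
The marching schedule quoted above is internally consistent, as the following sketch shows. It assumes that the step size is interpolated linearly between the first and the last of the 1500 steps; the function name is hypothetical.

```python
def marching_stations(x_start, dx_first, dx_last, n_steps):
    """Axial stations for a step size that varies linearly from first to last step."""
    x = x_start
    stations = [x]
    for i in range(n_steps):
        frac = i / (n_steps - 1)
        x += dx_first + frac * (dx_last - dx_first)
        stations.append(x)
    return stations


stations = marching_stations(1.0e-3, 5.0e-4, 1.2e-3, 1500)
```

The steps sum to 1500 × (5 × 10⁻⁴ + 1.2 × 10⁻³)/2 = 1.275 m, which, added to the starting station at 10⁻³ m, gives the final station at x = 1.276 m.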

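In the presentation of the results, the following nondimensional quantities (surface
pressure coefficient C_p, skin-friction coefficient C_f, and Stanton number St) have been
defined as

$$C_p = \frac{p_w - p_\infty}{\tfrac{1}{2}\rho_\infty U_\infty^2}, \qquad C_f = \frac{\tau_w}{\tfrac{1}{2}\rho_\infty U_\infty^2}, \qquad St = \frac{\dot{q}_w}{\rho_\infty U_\infty (H_\infty - H_w)}, \tag{73}$$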


where p_w is the wall pressure (Pa), τ_w is the surface shear stress (N/m²), and q̇_w (W/m²) and
H_w (J/kg) are, respectively, the total heating and the total enthalpy at the wall. The total
heating at the wall is conductive heating in the case of an ideal gas, and the sum of
conductive and diffusive heating in the case of equilibrium and nonequilibrium air.

Table 1. Test conditions.

Cone half angle (θ_c)                     10°
Freestream velocity (U_∞)                 8100 m/s
Freestream pressure (p_∞)                 20.35 Pa
Freestream temperature (T_∞)              252.6 K
Wall temperature (T_w)                    1200 K
Freestream N₂ mass fraction (c_N₂)        0.79
Freestream O₂ mass fraction (c_O₂)        0.21
Noncatalytic wall

6.3 Grid study 

The sensitivity of the computed solution to varying grid sizes was studied first to
establish the consistency of the numerical scheme. Six different grid sizes (NK = 67,
75, 101, 125, 151, and 167) were tried for the ideal gas (γ = 1.4) and the nonequilibrium
air models. Some of the results for the nonequilibrium case are presented here and the
grid sensitivity of the solution for the ideal gas case can be found in Prabhu (1993).

The axial variation of the surface pressure coefficient C_pw for the various grid sizes is
shown in figure 6. The impact of the grid resolution is clearly seen in the picture. As the
grid is refined the solution changes. Though the changes are very pronounced for the
coarser grids, for the two finest grids (NK = 151 and 167) the differences between the
solutions are negligibly small. This indicates that the solution has become independent
of the grid. The Newtonian value of the surface pressure coefficient is also shown in the
figure. The deviation between the PNS results and the Newtonian results is due to the
viscous-inviscid interaction, which is rather strong at the tip of the cone for the chosen
Mach number and Reynolds number in the present study.
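
The reference values used in this comparison are easy to reproduce from table 1. The sketch below computes the freestream Mach number for a perfect gas and the Newtonian pressure coefficient C_p = 2 sin²θ_c; the gas constant of 287.05 J/(kg K) and γ = 1.4 are assumed values, and the function names are illustrative.

```python
import math

GAMMA = 1.4      # assumed ratio of specific heats of cold air
R_AIR = 287.05   # assumed specific gas constant in J/(kg K)


def freestream_mach(velocity, temperature):
    """Mach number of a perfect gas at the given speed and static temperature."""
    return velocity / math.sqrt(GAMMA * R_AIR * temperature)


def newtonian_cp(half_angle_deg):
    """Newtonian surface pressure coefficient for a surface inclined at the given angle."""
    return 2.0 * math.sin(math.radians(half_angle_deg)) ** 2


mach = freestream_mach(8100.0, 252.6)   # table 1 velocity and temperature
cp_newton = newtonian_cp(10.0)          # table 1 cone half angle
```

Under these assumptions the Mach number comes out close to the quoted value of 25, and the Newtonian pressure coefficient for the 10° cone is about 0.06.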

Figure 7 shows the pressure profiles at the base of the cone (at x = 1.276 m) for
various grid sizes. The grid size has a significant impact on the pressure. The shock gets
progressively sharper and the post-shock pressure rises with grid fineness. For the finest
two grids, however, the differences in the profiles are negligibly small.

The pressure profiles also show oscillations which are characteristic of central-difference
schemes. The amplitudes of these oscillations grow with the grid fineness.
Recall that central differencing has been used for the inviscid fluxes. This means that the
flux differencing is not biased in the direction of signal propagation and hence, the
numerical damping that is necessary for algorithm stability will smear any discontinuities
that are captured as part of the solution. In the coarse mesh the captured outer shock
smears over quite a few grid points which are spaced rather widely apart. In the fine
mesh the shock still spreads over nearly the same number of grid points which are,
however, quite closely spaced, resulting in a sharper definition of the captured shock. It
must be mentioned here that the numerical smoothing coefficient was kept constant
(and at a low value) across all grid sizes. This causes a problem in the finer grids where
high-frequency errors need to be damped out with a larger amount of artificial
viscosity. This explains the increased amplitude of the oscillations for the finest grids.

Figure 6. Axial variation of the computed surface pressure coefficient for various grid sizes
for the nonequilibrium gas model of Blottner et al (1971). The Newtonian pressure
coefficient distribution is also shown.

Figure 7. Effect of grid size on the static pressure. Profiles of p at x = 1.276 m for various grid
sizes. The oscillations are due to the use of central differences for the inviscid fluxes.

Figure 8 shows the temperature profiles at the base of the cone (at x = 1.276 m) for
various grid sizes. The agreement is good at all points except at the points in the shock



Figure 8. Effect of grid size on the static temperature. Profiles
of T at x = 1.276 m for various grid sizes.









Dinesh K Prabhu



Figure 9. Effect of grid size on the mass fraction of O (atomic
oxygen). Profiles of c_O at x = 1.276 m for various grid sizes.


region. The grid fineness has little or no impact on the peak temperature in the boundary 
layer and the temperature profiles do not exhibit the oscillations seen in the pressure. 

The mass fraction profiles of two representative species, atomic oxygen (O) and
nitric oxide (NO), are shown in figures 9 and 10, respectively. Again, the grid
refinement has a significant influence on the chemistry. The changes in the mass
fractions, however, decrease at the finest grid levels. Large changes in the chemistry in
the coarser grids can pose problems in radiation calculations which are very sensitive to
changes in the species concentrations.

On the basis of the grid sensitivity study for both the ideal gas and nonequilibrium air
gas models, a grid size of NK = 151 points was chosen for all subsequent calculations.
This was done in order to retain the gain in accuracy (with finer grids) while keeping the
computing times down to a reasonable level.

Figure 10. Effect of grid size on the mass fraction of NO (nitric oxide). Profiles of c_NO at
x = 1.276 m for various grid sizes.

6.4 Species structure study 

After having studied the grid dependence of the computed solution for the nonequilibrium
air case, the impact of the thermodynamic properties of the constituent species
was studied for three models of the internal structure: (1) ideal species model, (2) model
due to Blottner et al (1971), and (3) model due to Liu & Vinokur (1989).

The pressure profiles at x = 1.276 m for the three species structure models are shown
in figure 11. While the results of the Blottner and Liu-Vinokur models are indistinguishable,
they are very different from those of the ideal species model, especially in the
shock region. At the body, however, all three models predict nearly the same surface
pressure. This is understandable because the pressure, being a “mechanical” variable, is
less dependent on the internal structure of the species. The shock standoff distance, on
the other hand, depends on the thermodynamic properties of the species. In the region
behind the shock where there are very high temperatures, the internal degrees of
freedom of the species are excited.

The temperature profiles at x = 1.276 m for the three species structure models are
shown in figure 12. Again, the results of the Blottner and Liu-Vinokur models are
indistinguishable. However, these results are very different from those of the ideal
species model. The post-shock and peak boundary layer temperatures are higher for
the ideal species model. This is purely due to the internal structure of the species. The
internal structure enables the species to absorb the collisional energy thereby lowering
the temperatures.

Since the ideal species lack internal structure, the collisional energy can only lead to
increased dissociation. This is clearly seen in figures 13 and 14 which depict the mass
fraction profiles of atomic oxygen (O) and nitric oxide (NO), respectively. While the
differences between the results of the Blottner and Liu-Vinokur models are negligibly
small, the ideal species model leads to a larger amount of dissociation.

Figure 12. Effect of the internal structure of atoms and molecules on the temperature.
Temperature profiles at x = 1.276 m are shown here for the nonequilibrium model.

6.5 Gas model study 

The principal aim of this portion of the work is to study the ideal, equilibrium, and
nonequilibrium (Liu-Vinokur) air models in the context of external hypersonic flows
and determine their influence on the thermodynamic environment. Again a fine grid
(NK = 151) was chosen and identical marching step sizes and grid spacings were used
for all the computations.



Figure 13. Effect of the internal structure of atoms and molecules on the chemistry. Mass
fraction profiles of atomic oxygen (O) at x = 1.276 m are shown here for the
nonequilibrium model.







High-temperature effects in hypersonic flows 





Figure 14. Effect of the internal structure of atoms and molecules on the chemistry. Mass
fraction profiles of nitric oxide (NO) at x = 1.276 m are shown here for the
nonequilibrium model.


The pressure and temperature profiles at x = 1.276 m are shown in figures 15 and 16,
respectively. There are differences in the shock standoff distances for the three gas
models, with the shock standoff distance being the largest for ideal gas and the smallest
for equilibrium air. The elevated temperatures behind the shock serve to excite the
internal degrees of freedom of the gases that constitute the mixture. The effect of this is
to alter the value of γ behind the shock, thus altering the standoff distance. The peak
temperatures within the boundary layer are significantly different between the three
models. The effect of chemical reactions (finite or infinite rates) is to drain the kinetic
energy contained in the fluid and use it in the chemical reactions and/or excitation of
internal states of the constituent species. This shows up as an overall reduction in the
peak temperature in the boundary layer.

Figure 15. Pressure profiles at x = 1.276 m for ideal, equilibrium and nonequilibrium air
models. Oscillations in the pressure are typical of central-difference schemes.

Figure 16. Temperature profiles at x = 1.276 m for ideal, equilibrium and nonequilibrium
air models.

The departure of the gas from ideal behaviour can be studied using the two
parameters γ and β defined previously. The profiles of γ and β at x = 1.276 m are
depicted in figures 17 and 18, respectively. Note that γ changes from its usual value of
1.4 immediately before the shock to about 1.3 for the nonequilibrium case and about
1.18 in the equilibrium case. This is to be expected because the internal energy and
enthalpy of the species are dependent on the local temperature which varies depending
on the gas model. The parameter β, which is a ratio of the local molecular weight to the
average molecular weight, varies from 1 to about 1.2. This parameter becomes
important only in the reaction zone where the mass concentrations vary. Hence, there is
no difference at the shock but there is substantial variation in the boundary layer. For
the equilibrium air case, β varies at the shock. This difference can perhaps be attributed
to the piecewise continuity of the curve fits for equilibrium air. This effect has to be
studied in greater detail.

Figure 17. Profiles of the ratio of enthalpy and internal energy at x = 1.276 m for ideal,
equilibrium and nonequilibrium air models.

The axial variation of the surface pressure coefficient (C_p) is shown in figure 19. The
ideal and nonequilibrium surface pressures are very nearly the same but differ from that
for equilibrium air. This is quite surprising since the surface pressure is expected to be
nearly the same irrespective of the gas model. The Newtonian value of the pressure
coefficient is also shown along with the inviscid ideal gas value obtained from the tables
of Sims (1964). Actually, the cone tables of Sims provide C_p values for freestream Mach
numbers ranging from 1.5 to 20.0 and for cone angles ranging from 2.5° to 30°. The
value corresponding to M_∞ = 25 for a cone angle of 10° was extrapolated from the
tabulated values. The computed surface pressure coefficient for the ideal gas model
asymptotes to the inviscid ideal gas value of Sims in the weak interaction zone.

Figure 19. Effect of ideal, equilibrium and nonequilibrium air models on the computed
surface pressure coefficient. The inviscid ideal gas value of C_p is extrapolated from the
tables of Sims (1964).

Figure 20. Effect of ideal, equilibrium and nonequilibrium air models on the computed
Stanton number.

The axial variation of the Stanton number (St) is shown in figure 20. The Stanton 
number is higher for the equilibrium case than the nonequilibrium case which, in turn, 
is higher than that for the ideal gas case. 

The displacement thickness δ* (m) of a wall boundary layer is defined as

$$\delta^* = \int_0^{z_{be}} \left(1 - \frac{\rho u_t}{\rho_e u_{t,e}}\right) dz_b, \tag{74}$$

where z_b (m) is measured along a line normal to the cone surface, u_t (m/s) is the velocity
tangential to the cone surface, and the subscript e denotes edge values. In the present
PNS code the grid lines are always normal to the body surface. Consequently, locating
the boundary-layer edge is quite simple. The edge of the boundary layer is located (at
every step) by using the specific total enthalpy H of the gas. For an isothermal wall, the
specific total enthalpy is dissipated in the viscous region close to the wall. The point
z_be at which the dissipation ceases, or equivalently where the total enthalpy reaches
98% of its freestream value, is chosen as the boundary-layer edge. See for instance the
profiles of H at x = 1.276 m shown in figure 21. Using the value of z_be and (74), the
displacement thickness is computed.
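
The two steps described above (edge location from the total enthalpy, then integration of the mass-flux deficit) can be sketched as below. This is an illustration under simple assumptions: the profiles are given as lists ordered from the wall outward, the edge is taken as the first point at which H reaches 98% of the freestream value, and the integral is evaluated with the trapezoidal rule. The function names are hypothetical.

```python
def boundary_layer_edge(total_enthalpy, h_freestream, fraction=0.98):
    """Index of the first point (from the wall) where H reaches the given fraction."""
    for k, h in enumerate(total_enthalpy):
        if h >= fraction * h_freestream:
            return k
    raise ValueError("total enthalpy never reaches the edge criterion")


def displacement_thickness(z_b, rho, u_t, edge):
    """Trapezoidal estimate of the integral of 1 - rho*u_t/(rho_e*u_te) up to the edge."""
    rho_u_edge = rho[edge] * u_t[edge]
    total = 0.0
    for k in range(edge):
        f0 = 1.0 - rho[k] * u_t[k] / rho_u_edge
        f1 = 1.0 - rho[k + 1] * u_t[k + 1] / rho_u_edge
        total += 0.5 * (f0 + f1) * (z_b[k + 1] - z_b[k])
    return total
```

For a constant-density layer with a linear velocity profile of thickness δ, the sketch returns δ/2, which is the exact displacement thickness for that profile.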

The axial variation of the displacement thickness of the boundary layer is shown in
figure 22. There are two important features to be noted here. The first is that the axial
variation of δ* is shown only from about x = 0.2 m. This is because there was difficulty
in locating the boundary-layer edge near the tip of the cone. For a high Mach number
and low Reynolds number flow, the viscous-inviscid interaction near the tip region is
very strong. In fact there is a merger of the shock and viscous boundary layer making it
difficult to define the boundary-layer edge. The second is that, while the boundary layer
in the equilibrium gas is considerably thinner, the boundary layers in the ideal and
nonequilibrium gases are nearly of the same thickness. The viscosities for the
nonequilibrium and ideal gas cases are not very different from each other except in the
high-temperature region near the wall. Further, the rate at which the total enthalpy
dissipates in the boundary layer is the same for both the ideal and nonequilibrium cases.

Figure 21. Profiles of total enthalpy at x = 1.276 m. The total enthalpy is dissipated in the
viscous layer close to the wall when the wall is isothermal. This profile helps in locating
the boundary-layer edge which is then used in the computation of displacement thickness.

Figure 22. Axial variation of the computed displacement thickness for the ideal,
equilibrium and nonequilibrium gas models. The computed data before x = 1.276 m are not
reliable due to strong viscous-inviscid interaction which poses difficulties in locating
the boundary-layer edge.

Figure 23. Comparison of shock shapes for ideal, equilibrium and nonequilibrium air
models. The shock is “captured” as part of the solution. The shock is located by the peak
in the pressure profile. Due to strong viscous-inviscid interaction it is not possible to
locate the shock very accurately before x = 0.1 m.

An important feature of hypersonic external flows is that the viscous shock layers are
extremely thin. This was verified in the present computations through the shock
standoff distance. The approximate shock standoff distance at every x station was
determined by locating the peak point in the pressure profile at that station. Since the
shock was not “fit” as an infinitesimally thin discontinuity but was captured, the
pressure peak location was assumed to be a good measure. The shock standoff distance
as a function of x for the ideal, equilibrium, and chemical nonequilibrium air models is
shown in figure 23. Again the axial variation is shown only from x = 0.1 m because
distinguishing between the shock and boundary layer before this location is quite
difficult. It is clearly seen that the shock angle (measured from the body) is quite small.
Further, the shock standoff distance for equilibrium air is the smallest while that for
ideal gas is the largest. This is because γ varies between 1.4 for the ideal gas and 1.18
for equilibrium air.
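
The pressure-peak criterion described above can be illustrated with a minimal sketch. The function name `shock_standoff`, its arguments and the synthetic pressure profile are assumptions made only for illustration; they are not taken from the code used in the present study.

```python
import numpy as np


def shock_standoff(z, p, z_wall=0.0):
    """Distance from the wall to the pressure peak at one x station.

    z      : radial grid coordinates along the station (m), 1-D array
    p      : static pressure at those grid points (Pa), 1-D array
    z_wall : radial position of the body surface at this station (m)
    """
    z = np.asarray(z, dtype=float)
    p = np.asarray(p, dtype=float)
    i_peak = int(np.argmax(p))  # grid index taken as the captured-shock location
    return z[i_peak] - z_wall


# Synthetic profile (illustrative only): a smeared pressure peak centred at z = 0.03 m.
z = np.linspace(0.0, 0.1, 101)
p = 1.0 + 5.0 * np.exp(-((z - 0.03) / 0.005) ** 2)
standoff = shock_standoff(z, p)
```

Because the captured shock is smeared over a few grid points, the resolution of such an estimate is limited to about one grid spacing.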



Figure 24. Approximate shock wave angles for ideal, equilibrium, and nonequilibrium air
models. The displacement effect of the boundary layer causes the shock to curve initially
(in the strong interaction zone) before it straightens out further downstream.

To determine whether or not the outer shock was conical, the local inclination of the
shock was computed. The axial variations of the local shock angle for the ideal,
equilibrium and chemical nonequilibrium air models are shown in figure 24. The shock
inclination varies as a function of x while asymptoting to a constant value due to the
displacement effect of the boundary layer. The fairly thick boundary layer at the
leading edge pushes up against the shock, thus altering the conical nature. As the
inviscid-viscous interaction decreases downstream, the shock becomes nearly conical.
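
One simple way of estimating the local shock inclination from a computed shock shape is to differentiate the shock location with respect to x. The sketch below assumes that the shock location is available as an array `z_shock` at the stations `x` and that the angle is measured from the cone axis; the function name and the 12° test shock are hypothetical.

```python
import numpy as np


def local_shock_angle(x, z_shock):
    """Local shock inclination (degrees, measured from the x axis).

    x       : axial stations (m), 1-D array
    z_shock : radial shock location at each station (m), 1-D array
    """
    x = np.asarray(x, dtype=float)
    z_shock = np.asarray(z_shock, dtype=float)
    slope = np.gradient(z_shock, x)  # finite-difference estimate of dz/dx
    return np.degrees(np.arctan(slope))


# Illustrative check: a perfectly conical shock has the same angle at every station.
x = np.linspace(0.1, 1.5, 15)
z_conical = np.tan(np.radians(12.0)) * x
angles = local_shock_angle(x, z_conical)
```

A shock that is curved near the tip and conical downstream would show an angle that varies with x before settling to a constant value.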

All the results presented here were obtained on an HCL Magnum Multi-RISC
computer in the CTFD Division of NAL. The ideal gas calculations took 166 s of CPU
time, the equilibrium air calculations took 204 s of CPU time, and the nonequilibrium
gas calculations with the Liu-Vinokur model for species thermodynamic properties
took about 580 s of CPU time.

7. Concluding remarks 

The laminar hypersonic flow of air around a 10° cone has been successfully computed
using a finite-difference algorithm. Three gas models have been considered: (1) ideal
gas (γ = 1.4), (2) equilibrium air with curve-fits for thermodynamic and transport
properties, and (3) nonequilibrium air with finite-rate chemistry. Based on the results of
the calculations the following observations can be made.

• The numerical algorithm chosen for the present study is consistent, in the sense that 
the solution converges with diminishing grid spacing. This has been verified for both 
the ideal and nonequilibrium air cases. 

• The choice of an unbiased differencing scheme such as central difference leads to 
poor definition of the captured shock. Artificial dissipation which is introduced in an 
ad hoc manner in such schemes is essential to stabilise the solution and get rid of the 
high-frequency errors. Low levels of artificial dissipation lead to “ringing” at the 
shock. Further, this artificial dissipation has to be increased with increasing grid size. 
The arbitrariness associated with artificial smoothing can be removed through the 
use of robust upwind methods - methods which bias the differencing in the direction 
of acoustic signal propagation (Lawrence 1991). 

• The results of the calculations can change significantly depending on the choice of
the gas model. The idealisation that the species lack an internal structure is not very
good. However, for the test case chosen in the present study, the results obtained
using the Blottner model (Blottner et al 1971) for nonequilibrium calculations are
not significantly different from those obtained using a more accurate model developed
by Liu & Vinokur (1989). This is because the temperatures in the flowfield do not
exceed 8000 K and below this temperature the thermodynamic properties of the
species are nearly the same. However, the thermodynamic model of Liu & Vinokur
is recommended for its accuracy at temperatures above 8000 K.

• Some key features of hypersonic shock layers have been demonstrated in the present
study: the thinness of the viscous shock layer, the displacement effect on the outer
shock due to the boundary layer, the strong viscous-inviscid interaction near the
nose of the cone, and the effects of high temperatures in the shock layer.

Apart from the accuracy of the species thermodynamic properties, there are also
issues associated with the accuracy of the species transport properties and reaction
rates. The reaction rates and transport coefficients due to Blottner et al (1971) are fairly
old and need to be updated. The sensitivity of the solutions to the accuracy of these
parameters must be studied.

There is a paucity (at least in the public domain) of experimental results for
hypersonic speeds and this makes the task of validation of a hypersonic code with
high-temperature effects very difficult. To some extent, the aerothermodynamic
environment encountered at hypersonic speeds can be created through a large shock tunnel
(either combustion or free piston driven). This, however, does not currently exist in
India.

In the present code, the block-tridiagonal solver is written for a general block of 
order n and this solver does not use any pivoting (either partial or full) in the LU 
decomposition of the block and is thus quite fast. However, this solver is slower than 
the one that is “hardwired” for a particular block size. This disadvantage of the general 
solver has to be carefully weighed against the advantage of reusability that such a solver 
has to offer. 
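
The idea of a general block-tridiagonal solver with unpivoted LU decomposition of the blocks can be sketched as follows. This is a minimal illustration of the standard block Thomas algorithm and not the solver used in the present code; the function names and the storage of the blocks as arrays `A`, `B`, `C` (sub-diagonal, diagonal and super-diagonal blocks of order n) are assumptions. Omitting pivoting is safe only when the diagonal blocks are well conditioned, e.g. diagonally dominant.

```python
import numpy as np


def lu_nopivot(a):
    """Doolittle LU factors of a square block, stored in one matrix, no pivoting."""
    lu = np.array(a, dtype=float)
    n = lu.shape[0]
    for k in range(n - 1):
        lu[k + 1:, k] /= lu[k, k]  # multipliers (column k of L)
        lu[k + 1:, k + 1:] -= np.outer(lu[k + 1:, k], lu[k, k + 1:])
    return lu


def lu_solve(lu, b):
    """Solve (L U) x = b, where b is a vector or a matrix of right-hand sides."""
    x = np.array(b, dtype=float)
    n = lu.shape[0]
    for i in range(1, n):  # forward substitution (unit lower triangle)
        x[i] -= lu[i, :i] @ x[:i]
    for i in range(n - 1, -1, -1):  # back substitution
        x[i] = (x[i] - lu[i, i + 1:] @ x[i + 1:]) / lu[i, i]
    return x


def block_thomas(A, B, C, d):
    """Solve A[i] x[i-1] + B[i] x[i] + C[i] x[i+1] = d[i], i = 0..N-1.

    A[0] and C[N-1] are ignored. Blocks are n x n, d[i] has length n.
    """
    N = len(B)
    Cp = [None] * N  # modified super-diagonal blocks
    dp = [None] * N  # modified right-hand sides
    lu = lu_nopivot(B[0])
    if N > 1:
        Cp[0] = lu_solve(lu, C[0])
    dp[0] = lu_solve(lu, d[0])
    for i in range(1, N):  # forward elimination
        lu = lu_nopivot(B[i] - A[i] @ Cp[i - 1])
        if i < N - 1:
            Cp[i] = lu_solve(lu, C[i])
        dp[i] = lu_solve(lu, d[i] - A[i] @ dp[i - 1])
    x = [None] * N
    x[N - 1] = dp[N - 1]
    for i in range(N - 2, -1, -1):  # back substitution
        x[i] = dp[i] - Cp[i] @ x[i + 1]
    return np.array(x)
```

Since the block order n is a run-time quantity here, the inner loops cannot be unrolled as in a solver written for a fixed block size, which is the speed penalty referred to above.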
List of symbols

a        speed of sound (m/s);
C_p      pressure coefficient; specific heat at constant pressure (J/kg.K);
C_v      specific heat at constant volume (J/kg.K);
𝒟        binary diffusion coefficient (m²/s);
e        specific internal energy (J/kg);
e_t      specific total energy (J/kg);
E        ξ-component of the flux tensor;
G        ζ-component of the flux tensor;
H        specific total enthalpy (J/kg);
h        specific static enthalpy (J/kg);
Le       Lewis number;
M        Mach number;
Pr       Prandtl number;
p        static pressure (Pa);
Q        algebraic vector of conservative variables;
q        heat flux vector (W/m²);
ℛ        universal gas constant (8314.34 J/kmol.K);
R        specific gas constant (J/kg.K);
Re       Reynolds number;
S        inviscid/viscous source vector (axisymmetric);
St       Stanton number;
T        static temperature (K);
u        fluid velocity (mass-averaged) (m/s);
W        chemical source vector;
w        mass rate of production (kg/m³.s);
         grid stretching parameter;
         compressibility;
γ        ratio of specific heats (isentropic exponent);
γ̃        ratio of specific enthalpy and internal energy;
θ        angle (degree);
κ        coefficient of thermal conductivity (W/m.K);
μ        coefficient of viscosity (Pa.s);
ρ        mass density (kg/m³);
ξ, ζ     computational coordinates;
Δξ, Δζ   grid spacing in computational space;
σ        safety factor;
ω        streamwise pressure gradient factor.

Subscripts

e        equilibrium quantity;
f        frozen quantity;
i, k     spatial indices;
r, s     species;
w        wall;
ξ, ζ     components in the ξ, ζ directions; derivatives w.r.t. ξ, ζ coordinates;
∞        freestream.

Superscripts

c        chemical quantity;
i        inviscid quantity;
v        viscous quantity.
Abstract. Research in the area of Radar Absorbing Materials (RAMs)
has been actively pursued for at least four decades. Although resonant
RAMs were originally designed by the transmission line approach, and
broad band RAMs were obtained by multilayering, the quest for ultrawide
band performance has led to novel approaches such as chirality and even
the exploration of biochemical products. It is observed that radome materials are
frequently used as RAMs. The understanding of the underlying principles
of electromagnetic analysis and design, fabrication and the trends in RAMs
reviewed in this paper could lead to indigenisation, and even to pioneering
the next generation of RAM technology.

Keywords. Radar absorbing materials (RAM); Radar cross section (RCS) 
reduction. 


1. Introduction

The introduction of microwave radar during the Second World War revolutionised
the contemporary air defence scenario. The use of microwaves enabled radars to detect
even distant aircraft targets, independent of weather and diurnal variations. This
provided the initial motivation for designing aircraft which would be “invisible”
to radar. Since the late 1970s, research efforts in this direction have intensified
considerably.

Radar signal strength, reflected from a target, determines its detectability. This 
depends on the shape and size of the target and it is identified as its Radar Cross Section 
(RCS). Hence, the detectability of a target can be decreased by reducing its RCS. The 
RCS reduction is in fact important in many military and civilian applications. For the 
military, the RCS reduction of targets such as aircraft and missiles in hostile terrain is 
an important consideration. Similarly, making aircraft hangars near an air traffic 
control radar less reflective is a civilian application. 


Several techniques have been suggested for RCS reduction. They are:

(i) shaping,

(ii) active loading, 

(iii) discrete (passive) loading, and 

(iv) distributed loading. 

Shaping involves modifying the external features of the scatterer to reduce the 
reflections, usually in the backscatter direction. Although shaping appears to be an 
excellent method for RCS reduction, it has several limitations as well. Normally 
reduction is achieved only in a limited angular region and at the expense of increased 
RCS in other regions. Furthermore, shaping is effective only at higher frequencies; in 
fact, at lower frequencies, it is almost ineffective. Finally, shaping must be integrated in 
the system design stage, right from the beginning; it cannot be implemented as an 
afterthought (Knott et al 1985). 

Methods (ii) and (iii) employ active/passive elements at selected points on the target
(where the reflections are maximum) to reduce the overall RCS by means of phase
cancellation. But these are narrow bandwidth methods, still mostly of academic
interest.

The distributed loading technique (iv) is considered to be the most useful and
universal one. It consists of covering the scatterer with suitable materials called
radar absorbing materials (RAMs). The RCS reduction in this case is achieved by both
absorption and redirection of electromagnetic energy. Ideally, a RAM should be an
ultrawide band paint, effective at all polarisations and angles of incidence. For
application to flying objects, it should also have high tensile and compressive strengths,
resistance to impact, tolerance to fuels and hydraulic fluids, and most importantly, low
density (Bhattacharyya & Sengupta 1991).

In order to increase their utility, suitable methods must be found to design broad 
band absorbers with narrow band RAMs as constituent components. Better still would 
be to search for alternative materials which have (ultra) wide band absorption 
characteristics. The answers to these are not readily available, owing to this being 
a classified area. 

Although almost all the literature on RAM is indeed classified, it is worth mentioning
that historically, the origins of RAM have involved rather elementary principles in
electromagnetic theory. Hence, a considerable amount of relevant literature is also
available in the open domain, beginning from the late thirties. A closer scrutiny of this
literature could give clues even to the RAMs that have been classified. Such a study
could also give a feel of the historical perspective and the trends in RAM.

The historical perspective to the evolution of RAMs, including their basic performance
characteristics, is presented in §2 of this paper. Fundamental electromagnetic
principles for the analysis of RAMs, on both flat and curved surfaces, are
outlined in §3. This is followed in §4 by the design and fabrication of RAMs, where both
the resonant and broad band RAMs (including chirals) are discussed. Since the
characterisation of RAMs is extremely important from the design and development
perspective, relevant measurement techniques are introduced in §5. Although
RAMs were primarily intended for RCS reduction of aerospace vehicles, they have
found applications in several other areas as well; these are comprehensively listed
in §6. Finally, a review of the trends in RAM is presented in §7.
2. The evolution of RAMs

Although the commercial production of RAMs started during the 1950s, research
on them was well underway even during the 1930s. The first absorber material to be
patented was the one developed in the Netherlands in 1936 (Naamlooze Vennootschap
Machinerieen 1936). This was a quarterwave resonant material in the 2 GHz region, in
which carbon black was used to achieve dissipation, and TiO2 to achieve a high
dielectric constant.

As pointed out earlier, with the advent of radars during World War II, the quest
for absorbers intensified. Germany and the United States began to work on projects to
develop such materials. It is interesting to note the contrast between the ultimate
motives of the two countries; Germany was primarily looking for absorbers for radar
camouflage, while in the US, the research was directed primarily toward absorbers that
would improve the radar performance (Emerson 1973).

Two types of materials (MacFarlane 1945; Schade 1945) were successfully developed
in Germany for use in submarines. The first, called Wesch material, consisted of
a semiflexible rubber sheet loaded with carbonyl iron powder, and was resonant
around 3 GHz. The second material, the Jaumann absorber, was developed by a multilayer
approach. This provided reflection coefficients typically lower than −20 dB over
a wide band (2-15 GHz) near normal incidence.

In the US, two types of absorbers, generally called HARP (Halpern-anti-radar-paint),
were developed during the same time (Halpren & Johnson), which offered
a reflection reduction of 15-20 dB at resonance. The airborne version, MX-410, was
actually akin to a paint which utilised artificial dielectric materials of high relative
permittivity. The second type was a rugged shipboard version consisting of a high
concentration of iron particles in a neoprene binder. As with the MX-410, this too could
operate in the X-band with a typical thickness of 0.07 inch. A salient feature of this
version was the broadening of absorption around the resonant frequency (Montgomery
et al 1948).

A well-known absorber developed around the same time as HARP, was the Salisbury 
screen (Salisbury 1952). A resistive sheet placed at quarter wavelength from the 
scatterer, was spaced by a low dielectric material. Salisbury screens operate by resonant 
technique, and were widely used in earlier anechoic chamber designs. 

Yet another resonant absorber in wide use, was the Dallenbach layer (Ruck et al 
1970). It consisted of a homogeneous lossy layer on a metal plate. The thickness of the 
lossy layer was selected so as to match its input impedance with the intrinsic impedance 
of the free space. 

As discussed above, the absorbers developed in earlier stages were mostly narrow 
band in nature. This necessitated research towards obtaining broad band RAMs. 
Probably the very first approach to be tried was a multilayer structure (c.f. Jaumann 
absorbers discussed above). Most of the practical broad band RAMs were constructed 
using this technique. For example, the broadening of bandwidth of Salisbury screens 
was achieved by introducing additional layers of resistive sheets and spacers (Emerson 
1973). 

In some applications bulk absorbers were also used in place of sheets/layers. The
conductivity of the bulk material reduces upon infusion of carbon in a matrix of spongy
urethane foam. Based on this approach, the US Naval Research Laboratory developed
a broad band absorber in the early 1950s. The conductivity was reduced here by
dipping or spraying carbon onto a mat of loosely spun animal hair. These bulk
absorbers could be readily used for controlling reflections for both the indoor
(anechoic) and the outdoor antenna measurement ranges.

During the late 1950s the focus shifted to problems of anechoic chamber implementation,
such as (i) development of RAMs with reflection coefficients of −40 to −60 dB,
(ii) lowering the operational frequency limit, and (iii) improved measurement
techniques for RAM/anechoic chamber.

Attempts were also made to modify the resonant absorbers. For example, circuit 
analog RAM (CA-RAM) were obtained as Salisbury screen derivatives. CA-RAMs 
were obtained by depositing lossy material in geometric patterns on a thin lossless film 
(Severin 1956; Ruck et al 1970). The thickness of the deposit controlled the effective 
resistance of the layer, while the geometry of the pattern controlled its effective 
inductance and capacitance. Thus, the layer could be tailored into any value of 
impedance and its characteristic analysed in terms of lumped elements. The advantage 
of the CA-RAMs was that better performance could be obtained with comparatively 
less thickness. 

In the 1960s, the possibility of using magnetic materials as RAM was explored
intensively. Thin magnetic RAMs (e.g. the ferrites) with low reflection coefficients were 
fabricated which could operate at lower resonant frequencies than had earlier been 
feasible with the dielectric materials (Bowman 1968). Although thinner compared to 
the dielectrics, these materials are much heavier and prone to disintegration at higher 
frequencies. 

It came to light in the 1980s by serendipity, that some biotech products have 
ultrawide band absorption characteristics. Subsequent experiments confirmed that 
substantial RCS reduction was indeed possible by dissolving these compounds in 
aircraft structural materials. Following this spectacular revelation, the Department of 
Defense, US, immediately classified the compound. There are, however, conjectures 
that this ultrawide band RAM is a retinyl compound belonging to the class of 
Schiff-base salts. This is a powdery black substance which is much lighter (1/10) than 
the ferrites; it is also said to have been used in making RAM paints for stealth aircraft 
(Adam 1988). 

More recently, a new class of homogeneous materials, making use of the chiral
property, has also found applications as RAMs (Jaggard & Enghetta 1989). Such
chiral RAMs depend upon their optical activity and circular dichroism, and the
backscatter RCS is independent of polarisation. As compared to the dielectric and
magnetic RAMs discussed above, the chiral RAMs have vastly superior RCS reduction
characteristics (Jaggard et al 1990, 1991).

3. Approaches to the analysis of RAM 

The radar absorbing materials are characterised by their electromagnetic (EM) properties
such as the permittivity $\varepsilon$ and the permeability $\mu$. These are usually
normalised with respect to the corresponding free space quantities ($\varepsilon_0$, $\mu_0$):

$$\varepsilon_r = \varepsilon/\varepsilon_0, \tag{1}$$

$$\mu_r = \mu/\mu_0. \tag{2}$$

The relative permittivity $\varepsilon_r$ and the relative permeability $\mu_r$ are in
general complex quantities, denoted by

$$\varepsilon_r = \varepsilon'_r + j\varepsilon''_r, \tag{3}$$

$$\mu_r = \mu'_r + j\mu''_r, \tag{4}$$

where $\varepsilon'_r$ and $\mu'_r$ are the real components and $\varepsilon''_r$ and
$\mu''_r$ are the imaginary components of the dielectric constant and the relative
permeability of the medium respectively. The imaginary components account for the losses
in the medium; $\varepsilon''_r$ depends on the conductivity of the medium, and $\mu''_r$
on the intensity of magnetisation. Equations (3) and (4) are usually written for
convenience as

$$\varepsilon_r = \varepsilon'_r (1 + j\tan\delta_e), \tag{5}$$

$$\mu_r = \mu'_r (1 + j\tan\delta_m), \tag{6}$$

where

$$\tan\delta_e = \varepsilon''_r/\varepsilon'_r, \tag{7}$$

$$\tan\delta_m = \mu''_r/\mu'_r. \tag{8}$$

In (7) and (8), $\tan\delta_e$ and $\tan\delta_m$ are known as the electric and magnetic
loss tangents and are used to express the losses in the medium. Materials having large
values for their dielectric loss tangents are generally called dielectric materials,
while those with large $\tan\delta_m$ are called magnetic materials.

Both the dielectric and the magnetic materials could be used in the fabrication of
RAMs. Although the loss mechanism in these differs, both absorb the EM energy and
convert it into heat. Since the distances from the transmitting radars are usually large,
this absorption of energy may not heat up the material much. But absorption is not the
only mechanism by which RAMs operate. The reflections at the interface between the RAM
and free space could also be controlled by properly designing the coatings.

Two important parameters that dictate the propagation of waves through a medium
are its refractive index $n$ and the characteristic impedance $Z$. In terms of the
fundamental EM parameters, these are defined as

$$n = (\mu_r \varepsilon_r)^{1/2}, \tag{9}$$

$$Z = (\mu/\varepsilon)^{1/2} = Z_0 (\mu_r/\varepsilon_r)^{1/2}, \tag{10}$$

where $Z_0$ is the intrinsic impedance of the free space. This is a constant (= 377 Ω)
and is independent of frequency. It is clear from (9) and (10) that both $n$ and $Z$ are
complex quantities.
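
As an illustration of (9) and (10), the refractive index and the characteristic impedance can be evaluated directly from the complex relative permittivity and permeability. The sketch below is only indicative: the function names and the material values are hypothetical, and the principal branch of the complex square root is assumed.

```python
import cmath

Z0 = 377.0  # intrinsic impedance of free space (ohm), as quoted in the text


def refractive_index(eps_r, mu_r):
    """Complex refractive index n = (mu_r * eps_r)^(1/2), principal branch."""
    return cmath.sqrt(mu_r * eps_r)


def characteristic_impedance(eps_r, mu_r):
    """Complex characteristic impedance Z = Z0 * (mu_r / eps_r)^(1/2)."""
    return Z0 * cmath.sqrt(mu_r / eps_r)


# Hypothetical lossy dielectric (values chosen only for illustration).
eps_r = complex(4.0, 1.0)
mu_r = complex(1.0, 0.0)
n = refractive_index(eps_r, mu_r)
Z = characteristic_impedance(eps_r, mu_r)
```

For a lossless dielectric with relative permittivity 4 and relative permeability 1, these expressions give n = 2 and Z = Z0/2, while any material with equal relative permittivity and permeability has Z = Z0.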

3.1 Analysis of RAMs on flat surfaces 

The method most commonly used for the analysis of a flat surface coated with a RAM is 
the transmission line approach. In this approach, the RAM is expressed as a lossy 
matching network between the free space and the scatterer which is represented in 
terms of a complex impedance. 

Reflections occur when the EM wave travels from one medium to another. This
reflection is represented by a term called the reflection coefficient $r$. In the case of
a wave entering into a large flat surface from free space,

$$r = (Z_1 - Z_0)/(Z_1 + Z_0), \tag{11}$$

where $Z_1$ is the input impedance at the boundary surface. Thus the condition for zero
reflection is:

$$Z_1 = Z_0. \tag{12}$$

Equation (12) requires that the surface input impedance be independent of frequency,
and real. The input impedance for a wave entering at normal incidence at the interface
with free space is given by

$$Z_1 = Z\tanh\gamma d, \tag{13}$$

where $Z$ is the characteristic impedance of the RAM defined in (10), $d$ is the thickness
of the RAM,

$$\gamma = [j\omega\mu(\sigma + j\omega\varepsilon)]^{1/2}, \tag{14}$$

$$\omega = 2\pi f, \tag{15}$$

$\gamma$ being the complex propagation constant of the medium, $\sigma$ the electrical
conductivity of the medium, and $f$ the operational frequency.

The reflection coefficient of a coated body is affected by the interference of the waves
reflected at the surface of the RAM and at the metal-RAM interface. Their phases
depend on $\varepsilon$, $\mu$ and $d$ while their relative amplitudes depend on the
losses in the material (c.f. (13) & (14)).
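
The chain (11), (13), (14) and (15) can be put together in a short numerical sketch for a single metal-backed layer at normal incidence. It is assumed here that the losses enter only through the conductivity, so that the characteristic impedance is evaluated with the same effective complex permittivity as the propagation constant; the function name and the material values in the checks are hypothetical.

```python
import cmath
import math

EPS0 = 8.854e-12              # permittivity of free space (F/m)
MU0 = 4.0e-7 * math.pi        # permeability of free space (H/m)
Z0 = math.sqrt(MU0 / EPS0)    # intrinsic impedance of free space, about 377 ohm


def reflection_normal(eps_r, mu_r, sigma, d, f):
    """Reflection coefficient r of a metal-backed layer at normal incidence.

    eps_r, mu_r : relative permittivity and permeability of the layer
    sigma       : electrical conductivity (S/m)
    d           : layer thickness (m)
    f           : frequency (Hz)
    """
    omega = 2.0 * math.pi * f                  # (15)
    jwm = 1j * omega * mu_r * MU0
    y = sigma + 1j * omega * eps_r * EPS0
    gamma = cmath.sqrt(jwm * y)                # (14)
    z = cmath.sqrt(jwm / y)                    # characteristic impedance of the lossy layer
    z1 = z * cmath.tanh(gamma * d)             # (13)
    return (z1 - Z0) / (z1 + Z0)               # (11)


def reflection_db(r):
    """Reflection level in dB relative to a perfect reflector."""
    return 20.0 * math.log10(abs(r))
```

Two limiting cases serve as checks: a layer of zero thickness leaves the bare metal (r = −1), and a lossless layer can only change the phase of the reflected wave, so that |r| = 1.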

The input impedances for the parallel and perpendicular polarisations are different
for oblique incidence. At an angle of incidence $\theta$ with respect to the normal,
$Z_{1\mathrm{para}}$, the input impedance for the parallel polarisation, is given by

$$Z_{1\mathrm{para}} = Z\cos\theta\,\tanh(\gamma d\cos\theta), \tag{16}$$

and $Z_{1\mathrm{perp}}$ for the perpendicular polarisation is defined as

$$Z_{1\mathrm{perp}} = [Z\tanh(\gamma d\cos\theta)]/\cos\theta. \tag{17}$$

The corresponding reflection coefficients are

$$r_{\mathrm{para}} = (Z_{1\mathrm{para}} - Z_0\cos\theta)/(Z_{1\mathrm{para}} + Z_0\cos\theta), \tag{18}$$

$$r_{\mathrm{perp}} = (Z_{1\mathrm{perp}}\cos\theta - Z_0)/(Z_{1\mathrm{perp}}\cos\theta + Z_0), \tag{19}$$

where $r_{\mathrm{para}}$ is the reflection coefficient for parallel polarisation and
$r_{\mathrm{perp}}$ is the reflection coefficient for perpendicular polarisation.

Thus knowing $\varepsilon_r$, $\mu_r$ and $\sigma$ of the material, the input impedance,
and hence the reflection coefficient at normal incidence, can be determined by
substituting (1)-(4), (10) and (13)-(15) in (11). Similarly, by using (16)-(19) the
reflection coefficient corresponding to an angle of incidence $\theta$ can also be
determined (Ono & Suzuki 1967).

Multilayer absorbers having broad band absorption characteristics can also be used
for the RAM applications. Their analyses are similar to those of the single layer RAMs.
The input impedance of the entire substrate, upon inclusion of an additional layer,
depends on the properties of all the previous layers. A numerical program for
computing the characteristics of multilayered RAM using a minimisation method is
presented by Maffioli (1970). Ono & Suzuki (1967) computed the reflection coefficient
and the attenuation for both the perpendicular and parallel polarisations for
multilayered RAM structures using the transmission line approach. The measurements on
actual samples (Ono et al 1967) showed their method to be viable.
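
The dependence of the input impedance on all the previous layers can be illustrated with the standard transmission-line impedance transformation, applied layer by layer starting from the metal backing. The sketch below is restricted to normal incidence and is not the program of Maffioli (1970) or the method of Ono & Suzuki (1967); the function names, the layer description as tuples and the values used in the checks are assumptions.

```python
import cmath
import math

EPS0 = 8.854e-12              # permittivity of free space (F/m)
MU0 = 4.0e-7 * math.pi        # permeability of free space (H/m)
Z0 = math.sqrt(MU0 / EPS0)    # intrinsic impedance of free space, about 377 ohm


def layer_constants(eps_r, mu_r, sigma, f):
    """Propagation constant and characteristic impedance of one lossy layer."""
    omega = 2.0 * math.pi * f
    jwm = 1j * omega * mu_r * MU0
    y = sigma + 1j * omega * eps_r * EPS0
    return cmath.sqrt(jwm * y), cmath.sqrt(jwm / y)


def multilayer_reflection(layers, f):
    """Normal-incidence reflection coefficient of a metal-backed stack.

    layers : list of (eps_r, mu_r, sigma, d) tuples, ordered from the
             metal backing outwards to the free-space side
    f      : frequency (Hz)
    """
    z_in = 0.0 + 0.0j  # the metal backing acts as a short circuit
    for eps_r, mu_r, sigma, d in layers:
        gamma, z = layer_constants(eps_r, mu_r, sigma, f)
        t = cmath.tanh(gamma * d)
        # impedance seen at the outer face of this layer
        z_in = z * (z_in + z * t) / (z + z_in * t)
    return (z_in - Z0) / (z_in + Z0)
```

For a single layer the recursion reduces to (13). A convenient consistency check is that a layer of thickness d and two identical layers of thickness d/2 give the same reflection coefficient.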
3.2 Analysis of curved surfaces coated with RAM

The foregoing discussion is valid for the RAMs on flat surfaces in general. Analysis and
design of coatings on curved surfaces, in contrast, is extremely complex.

3.2a Theorems on absorbers: Two theorems introduced by Weston (1963) facilitate
the analysis of RAM on curved surfaces.

Theorem 1. If a plane EM wave is incident upon a body consisting of material such that
$\mu_r = \varepsilon_r$, then the far zone backscattered field is zero, provided that the
direction of incident propagation is parallel to an axis of the body about which a
rotation of 90° leaves the shape of the body together with its material medium invariant.

Theorem 2. If a plane wave is incident upon a body consisting of a material such that the
total tangential EM field components satisfy the impedance boundary condition

$$\mathbf{E} - (\mathbf{n}\cdot\mathbf{E})\,\mathbf{n} = Z(\mu_0/\varepsilon_0)^{1/2}\,\mathbf{n}\times\mathbf{H}, \tag{20}$$

on the surface, and the surface of the body is invariant under a 90° rotation about some
axis, then the far-field backscattered field is zero if the direction of incidence is
along this axis. In the above expression $\mathbf{n}$ is the unit outward normal of the
surface and $Z = 1$ corresponds to free space.

The above theorems are extended to predict the specular backscatter from any 
smooth arbitrarily convex bodies. 

3.2b Effect of curvature: Bodies with curved surfaces in the high frequency domain
(i.e. when they are electrically large) have two components for their backscatter, viz.
the specular and the diffracted field contributions. The specular contribution from an
absorber covered body is $|r|^2\sigma$. Here $r$ is the flat plate reflection coefficient,
and $\sigma$ is the specular reflection. The calculation for the diffracted field
contribution is rather involved as it is affected by the shape of the scatterer, its
conductivity and the direction of the ray with respect to the axis of the body. A
detailed analysis is given by Weston (1963) and Bowman & Weston (1966).

For extending these theorems to bodies in the resonance region (i.e. when their
largest dimension is comparable to the wavelength), additional conditions must be
satisfied by these curved surfaces. These ensure that the creeping wave contribution is
also reduced considerably for a general smooth convex scatterer. The boundary
condition defined by the second theorem is a very good approximation under the
assumptions (Leontovich 1957):

(i) The refractive index of the material is very large and has a large imaginary part.

(ii) The EM fields outside the boundary vary slowly in the near field, exterior to the
scatterer.

(iii) The radii of curvature of the surface are large compared to the interior
wavelength (i.e. the wavelength of the EM wave propagating inside
the scattering medium).

(iv) Variations of $\varepsilon$ and $\mu$ in the body over the distance of the interior
wavelength are small. This condition is relaxable for thin coatings on metallic surfaces.
To summarise, an absorber with a large and complex refractive index significantly
reduces the backscattered RCS in the resonance region of arbitrary convex bodies for
the resonant frequencies where the main contribution to scattering is from the specular
reflections and the creeping waves.

3.3 Approaches to analysis of RAMs on a general surface 

In the EM scattering and diffraction problems, the method of moments (MOM) 
presents one of the most versatile approaches. MOM is well known to predict accurate 
solution for the class of problems it treats. The applicability of MOM and other grid 
based approaches to the RAM analysis have therefore been explored by various 
workers. Rogers (1986) applied MOM to the RCS prediction of arbitrarily shaped 
conducting bodies coated with RAMs. MOM was also modified to effectively analyse 
the shapes used in the construction of anechoic chambers. Yang et al (1992) have 
developed a periodic moment method (PMM) for the analysis of two dimensional lossy 
dielectric scatterers. CRAY-YMP was used to calculate the reflection and transmission 
coefficients of periodically distributed wedges illuminated by a plane wave. 

The grid-like segmentation of the MOM is also employed in the spatial network
method by Kashiwa et al (1990). The scattering body, an aircraft, is divided into
a number of cells of size λ/30. In this time domain method, the equivalent circuit of the
coated scatterer is constructed in terms of bulk impedances to compute the field
patterns.

Yet another time domain analysis conforming to Bergeron’s method in three
dimensional space and time is used to analyse resonant RAMs (Aoto et al 1987). The
…ribution for the changes in magnetic loss is given. Absorption
waveforms in the time domain are expressed as a function of the …

… (FD-TD) approach is used in conjunction …
optimisation routine for forward …
…rs (Strickel & Taflove 1990). The …

… scattered by the air-dielectric interface and that by the equivalent conducting body,
which differs from the actual body because of the lens action of the shell.

Alexopoulos (1969) applied GO to obtain the reflected electric field for large spheres
(ka = 50 to 1000) coated with inhomogeneous dielectrics. A series solution was also
obtained for the backscattering GO computations for bodies of revolution coated with
homogeneous RAMs (Arsaev 1982). The effect of the curvature of the scatterer and the
conditions for the RCS reduction were also examined.

Although the GO is extremely simple to apply, it is well known to give rise to
singularities and also predict discontinuous fields. This was overcome by the introduction
of a class of high-frequency asymptotic theories which essentially introduced
corrections to the GO in the form of diffraction coefficients. The Geometrical Theory of
Diffraction (GTD) is one such example which has been extremely popular and also
extensively modified to suit the various modelling requirements. Consequently, the
asymptotic theories have found their way into the analysis of RAMs as well.

The RCS reduction of structures such as a cavity absorber was pioneered by Lee
et al (1985) for coatings on circular waveguides. They presented a high frequency
asymptotic method to analyse EM scattering from circular waveguides coated with
lossy materials. It was observed that a coating less than 1% of the radius can give 13 dB
reduction in the RCS of a circular waveguide terminated with a perfect electric conductor.
It was also found that large reductions in RCS are possible at small angles of
incidence.

The asymptotic high frequency techniques could also be employed for the estimation
of the monostatic RCS of a finite planar metallic structure coated with a lossy dielectric.
The RCS is expressed in terms of the diffraction coefficient involving the Maliuzhinets
function (Bhattacharyya & Tandon 1984). Bhattacharyya (1990) used the uniform
asymptotic theory of diffraction (UAT), which is a modification of the GTD, to analyse
the performance of flat plates.

More recently, Shi et al (1993) have analysed conducting bodies covered partially 
with absorbent materials. A salient contribution of this work is in establishing an 
equivalence between the RAMs to be used in the scale model and the prototype. 

4. Design and fabrication of RAMs 

The design of RAMs involves identifying suitable materials, and specifying their 
dimensions and compositions. Similarly, the steps involved in the fabrication of these 
materials are crucial to their performance. 

There are basically two concepts that are used in the design of absorbers for the RCS 
reduction (Musal & Hahn 1989) as given below. 

Matched characteristic impedance concept : The characteristic impedance of the material 
is made nearly equal to the intrinsic impedance of free space. This essentially means 
that the values of its relative permittivity and relative permeability must be equal. 

$Z = Z_0$, (21)

hence from (10),

$\varepsilon_r = \mu_r$. (22)

Equation (22) is the condition for zero reflection at the front surface of the material. The
material must be made sufficiently thick so that the EM wave is attenuated considerably
as it propagates into the material. The internal reflections produced when backed
by a metallic surface attenuate in the round-trip path.

Matched wave impedance concept: The wave impedance at the front surface of a 
metal-backed absorber layer (the impedance when looking into it) is made equal to the 
intrinsic impedance of the free space. This results in the complete absorption of the EM 
energy falling on the surface. 

Thus for an absorber of finite thickness backed by a conducting plate, (13) gives the
wave impedance. Combining (12) and (13), the thickness is determined from

$Z_0 = Z \tanh \gamma d$. (23)

Thus the formulae (1)-(19) can be used appropriately for computing the thickness of the
RAM required for a specified performance. Knott (1979) has derived the formulae for
the minimum thickness condition, for a specified reflectance of a single layer absorber,
of given characteristic impedance and refractive index.
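
The matched wave impedance condition can be checked numerically. The sketch below is only illustrative: it assumes normal incidence, an $e^{j\omega t}$ time convention (so that lossy materials have $\varepsilon_r = \varepsilon' - j\varepsilon''$ and $\mu_r = \mu' - j\mu''$), and the usual forms $Z = Z_0(\mu_r/\varepsilon_r)^{1/2}$ and $\gamma = j(2\pi/\lambda)(\mu_r\varepsilon_r)^{1/2}$. The function names are hypothetical and not taken from the cited works.

```python
import cmath
import math

Z0 = 376.730313668  # intrinsic impedance of free space, ohms


def layer_input_impedance(eps_r, mu_r, d, wavelength):
    """Wave impedance seen at the front of a metal-backed layer, Z tanh(gamma d)."""
    z = Z0 * cmath.sqrt(mu_r / eps_r)  # characteristic impedance of the material
    gamma = 1j * 2 * math.pi / wavelength * cmath.sqrt(mu_r * eps_r)
    return z * cmath.tanh(gamma * d)


def reflection_db(eps_r, mu_r, d, wavelength):
    """Reflection coefficient in dB for normal incidence (0 dB = total reflection)."""
    z_in = layer_input_impedance(eps_r, mu_r, d, wavelength)
    gamma_r = (z_in - Z0) / (z_in + Z0)
    return 20 * math.log10(abs(gamma_r))


if __name__ == "__main__":
    # Illustrative (assumed) material values, 3 mm layer at a 30 mm wavelength.
    print(reflection_db(10 - 5j, 2 - 2j, 0.003, 0.03))
```

Sweeping `d` for fixed material constants and looking for the minimum of `reflection_db` is one simple way of locating a thickness that approximately satisfies (23).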

The matched wave impedance concept is frequently used in the design of the resonant
absorbers. Although such resonant RAMs require comparatively thin layers of absorber,
they satisfy the conditions for RCS reduction only at one or more discrete
frequencies. On the other hand, the nonresonant absorbers have RCS reduction over
a broader band of frequency. Hence these constitute the two basic types of RAMs.

4.1 Resonant RAM 

The thickness of the resonant (i.e. narrow band) RAMs is related to the material
properties and the operating wavelength.

4.1a Salisbury screen: The resonant absorbers discussed in §2 strive to cancel the
reactive part of their surface impedance by exploiting the properties of a
quarter-wavelength spacing. The simplest method is to make use of a resistance card
whose surface resistance is equal to the free space impedance. When this sheet is placed
a quarter of a wavelength from the metal plate, the input impedance in the plane of the
sheet matches the impedance of free space. Such absorbers are called
Salisbury screens (figure 1). A detailed analysis of the Salisbury screen is available in
Ruck et al (1970).

The equivalent circuit for the Salisbury screen is shown in figure 2. The conditions to 
be satisfied by the resistive sheet are: 


$\sigma \gg \omega\varepsilon'$, (24)

$Z_0 = 1/\sigma d$, (25)

$d = 1/Z_0\sigma$, (26)


where d is the thickness of the resistive sheet. The choice of the material for the sheet is 
constrained by (24) while its thickness is determined from (26). 

A proprietary material, Uskon cloth manufactured by the US Rubber Company, was
used as the resistive sheet in the Salisbury screens. Materials of relative permittivity
approximately equal to unity were used as spacers. The operational bandwidth is
reduced if the spacer material has a higher permittivity, as the wavelength inside it is
a function of this value. Plywood or some jelly type material having $\varepsilon_r < 1.05$ can be
used as spacer. In one such case, the calculated frequency sensitivity showed the VSWR
to be below 1.10 (that is, a reflection coefficient of 0.05) over a bandwidth of
±5% (Emerson 1973).
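
The narrow band behaviour described above can be illustrated with a simple transmission-line sketch. It assumes normal incidence, an ideal resistive sheet in parallel with a short-circuited lossless spacer that is a quarter wavelength long (inside the spacer) at the design frequency $f_0$; the function names are hypothetical.

```python
import math

Z0 = 376.730313668  # intrinsic impedance of free space, ohms


def salisbury_reflection(f_over_f0, sheet_resistance=Z0, spacer_eps_r=1.0):
    """|reflection coefficient| of a Salisbury screen at normal incidence (f > 0)."""
    theta = 0.5 * math.pi * f_over_f0        # electrical length of the spacer
    z_spacer = Z0 / math.sqrt(spacer_eps_r)  # wave impedance of a lossless spacer
    y_sheet = 1.0 / sheet_resistance
    y_short = 1.0 / (1j * z_spacer * math.tan(theta))  # shorted line behind the sheet
    y_in = y_sheet + y_short
    return abs((1.0 / Z0 - y_in) / (1.0 / Z0 + y_in))


def vswr(reflection_magnitude):
    """Voltage standing wave ratio for a given |reflection coefficient|."""
    return (1.0 + reflection_magnitude) / (1.0 - reflection_magnitude)


if __name__ == "__main__":
    for ratio in (0.90, 0.95, 1.00, 1.05, 1.10):
        g = salisbury_reflection(ratio)
        print(f"f/f0 = {ratio:.2f}  |Gamma| = {g:.4f}  VSWR = {vswr(g):.3f}")
```

Under these idealised assumptions the VSWR stays below 1.10 within ±5% of $f_0$, and raising `spacer_eps_r` increases the reflection at the band edges, in line with the remark on spacer permittivity.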

The operational frequency range for the Salisbury screen can be broadened by using
a number of such layers. Fante & McCormack (1988) describe the analysis for Salisbury
screens using multiple resistive sheets. The cases in which the screen is electric or
magnetic or their multiple combinations are discussed. It is also shown that a relatively
large reduction in specular reflection is possible over considerably wide bandwidths by
using relatively few Salisbury screens. Furthermore, these reductions can be achieved
even if the incident wave is not normal to the surface, or the surface is curved, or the
fabrication is not accurate.

Figure 2. The equivalent circuit of the Salisbury screen.

A multilayer Salisbury screen uses several layers of Salisbury screens, suitably spaced 
to resonate at discrete frequencies, which results in broad band RAMs. The bandwidth 
of such a structure depends on the number of sheets used. Their construction is similar 
to the Jaumann absorber. The resistance of these sheets decreases near-exponentially 
towards the back plate. 


4.1b Magnetic absorbers: The electric Salisbury screens place the lossy material
where the electric field is maximum. It is also possible to place a magnetic screen where 
the magnetic field is maximum, that is, right on the metal surface. 


The necessary conditions to be satisfied are: 


$\mu'' \gg \mu'$, (27)

$Z_0 = \omega\mu'' d$, (28)


(29) 


g and d , satisfies (13) along with (27)-(29) at the frequency of 
jcted for fabrication. When the thickness is very small compared 
1), the conditions (from (13)) to be satisfied by these materials are 
'’ll: 


(30) 

(31) 

(32) 

e order of $\lambda/10$. Thus the magnetic
parts as they require less thickness 

presence of carbonyl iron or ferrites 
these materials which try to orient 
es fast, the dipoles lag the impressed 
pating energy in the material, 
he dissipating element. Each ferrite 
hicknesses which are related to an 
value (Naito & Suetake 1971). The 
uch a way that the first matching 
its. The first matching thickness is 
mm. This peculiar characteristic of 
aito 1970). Three mechanisms for 
the relaxation magnetisation, the 
resonance motion. The first among 

d their magnetic properties tend to 
Curie point (typically 500-1000° F). 
nperatures, they are constrained by 
The permeability of the ferrites depends on their composition. The peaks of the
permeability curves of such materials can be shifted over frequency by replacement of
Fe$^{3+}$ ions in the lattice by divalent and tetravalent metal ions such as Co$^{2+}$ and Ti$^{4+}$
(Amin & James 1981). Cobalt-substituted barium hexaferrite has significant variation
of permeability in the microwave region. The ferromagnetic resonant frequency of this
hexagonal ferrite varies between 2 and 46.5 GHz depending on the amount of cobalt
substituted.

The ferrites are typically sintered in the form of small rigid tiles. The density of these
tiles depends upon the composition and the method of processing (i.e. different
temperatures and pressures applied during manufacture). The surface resistance can be
varied during manufacture by suitable selection of molecular structure and loading
concentration. The material is hot-pressed onto the metallic surface at several thousands
of psi pressure. Bonding of the materials is also a crucial process. It is thus
possible to obtain magnetic RAMs with different characteristics by slightly varying the
fabrication procedure (Hahn 1991). The effects on the magnetic permeability of
different Ni$_{0.5}$Zn$_{0.5}$Fe$_2$O$_4$ samples are studied in a number of such cases. It is observed
that the real and imaginary parts of permeability increase with the tile density.

The commonly used binding media are natural rubber or synthetic elastomers such
as isoprene, neoprene, nitrile, silicone, urethane, and fluoroelastomer. Although the
EM absorption properties of the binding media normally remain constant, there are
instances where they behave anomalously. It is reported (Veinger et al 1990) that at low
filler contents, low molecular weight rubbers such as dimethyl-phenyl-siloxane rubber,
dimethyl-diphenyl-siloxane rubber, and dimethyl-siloxane rubber showed strong
absorption near 10 GHz.

High frequency absorption of a random assembly of noninteracting single domain 
ferromagnetic particles with positive uniaxial magnetic anisotropy can be calculated 
numerically for values of the signal frequency below the natural resonant frequency of 
such materials (Hempel & Roos 1981). If the lower frequencies are used, an additional 
absorption is found to be present at the negative field at which the resonance frequency 
crosses the signal frequency. A set of measurements has also been specifically reported 
by Hempel & Roos (1981) for the polycrystalline barium ferrites. 

Just like their dielectric counterparts, the magnetic RAMs too can be used in
multilayered forms. Basically, this is done to enable the employment of materials that 
have favourable permeability curves at different frequencies. 

4.1c Dallenbach (homogeneous) layer: Yet another dielectric RAM, the homogeneous
absorber typically consists of a binding medium (e.g., polyurethane foam) in
which carbon powder and TiO$_2$ are dispersed. The amount of carbon determines the
dissipation factor, while the TiO$_2$ content accounts for the high permittivity, thereby
resulting in a thinner layer.

The design for the absorber is generalised by Fernandez & Valenzula (1985). They 
considered the case for a single layered absorber backed by a perfectly conducting flat 
surface for normal incidence. Substituting (5), (6) and (10) in (13) one obtains 

$\mu_r/\varepsilon_r = \{(\cosh 2\beta ds + \cos 2\beta d)\cos\delta_m\}/\{(\cosh 2\beta ds - \cos 2\beta d)\cos\delta_e\}$, (33)

$\sin 2\beta d = p \sinh 2\beta ds$, (34)

$s = \tan[(\delta_m + \delta_e)/2]$, (35)

where

$p = \tan[(\delta_m - \delta_e)/2]$, (36)

$\beta = \{2\pi(\mu_r\varepsilon_r)^{1/2}\cos[(\delta_m + \delta_e)/2]\}/\{\lambda(\cos\delta_e\cos\delta_m)^{1/2}\}$. (37)

The design curves for different conditions are obtained from (33) and (34). These lead
to the solution for d as:

$d = \pi/[2\beta(1 + ps)]$, $p < 0.25$, (38)

$d = (1/2\beta)[6(1 - ps)/(1 + ps)]^{1/2}$, $p \geq 0.25$. (39)


The quarter-wavelength solution is obtained by substituting p = 0 in (38). 
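
As a rough numerical illustration of (33)-(38), the sketch below evaluates the layer thickness and, for the equal-loss case $p = 0$, checks that the resulting design gives zero reflection. It assumes that $\mu_r$ and $\varepsilon_r$ in (33) and (37) denote the real parts of the relative permeability and permittivity, that $\delta_m$ and $\delta_e$ are the loss angles with $\mu = \mu_r(1 - j\tan\delta_m)$ and $\varepsilon = \varepsilon_r(1 - j\tan\delta_e)$, and that $\beta$ is the phase constant in the layer; the function names are hypothetical.

```python
import cmath
import math


def dallenbach_thickness(mu_r, eps_r, delta_m, delta_e, wavelength):
    """Layer thickness d from (38) or (39); mu_r, eps_r are real parts (assumed)."""
    s = math.tan(0.5 * (delta_m + delta_e))  # (35)
    p = math.tan(0.5 * (delta_m - delta_e))  # (36)
    beta = (2 * math.pi * math.sqrt(mu_r * eps_r)
            * math.cos(0.5 * (delta_m + delta_e))
            / (wavelength * math.sqrt(math.cos(delta_e) * math.cos(delta_m))))  # (37)
    if p < 0.25:
        return math.pi / (2 * beta * (1 + p * s))  # (38)
    return math.sqrt(6 * (1 - p * s) / (1 + p * s)) / (2 * beta)  # (39)


def matched_mu_equal_loss(eps_r, delta):
    """Real permeability required by (33) when delta_m = delta_e (p = 0).

    With p = 0, (34) gives 2*beta*d = pi, so cos(2 beta d) = -1 and
    2*beta*d*s = pi * tan(delta).
    """
    x = math.pi * math.tan(delta)
    return eps_r * (math.cosh(x) - 1) / (math.cosh(x) + 1)


def reflection(mu_r, eps_r, delta_m, delta_e, d, wavelength):
    """|reflection coefficient| of the metal-backed layer at normal incidence."""
    mu = mu_r * (1 - 1j * math.tan(delta_m))
    eps = eps_r * (1 - 1j * math.tan(delta_e))
    z_rel = cmath.sqrt(mu / eps)  # Z / Z0
    gamma_d = 1j * 2 * math.pi * d / wavelength * cmath.sqrt(mu * eps)
    z_in = z_rel * cmath.tanh(gamma_d)
    return abs((z_in - 1) / (z_in + 1))


if __name__ == "__main__":
    eps_r, delta, lam = 10.0, 0.3, 0.03  # illustrative (assumed) values
    mu_r = matched_mu_equal_loss(eps_r, delta)
    d = dallenbach_thickness(mu_r, eps_r, delta, delta, lam)
    print(mu_r, d, reflection(mu_r, eps_r, delta, delta, d, lam))
```

For $p \neq 0$, (38) and (39) are approximate, so the reflection obtained from such a check is small but not exactly zero.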

The lowest order solution for the materials having both electric and magnetic 
losses, is given in terms of plots by Fernandez & Valenzula (1985). They have also 
concluded that the materials having moderate losses will have a larger bandwidth for 
absorption. 

An alternate solution along with optimal design curves for the EM and physical
parameters such as the surface impedance, the magnetic and dielectric loss tangents,
and the thickness in wavelengths is given (Musal & Hahn 1989) for normal incidence.
Design curves are plotted for different $d/\lambda$ and $\tan\delta_e$.

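A further generalisation by Musal & Smith (1990) resulted in an integrated single
design curve which is plotted as contours of constant $(d/\lambda)\,\mathrm{Re}(\varepsilon)$ and contours of constant
$\mathrm{Im}(\varepsilon)/\mathrm{Re}(\varepsilon)$ on a plane of ($(d/\lambda)\,\mathrm{Im}(\mu)$, $(d/\lambda)\,\mathrm{Re}(\mu)$).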

Yet another example for the homogeneous absorbers is the Halpern Anti-Radar Paint 
(HARP). Disk-shaped aluminium flakes dispersed in a rubber matrix serve as the 
dissipating elements in HARP. High concentration and a high degree of alignment of 
these disks resulted in a large permittivity (Knott et al 1985). 

A new kind of single layer absorber postulated by Gauss (1982) could be made from 
filaments of radar absorbing chaff mixed in a solid binder. Metallic strands of 50 
micrometers length could be dispersed in the medium having a dielectric constant of 
about unity at spacings of about one-third their length. Theoretically, RCS reduction of 
about 30 dB is achievable over 10-100 GHz if the filaments are distributed uniformly. 

Multiple layers of the homogeneous absorbers, each with a different composition,
could also be used to obtain broad band absorbers. In practice, this results in an
inhomogeneous absorber, to be discussed later in §4.2c.

4.1d Circuit analog RAM: If the continuous resistive sheets of the Salisbury screens
are replaced by a conductive layer deposited in appropriate geometrical patterns,
circuit analog RAMs (CA-RAM) are obtained. These shapes, usually in the form of
loop antennas, dipoles, crosses or triangles, may have susceptance as well as conductance,
which improves the flexibility of the design of such RAMs.

The design of these patterns is similar to that of a frequency selective surface (FSS).
But CA-RAM is different from the FSS since the latter does not require resistance
variation. Both involve the design of frequency sensitive reactive elements.

Control over admittance of the conductive sheet in CA-RAM is obtained by 
depositing the conducting materials in suitable geometrical shapes. The shapes 
employed have the property of resonating at a discrete frequency, thus dissipating 
the energy falling on it. Typical shapes include strips, wires, intersecting wires, dipoles, 
crossed dipoles, Jerusalem cross structure etc. These sheets with the lossy elements are 
separated from the conducting surfaces by lossless dielectric spacers. 
Figure 3. The equivalent circuit of the CA-RAM.


The equivalent circuit of the CA-RAM is shown in figure 3. By adjusting the shape of
the elements, and/or by adjusting the number of elements per unit area, the resonance
and the zero reflection may be obtained at any frequency. The optimum values for both
of these are selected. It is clear that a more uniform loading is obtained when the
elements are at the molecular level, i.e., by using magnetic dipoles in molecules.

For a periodically loaded surface, the input impedances at oblique incidence, for the
parallel and perpendicular polarisations (i.e., E parallel and perpendicular to the plane
of incidence respectively) are:

$Z_{i,\mathrm{para}} = X\cos\theta$ (parallel polarisation), (40)

$Z_{i,\mathrm{perp}} = X/\cos\theta$ (perpendicular polarisation), (41)

where X is the element impedance and $\theta$ the angle of incidence. The reflection
coefficients are obtained by substituting these expressions in (18) and (19) (Ruck et al
1970). Consequently, at normal incidence ($\theta = 0°$), the impedances and hence the
reflection coefficients for the two polarisations are identical.

The absorber is effective only if the magnetic vector of the incident wave has 
a component in the direction of the magnetic dipoles. Hence two separate grids with 
arrays of elements in parallel, with an angle of orientation of 90° between the grids, each 
for a polarisation, may be used to make a RAM independent of polarisation. 

4.2 Broad band absorbers 

Unlike the resonant absorbers discussed above, the broad band absorbers have a wider 
band of frequency over which considerable RCS reduction is possible. This band of 
frequency is usually limited only by the changes in the material properties. 

4.2a $\mu = \varepsilon$ absorbers: Thick layers of materials satisfying the condition $\mu_r = \varepsilon_r$ are
used in the first type of nonresonant absorbers called the $\mu = \varepsilon$ absorber. In these RAMs
most of the incident energy is absorbed by the material itself, before reaching the metal
plate at the back. Thus for a material with both dielectric and magnetic losses (Severin
1956),

$Z = [(\mu'/\varepsilon')(1 - j\tan\delta_m)/(1 - j\tan\delta_e)]^{1/2}$. (42)

For the matching condition with free space, the characteristic impedance of the
material should be:

$Z = Z_0 = 377\ \Omega$. (43)

An efficient $\mu = \varepsilon$ absorber should have large and equal values for dielectric and
magnetic loss angles, and relative permittivity and permeability. Ideally, these EM
parameters should also be independent of frequency.

The electric and magnetic field strengths are attenuated exponentially in the bulk of the
material. Thus the minimum thickness for a specified attenuation is determined
from (Hurmuth 1983)

$[E_r/E_i] = e^{-2\beta d}$, (44)

$d = (1/2\beta)\ln[E_i/E_r]$, (45)

where $E_r$ and $E_i$ are the reflected and incident fields respectively.
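
The thickness estimate can be illustrated with a short calculation. The sketch assumes that the reflected-to-incident field ratio after the round trip through the layer is $e^{-2\beta d}$, with $\beta$ the attenuation constant of the material in nepers per metre, and that the front face is perfectly matched; the function name and the numerical values are hypothetical.

```python
import math


def min_thickness(beta, reflection_db):
    """Thickness d for which |E_r/E_i| = exp(-2 beta d) equals the target level.

    beta          : attenuation constant of the material, Np/m (assumed meaning)
    reflection_db : target reflection level in dB, e.g. -20.0
    """
    ratio = 10 ** (reflection_db / 20.0)  # |E_r / E_i|, less than 1
    return math.log(1.0 / ratio) / (2.0 * beta)


if __name__ == "__main__":
    # Illustrative values only: 50 Np/m and a -20 dB target.
    print(min_thickness(50.0, -20.0))
```

Halving the attenuation constant doubles the required thickness, which is why large loss angles are desirable in this type of absorber.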

Although absorbers with $\mu_r$ equal to $\varepsilon_r$ are ideal, this is one of the most difficult
conditions to meet in fabrication. The EM properties of the materials depend on the
composition and are in general frequency sensitive. Some ferrites have been made with
this constraint for a limited frequency range (Bhattacharyya & Sengupta 1991).

4.2b Low density absorbers: The low density absorbers are made with materials
whose relative permittivity is nearly equal to that of free space. If the relative permeability is
also 1, the material will absorb all the energy falling on it. Most of the materials satisfying
these conditions are materials of low density. Thick layers of these materials are used to
dissipate the energy, thus reducing the reflections when backed by a metal plate. For
analysis and design purposes, this may be considered as a special case of the $\mu = \varepsilon$ absorber.

One such manufactured material, Spongex, was 2 inches thick and had a reflection
coefficient of about -20 dB over a frequency range of 2.4-10.0 GHz. It was possible to
further lower the low frequency limit by increasing the material thickness (Emerson 1973).

4.2c Inhomogeneous layer absorber: The absorbers should present a minimum
discontinuity at the free space-absorber interface and the losses should increase, as one
moves into it, to absorb the incident EM energy completely. This is obtained either by
tapering of homogeneous layers of absorber material, or by arranging different layers
parallel to the metal surface such that the loss tangent increases toward the base plate.
The latter results in an inhomogeneous absorber. In another method of construction,
layers of lossless dielectric and sheets of poorly conducting material, the surface
resistivity of which decreases towards the base metal plate by a constant factor from
one sheet to the next, are placed alternately. Theoretical predictions for one such case
show that the reflection is below 0.1 for nearly 3 octaves (Bowman 1968). However, these
values are not achievable in practice, since it is difficult to fabricate a number of layers
having different parameters accurately.

In a bulk absorber with a multilayered structure, the EM properties of different 
layers are made to vary for obtaining maximum absorption of the incident energy. The 
thickness of the bulk material is not important. The refractive indices of the layers are 
such that the value increases as one goes into the material. A simple fabrication 
technique, where the foam is dipped in an aqueous solution of graphite, the aquadag, of 
different concentrations, is described by Jones & Wooding (1964). The injection of 
carbon is made uniform by alternately compressing and expanding the foam inside the 
solution. Yet another multilayer absorber was fabricated by making use of pyrolysed
polyacrylonitrile fibre blankets for different layers (Ono et al 1967). 

It is possible to synthesise an inhomogeneous absorber with a number of layers with 
different composition of carbon using the convergence circle method (Ono et al 1979). 
The propagation constant is related by an empirical relation to the graphite density in 
the media. The thickness of the material as well as the required concentration of the 
graphite content in a polystyrene foam for a given angle of incidence and figure of merit 
have been computed. (The figure of merit is defined (Shimizu & Suetake 1969) as the 
ratio of the RAM thickness to the wavelength of the lowest frequency at the maximum 
angle of incidence.) These wide incidence angle response absorbers are reported to have
very good reflection properties (-20 dB) at normal as well as at oblique incidences up
to ±70°.

Analytical determination of the functional form of the variation of $\mu(z)$ and $\varepsilon(z)$
necessary to produce a specified maximum reflection coefficient over a range of
frequencies and incidence angles for a particular thickness of the material is an
intractable problem. In practice, some model variations are assumed for $\mu(z)$ and $\varepsilon(z)$ to
analyse the material (Ruck et al 1970; Perini & Cohen 1991).

Naito & Suetake (1965) used a relay impedance method to design a multilayered
absorber which was primarily meant for anechoic chambers. With the input impedance
at the metallic wall being zero and the intrinsic impedance of free space 377 ohms, the
absorber layer is essentially a transformer between these. When multiple layers are
used, this transition is achieved in steps. Naito & Suetake (1965) have
computed the frequency response of absorbers with different numbers of layers and
plotted this for comparison. These formulae were later modified (Shimuzu & Suetake
1967) for the optimum thickness using lossy dielectric materials. They have reported
a figure of merit of 0.42 as compared to the earlier designs, for which it was nearly 2.
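
The stepwise impedance transformation from the metallic wall to free space can be sketched with the standard transmission-line recursion. This is not the relay impedance procedure of the cited authors, only a generic normal-incidence illustration in the same spirit; the function name and the layer values are hypothetical, and an $e^{j\omega t}$ convention with $\varepsilon_r = \varepsilon' - j\varepsilon''$ is assumed.

```python
import cmath
import math

Z0 = 376.730313668  # intrinsic impedance of free space, ohms


def multilayer_reflection(layers, wavelength):
    """Complex reflection coefficient of a metal-backed stack at normal incidence.

    layers : list of (eps_r, mu_r, thickness) tuples ordered from the metal
             wall outwards; eps_r and mu_r may be complex.
    """
    z_in = 0.0 + 0.0j  # short circuit at the metallic wall
    for eps_r, mu_r, d in layers:
        z = Z0 * cmath.sqrt(mu_r / eps_r)
        gamma = 1j * 2 * math.pi / wavelength * cmath.sqrt(mu_r * eps_r)
        t = cmath.tanh(gamma * d)
        # Transform the impedance seen behind this layer to its front face.
        z_in = z * (z_in + z * t) / (z + z_in * t)
    return (z_in - Z0) / (z_in + Z0)


if __name__ == "__main__":
    # Two illustrative lossy layers, the lossier one next to the metal.
    stack = [(10 - 5j, 1.0, 0.002), (3 - 1j, 1.0, 0.004)]
    print(20 * math.log10(abs(multilayer_reflection(stack, 0.03))))
```

Evaluating the function over a range of wavelengths for stacks with different numbers of layers gives frequency responses that can be compared in the manner described above.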

4.2d Geometric transition absorbers: The difficulty in making multiple layers with
different values of surface resistivity can be bypassed by using a single layer of uniform
material in geometrical shapes in which the absorption coefficient increases towards
the metal surface. Wedge, pyramidal, and conical shapes are typically used for this
purpose. Such shaped absorber panels are widely used in the wall construction of
anechoic chambers. However, these are not used in airborne applications due to
obvious aerodynamic considerations. The geometric transition method can also be
combined with the inhomogeneous layer method to further increase the bandwidth of
the absorber. Combining these two techniques results in a better angular performance.
Typically, for an inhomogeneous pyramidal absorber, the reflection coefficient is below
0.1 for incidence angles up to 50-60° (Ruck et al 1970).

This section is concluded by emphasising that the broad band absorbers cannot be
used by themselves for aircraft and missile applications due to their inherent bulkiness.
However, when mixed with structural materials, composites are obtained having broad
band RCS reduction characteristics.

4.3 Chiral materials 

The materials discussed so far are in general composites, and are sometimes even
anisotropic in nature. The design and the manufacture of these materials are often
difficult. Their EM properties are frequency-dependent and hence they are usually
narrow band. In contrast, a new class of materials based upon EM chirality (handedness),
when used as RAMs (Jaggard & Engheta 1989), offers an increased bandwidth
along with improved reflection coefficient characteristics.

Table 1. Comparison of characteristics of simple isotropic and chiral media (Jaggard &
Engheta 1989).

Characteristics            Simple medium                                   Chiral medium

Constitutive relations     $D = \varepsilon E$                             $D = \varepsilon E + j\xi B$
                           $H = (1/\mu)B$                                  $H = j\xi E + (1/\mu)B$

Wave equation              $\nabla\times\nabla\times E - \beta^2 E = 0$    $\nabla\times\nabla\times E - \beta^2 E - 2\omega\mu\xi\,\nabla\times E = 0$

Wave numbers               $\beta = \omega(\mu\varepsilon)^{1/2}$          $\beta_{c\pm} = \pm\omega\mu\xi + [\beta^2 + (\omega\mu\xi)^2]^{1/2}$

Wave intrinsic impedance   $Z = (\mu/\varepsilon)^{1/2}$                   $Z_c = Z/(1 + Z^2\xi^2)^{1/2}$

Eigen modes                Linear polarisation allowed                     Only right and left circular polarisation allowed

Chirality combines optical activity (rotation of the plane of polarisation) and circular
dichroism (change in polarisation ellipticity) in the medium, which is expressed by two
unequal wave numbers corresponding to the two circularly polarised eigen modes with
opposite handedness. These materials are isotropic as well as reciprocal. The fundamentals
of chirality are available in literature (Post 1962; Kong 1975). The constitutive
relationships and characteristics of such media are compared with simple isotropic
media in table 1.

In table 1, $\xi$ is the chirality admittance which expresses the handedness of the
medium, and affects the wave transmitted through the medium. In fact, $\xi$ is a measure of
the cross coupling of electric and magnetic fields in the medium. The twist in the major
axis of the polarisation ellipse depends on the sign of $\xi$.
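
The influence of the chirality admittance on the wave numbers and on the intrinsic impedance can be explored numerically. The sketch below assumes a lossless medium with real $\varepsilon$, $\mu$ and $\xi$, and uses the forms $\beta_{c\pm} = \pm\omega\mu\xi + [\beta^2 + (\omega\mu\xi)^2]^{1/2}$ and $Z_c = Z/(1 + Z^2\xi^2)^{1/2}$; the function name and the numerical values are hypothetical.

```python
import math

EPS0 = 8.8541878128e-12  # permittivity of free space, F/m
MU0 = 4e-7 * math.pi     # permeability of free space, H/m


def chiral_parameters(eps, mu, xi, omega):
    """Return (beta_plus, beta_minus, z_c) for a lossless chiral medium.

    eps, mu : absolute permittivity and permeability
    xi      : chirality admittance, siemens
    omega   : angular frequency, rad/s
    """
    beta = omega * math.sqrt(mu * eps)  # wave number of the simple medium
    z = math.sqrt(mu / eps)             # impedance of the simple medium
    root = math.sqrt(beta ** 2 + (omega * mu * xi) ** 2)
    beta_plus = omega * mu * xi + root
    beta_minus = -omega * mu * xi + root
    z_c = z / math.sqrt(1 + (z * xi) ** 2)
    return beta_plus, beta_minus, z_c


if __name__ == "__main__":
    omega = 2 * math.pi * 10e9  # 10 GHz, illustrative
    print(chiral_parameters(EPS0, MU0, 1e-3, omega))
```

Setting `xi` to zero recovers the single wave number and the impedance of the simple medium, while a nonzero `xi` splits the wave number into two unequal values, one for each circularly polarised eigen mode.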

The chirals consist of an isotropic host medium which is embedded with randomly
oriented identical microstructures, such as the microhelices. The geometric dimensions
and density of these chiral microstructures determine the EM parameters $\varepsilon$, $\mu$ and $\xi$.

The absorption in chiral materials is determined by the chiral skin depth while their
reflection properties are governed by $\varepsilon$, $\mu$ and $\xi$. The backscattering in these materials is
independent of polarisation. Hence the introduction of these chiral materials for
Salisbury screens and Dallenbach layers results in increased absorption over a wider
frequency band with the added advantage of thinner layers (Jaggard et al 1990). For the
electric Salisbury screen, the surface impedance may be expressed as

$Z_1 = jZ_c\cot\beta_c d$, (46)

while for the magnetic screen, it is given as

$Z_1 = -jZ_c\tan\beta_c d$. (47)

Substituting these in (23), the thicknesses of the chiral layer for a required maximum
reflection coefficient can be determined. Thicknesses of the order of $\lambda/5$ give rise to 20
dB more absorption as compared to nonchiral materials (Jaggard et al 1990).

An EM pulse propagation analysis in the chiral media has shown that the chiral
coatings can be used for broad band RCS reduction even in high power pulsed radar
environment (Engheta & Zablocky 1990).

More recently, thin layers of biisotropic (nonreciprocal) chiral materials that can act
as polarisation transformers with ultrawide range of frequency have been proposed by
Tretyakov & Oksanen (1991).

4.4 The construction of RAMs in practice 

Antenna mount structures, especially those employed in measurement ranges or
chambers, are fabricated by using nonreflective materials. Blore (1964) describes the
fabrication and testing of four different polyfoam structures made for this purpose. The
scattering mechanism for polyfoam includes the effect of cell size, material homogeneity,
water content, surface finish and matching methods. The low echo area, mechanical
rigidity and relative ease of manufacture and usage make these materials the best
suited for antenna support.

Bradshaw (1989) has studied the effect of construction methods on the RCS 
reduction in structural materials such as syntactic foam, light weight honeycomb, 
single skin fibre reinforced composites (FRC) as well as the sandwich composites. The 
application of these materials having microwave absorption properties for the aircraft 
RCS reduction is quite apparent. 

Broad band response becomes essential for RAMs used in radar targets in a wide
band radar environment. The reflectivity curves of Eccosorb (NZ-series) materials
suitable for such applications have been evaluated by Hurmuth (1983). Their performance
is analysed for a relative bandwidth of about unity. Compared to the performance
at a relative bandwidth of 0.01, these absorbers are still less helpful in hiding targets
from wide band radars (He et al 1992).

Some of the Eccosorb materials have good absorption properties even above
35 GHz. In particular, the CR-series Eccosorb materials have been characterised by
Hemmati et al (1985). It is concluded that the measurements match
the curves formed by extending the specifications for these materials at lower frequencies.
The reflection and absorption coefficients increase with the density of iron in the
material. It is also found that the absorption coefficient decreases by a factor of two
when the samples are cooled to cryogenic temperatures.

Ferrites or carbon may be interspersed in rubber to make absorbers for covering the 
edges of paraboloidal reflectors (Yokoi & Fukumaro 1971). The antenna performance 
is improved by as much as 10 dB at 70°-180°. The addition of carbon increases the 
permittivity of acetylene black rubber, with the change in the imaginary part being 
more predominant; theoretical computations for the same have been compared with 
the experimental results by Kumar (1987). 

Aspects of modelling with spinel ferrites, magneto and dielectric composite materials
for forming RAM are reviewed by Deleuze (1992). Ferrofluid composites containing
monolayers of aluminium particles dispersed in a decalin carrier are considered by
Davies et al (1986) for frequencies above 100 GHz. Chains of aluminium particles with
oxide coatings are developed. It is observed that if the plane of polarisation is parallel to
the chain axis, there is a minimum effect on the variation of frequency.

Ferrite sludge obtained by precipitation treatment of industrial waste water has been
reported to possess microwave absorbing properties (McCauley et al 1980). The
powder so obtained is sintered into a mechanically coherent ferrite ceramic material.
Composites of ferrites and resins 9.5 mm thick are reported to operate as microwave
absorbers. These composites are found to be operational in the S-band (68% composite)
as well as in the X-band (75% composite), and are suggested as an inexpensive choice
for RCS reduction.

A broad band absorber with 50% relative bandwidth is obtained with a two-layer
ferrite-resin mixture incorporating short metal fibres. Materials of 4-6 mm thickness
with fibres of 1-4 mm length and 60 micrometer diameter are found to be useful in the
X-band (8-13 GHz). Good performance for wide angles of up to 45° is also obtained for
these materials (Hatakeyama & Inui 1984).

The constitutive relations for the maximum absorption of EM energy falling on
a medium have been presented by de Hoop (1981). Metal coated carbon fibres were also
used as conducting elements in RAM (Yi & Gan 1991). The use of such fibres results 
in improved attenuation and a wider bandwidth. The absorption properties are 
independent of the thickness of the material, but are related to fibre dimension and 
concentration. 

Broad band microwave absorption cannot be achieved by homogeneous materials.
However, it was noticed experimentally that some compounds made with small
conducting grains dispersed in a dielectric host can be used as broad band RAMs
(Guillot & Bobillot 1991). Various mixture laws for such designs are developed by
Guillot (1992) for the computation of dielectric permittivity. Mixtures for which the
quasistatic approximation does not hold inside the conducting medium are considered.
This results in artificial lossy diamagnetic behaviour for which a model is derived.

The application of chiral materials on spherical targets offers significant improvement
in the reflection coefficient (15-25 dB) as compared to the conventional materials
for the radii of curvature of the order of half a wavelength or larger (Jaggard et al 1991).

Polymers containing silicon are observed to possess superior electrical and mechanical
properties and can be used for RCS reduction of aircraft. A fabrication process, in
which electropolymerised film comprising such polymers or copolymers is deposited
on the surface of an anode, is described by Nagasubramanian et al (1990). Conductive
polymers like polypyrrole, polyaniline and polyacrylthiophenes have also been reported
in literature as materials in the fabrication of RAMs (Olmedo 1992).


5. Absorber measurement techniques 

The measurements on RAMs are conducted either to survey or verify product 
performance and material properties. In this context, usually three forms of material 
are measured: 

(i) Finished absorber panels or products, 

(ii) Thin sheets used as components in the design and fabrication process, 

(iii) Samples of uniform bulk materials. 

A number of methods are explained in various books and papers on microwave
measurements (Ginzton 1957; Montgomery 1957; Ono et al 1983; Knott et al 1985;
Curran 1993). Some of the important methods are listed below.

■ Parallel plate ■ Resonant cavity

■ Coaxial probe ■ Admittance tunnel

■ Transmission line techniques ■ Free space techniques

■ Waveguide system




5.1 Parallel plate method 

The parallel plate method requires that the material being tested is homogeneous, and 
nonmagnetic. The sample taken should be thin (less than 10 mm), flat and perfectly 
smooth. An LCR meter or an impedance analyser measures the impedance across 
a pair of parallel plates separated by the sample. The errors in this low frequency (less 
than 100 MHz) method are mainly due to fringing fields at the edges. Guarded 
electrodes can be used to eliminate the edge capacitance. 
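
As a rough illustration of how a parallel plate reading becomes a material property, the sketch below applies the ideal capacitor relation eps_r = C d / (eps0 A). It assumes the edge capacitance has already been removed (for example by guarded electrodes); the function names and the numerical reading are hypothetical and are not taken from the cited references.

```python
import math

EPS0 = 8.8541878128e-12  # permittivity of free space, F/m


def relative_permittivity(capacitance_f, thickness_m, plate_area_m2):
    """Ideal parallel plate estimate: eps_r = C * d / (eps0 * A).

    Fringing fields are ignored, so this is only meaningful when the
    edge capacitance has been eliminated or is negligible.
    """
    return capacitance_f * thickness_m / (EPS0 * plate_area_m2)


def complex_permittivity(capacitance_f, dissipation_factor, thickness_m, plate_area_m2):
    """Combine the real part with the measured dissipation factor (tan delta)."""
    eps_real = relative_permittivity(capacitance_f, thickness_m, plate_area_m2)
    return complex(eps_real, -eps_real * dissipation_factor)


# Hypothetical reading: 2 mm thick sample between 38 mm diameter electrodes.
area = math.pi * 0.019 ** 2
eps = complex_permittivity(20e-12, 0.05, 2e-3, area)
print(f"eps_r = {eps.real:.2f} - j{-eps.imag:.2f}")
```

The sign convention eps' - j eps'' is an assumption of this sketch; an LCR meter reporting impedance rather than capacitance would need one extra conversion step.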

The features of this method are as below. 

■ Broad band ■ Accurate and sensitive

■ Simple and convenient ■ Inexpensive

The RAMs, though usually operative in the microwave region, can nevertheless be 
accurately characterised in the audio frequency region. Lynnworth (1964) observed 
that the insertion loss in the microwave region is correlated to the capacitance of the 
material at the audio frequencies. Avcoat and polyurethane foams have been tested by 
the parallel plate method successfully (Lynnworth 1964). Acrylic panels in which 
molybdenum wires of 1 mil diameter are embedded at varying spacings are used as the 
standard for establishing the correlation. 

5.2 Coaxial probe technique 

The coaxial probe technique is useful for isotropic, homogeneous and nonmagnetic
materials. Samples of ‘infinite’ thickness with one flat surface, and without any
air gap may be tested with this method. This method is best suited for liquids and
inviscid fluids. The minimum thickness for solid samples is about 20/|ε_r|^(1/2) mm (Curran
1993).
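
The thickness requirement can be turned into a quick check. The sketch below assumes the criterion reads 20/sqrt(|eps_r|) mm, with eps_r the (possibly complex) relative permittivity of the sample; the function name is illustrative only.

```python
import math


def min_sample_thickness_mm(eps_r):
    """Minimum solid sample thickness for the coaxial probe, in mm.

    Assumes the rule of thumb 20 / sqrt(|eps_r|) mm; eps_r may be complex.
    """
    return 20.0 / math.sqrt(abs(eps_r))


# A material with eps_r = 4 would need a sample at least 10 mm thick.
print(min_sample_thickness_mm(4.0))
```

A denser dielectric shortens the wavelength inside the sample, which is why the required thickness falls as |eps_r| rises.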

The advantages of the method are: 

■ Broad band ■ Simple and convenient 

5.3 Waveguide system 

Reflectivity measurements of finished products at lower frequencies (typically
100-400 MHz) can be performed by placing a test sample at the aperture of a waveguide.
Since the measuring probe is not movable, adjustments are made at the attached
shorting stub. Although only a single frequency can be measured at a time,
stepped frequency measurements are also possible with additional equipment. A swept
frequency interferometer has been used with an aperture matched exponential horn
for the rapid real time assessment of inhomogeneities in planar microwave absorber
panels (Baker & van der Neut 1988). The residual reflection in such a system is as low as
-37 dB.

A circular waveguide can be used more effectively than coaxial cables for measuring 
the plane wave reflection properties of layered dielectrics (Rudduck & Yu 1974). This 
method is applicable to media containing high refractive index absorbing materials. 
The measurement set up can be very easily calibrated and the method is obviously 
nondestructive. 

5.4 Transmission line techniques

The transmission line techniques require that the voltage reflection coefficient of
a homogeneous material sample be measured. This is done by multiple measurements
with different terminations using a slotted section. The samples must be carefully
machined so as to fit accurately inside the waveguide or coaxial line without any air gaps,
and their faces should be flat, smooth and perpendicular to the axis of the line. The
thickness of the samples must be in the range λ/18 to λ/2. Periodic dropouts, which occur
when the sample thickness approaches λ/2, are a major drawback of the method. This method is
suitable for anisotropic and magnetic materials.

In the case of coaxial cable environment, the diameter of the standard cable may not 
be big enough for a test sample to be placed inside. This problem is avoided by using 
a larger section of cable of about one inch diameter connected to the standard cables 
using suitable taper sections. Unambiguous determination of the electrical thickness of 
the sample necessitates multiple frequency measurements. The sample is backed by 
metal sheet (a short), or an open circuited termination. The changes in the standing 
wave pattern inside the line upon inserting the sample lead to the characterisation of 
the material. 
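
The link between the standing wave pattern and the reflection coefficient can be sketched with the standard slotted line relation |Γ| = (VSWR - 1)/(VSWR + 1). The helper names below are illustrative, and only the magnitude is recovered; the phase would come from the shift of the standing wave minimum, which is not modelled here.

```python
import math


def reflection_magnitude(vswr):
    """Magnitude of the voltage reflection coefficient from a measured VSWR."""
    if vswr < 1.0:
        raise ValueError("VSWR cannot be smaller than 1")
    return (vswr - 1.0) / (vswr + 1.0)


def return_loss_db(vswr):
    """Return loss in dB (positive number); infinite for a perfect match."""
    mag = reflection_magnitude(vswr)
    return math.inf if mag == 0.0 else -20.0 * math.log10(mag)


# A VSWR of 3 corresponds to |Gamma| = 0.5, i.e. about 6 dB return loss.
print(reflection_magnitude(3.0), round(return_loss_db(3.0), 2))
```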

The attractive features of the method include its being (i) broad band, and (ii) simple 
and convenient. 

The transmission line technique using coaxial cables is accurate to within 10%, whereas
the waveguide method is precise to within 5% if the samples are machined properly (to
a precision of 0.03 mm).

Although the transmission line techniques are useful to a great extent in characterising
the RAM, they have some limitations as well. Better results are guaranteed by the
SCOC and thin sample methods if some precautions are taken (Falkenbach 1965). The
sample thickness should be of the order of λ/50 for the thin sample method, while it
should be λ/5 for the SCOC method. It is preferred that the material sample have a dc
resistance of about 1 kΩ/mm thickness.

Some of the algorithms used in computation of results from transmission line 
measurements are listed in table 2. 

Table 2. A comparison of computational algorithms employed in transmission 
line methods (Curran 1993). 


Algorithm           Measures              Computes    Ideal for

Nicolson-Ross       S11, S12, S21, S22    ε_r, μ_r    Magnetic, short and lossy samples; speed

Precision (NIST)    S11, S12, S21, S22                Long low-loss samples; highest accuracy
                                                      if no discontinuities present

Fast                S11, S12, S21, S22    ε_r         Faster and better method for lossy
                                                      materials than the previous one

Short-backed                              ε_r         Liquids and powders

Arbitrary-backed                          ε_r         Thin films, and high temperature
                                                      applications
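
To make the first table entry concrete, the following is a minimal sketch of a Nicolson-Ross type inversion for a sample in a TEM (coaxial) line, where no cutoff wavelength correction is needed. It is not the implementation referred to by Curran (1993). It assumes the S-parameters are referenced to the sample faces, a time convention exp(+jωt) with ε_r = ε' - jε'', and a sample thin enough for the principal branch of the logarithm to be the correct one. The forward model `slab_s_params` is a hypothetical helper used only to generate test data.

```python
import cmath
import math

C0 = 299_792_458.0  # speed of light in vacuum, m/s


def slab_s_params(eps_r, mu_r, thickness_m, freq_hz):
    """S11 and S21 of a homogeneous slab filling a TEM line (test data only)."""
    k0 = 2.0 * math.pi * freq_hz / C0
    n = cmath.sqrt(eps_r * mu_r)          # refractive index
    z = cmath.sqrt(mu_r / eps_r)          # normalised wave impedance
    gamma = (z - 1.0) / (z + 1.0)         # interface reflection coefficient
    t = cmath.exp(-1j * k0 * n * thickness_m)  # propagation factor
    denom = 1.0 - gamma ** 2 * t ** 2
    return gamma * (1.0 - t ** 2) / denom, t * (1.0 - gamma ** 2) / denom


def nicolson_ross(s11, s21, thickness_m, freq_hz):
    """Recover (eps_r, mu_r) from S11 and S21; fails if S11 is exactly zero."""
    k0 = 2.0 * math.pi * freq_hz / C0
    x = (s11 ** 2 - s21 ** 2 + 1.0) / (2.0 * s11)
    root = cmath.sqrt(x * x - 1.0)
    gamma = x + root
    if abs(gamma) > 1.0:                  # keep the passive root
        gamma = x - root
    v1 = s11 + s21
    t = (v1 - gamma) / (1.0 - v1 * gamma)
    # Principal branch of the logarithm: valid for electrically thin samples.
    n = cmath.log(1.0 / t) / (1j * k0 * thickness_m)
    z = (1.0 + gamma) / (1.0 - gamma)
    return n / z, n * z


s11, s21 = slab_s_params(4 - 0.4j, 2 - 0.2j, 2e-3, 10e9)
eps_r, mu_r = nicolson_ross(s11, s21, 2e-3, 10e9)
print(eps_r, mu_r)
```

For thicker samples the phase of the propagation factor wraps, and the branch of the logarithm has to be resolved from measurements at several frequencies, in line with the remark above on determining the electrical thickness unambiguously.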

With swept signal sources or using time domain reflectometry technique, fast
measurements are possible over a wide range of frequencies. The time domain 
reflectometry technique involves sending a narrow pulse waveform which is scanned by 
a computer controlled broad band sampling oscilloscope. This is transformed into the 
frequency domain by the fast Fourier transform (FFT). Selection of timing windows is 
crucial for the accuracy of the measurements. Improved stability of the pulse generators 
as well as the sharpness of the pulses result in more accurate performance. 
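
The role of the timing window can be illustrated with a small synthetic example. The sketch below gates the recorded trace around the echo of interest, transforms both the incident pulse and the gated echo to the frequency domain, and takes their ratio. A naive DFT is used so that the example needs no external library; an FFT would give the same numbers faster. The pulse shapes, delays and window limits are invented for illustration.

```python
import cmath
import math


def dft(samples):
    """Naive discrete Fourier transform of a real or complex sequence."""
    n = len(samples)
    return [
        sum(samples[k] * cmath.exp(-2j * math.pi * m * k / n) for k in range(n))
        for m in range(n)
    ]


def time_gate(samples, start, stop):
    """Keep samples with start <= index < stop and zero everything else."""
    return [v if start <= i < stop else 0.0 for i, v in enumerate(samples)]


def reflection_spectrum(incident, recorded, window):
    """Ratio of the gated echo spectrum to the incident pulse spectrum."""
    inc = dft(incident)
    echo = dft(time_gate(recorded, window[0], window[1]))
    return [e / i if abs(i) > 1e-12 else 0j for e, i in zip(echo, inc)]


def gaussian_pulse(n, centre, sigma=2.0, amplitude=1.0):
    """Sampled Gaussian pulse used as a stand-in for the narrow TDR pulse."""
    return [amplitude * math.exp(-0.5 * ((i - centre) / sigma) ** 2) for i in range(n)]


# Synthetic record: incident pulse, echo from the sample (amplitude 0.5),
# and a later spurious echo (amplitude 0.3) that the window must exclude.
N = 64
incident = gaussian_pulse(N, 8)
sample_echo = gaussian_pulse(N, 28, amplitude=0.5)
late_echo = gaussian_pulse(N, 53, amplitude=0.3)
recorded = [a + b + c for a, b, c in zip(incident, sample_echo, late_echo)]

spectrum = reflection_spectrum(incident, recorded, (20, 38))
print(round(abs(spectrum[1]), 3))
```

Widening the window so that it also captures the incident pulse or the late echo changes the computed ratio, which is why the window selection matters for accuracy.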

5.5 Cavity technique 

A part of the cavity resonator is filled with the sample without affecting the field 
distribution inside. This technique, essentially a single frequency one, involves complex 
analysis. But it is suitable for anisotropic materials as well. This is the most accurate 
technique, especially for low loss homogeneous materials and liquids. Tunable cavities 
are used in practice when multiple frequency measurements are required. 

5.6 Admittance tunnel method 

The admittance tunnel is a long absorber-lined box illuminated by a small antenna at 
one end and fitted with a test aperture at the other. A hybrid Tee is used to isolate the 
transmitted and received signals. A continuous wave (CW) signal source is used for 
transmission. Since the transmitting and receiving antennas are the same, the residual 
reflections could be minimised. The box shape is useful in obtaining a near planar 
incident wave with little disturbance from the outside environment. This method, 
developed by Rockwell International Corporation, is ideal for thin sheets. 

5.7 Free space techniques 

The free space techniques are in general high frequency, non-contacting and nondestructive
methods, typically applicable to large, flat, parallel faced homogeneous
samples. Notwithstanding the title, the measurements are not always performed in a free
space-like environment; they are simply not done inside the confines of a waveguide or a cavity.
It is difficult to measure the loss of thin and low loss samples with these methods. In
general, the errors in free space techniques are due to: (i) finite sample size, (ii) non-plane
wave illumination, (iii) mechanical instability or misalignment between sample and
antenna, and (iv) imperfect quality of the anechoic environment.

Typically, the accuracy of measuring ε_r is 1-5% while the value of tan δ is precise to
± 0.005. Some of the important methods under this category are listed below.

■ NRL arch method ■ Interferometer method (open resonator) 

■ RCS range method ■ Direct spatial standing wave measurements 

■ Tunnel method ■ Double horn-reflector system 

■ S-parameter method 

5.7a NRL arch method: This near-field method, developed by the Naval Research
Laboratory (NRL), operates by illuminating the absorber panel by a small horn
antenna. A second horn antenna, which is also mounted on the same arch-like
framework, receives the reflected energy. Being a near-field method, it results in lower
reflectivity values. Only the amplitude characterisation of the reflected signals is
possible here. Using swept oscillators, the measurement over a range of frequencies can
be taken (Lehto et al 1991).

5.7b RCS range method: The RCS range method is a far-field technique in which the
material sample is mounted on a flat plate which is in turn installed upon a target
support column in a vertical position. The panels are evaluated in a more realistic
environment akin to the one in which they may actually be used. In order to
accommodate swept frequency signal sources, dedicated costly instrumentation may
be required. The size of the plate is selected to be greater than 3λ, typically 5λ. Due to
the limitation in determining the null width, the maximum length of the edges of the sample
should be less than 15λ.

5.7c Interferometer method: The nature of reflections from a sample measured
outside the confines of a closed system can also lead to characterisation of EM
properties of the material. This set-up is similar to that of the transmission line technique
(§ 5.4) but for the sample holder and sliding short, which are replaced by a tuner and
a horn. The sample is mounted on a metal support plate, a few feet in front of the horn.
The tuner is used to remove the residual reflections. At higher frequencies, this method
is similar to the Fabry-Perot optical interferometer technique, used in characterising
optical materials. An open cavity is used at microwave frequencies.

The open cavity method has been used successfully to obtain the complex magnetic 
permeability of materials loaded with conducting powders (Guillot & Bobillot 1991). 
The material characteristics are obtained by measuring the resonant frequency and the 
quality factor of the cavity when a slab of the sample is placed inside. This method is, 
however, not suitable for taking measurements at high temperatures. 

5.7d Free space tunnel method: The sample in the form of a sheet is placed
between transmitting and receiving antennas. The spacing on either side of the sample
should be greater than 2d²/λ, where d is the largest dimension of the antenna or the
sheet. The power transmitted through the sample is measured to determine its EM
properties.
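
The spacing requirement is the usual far-field criterion and is easy to evaluate. The sketch below assumes it reads 2d²/λ, with d the largest dimension of the antenna or the sheet; the function name and the example dimensions are illustrative.

```python
C0 = 299_792_458.0  # speed of light in vacuum, m/s


def far_field_distance_m(largest_dimension_m, freq_hz):
    """Minimum antenna-to-sample spacing 2 d^2 / lambda, in metres."""
    wavelength = C0 / freq_hz
    return 2.0 * largest_dimension_m ** 2 / wavelength


# A 0.3 m sheet measured at 10 GHz needs roughly 6 m on either side.
print(round(far_field_distance_m(0.3, 10e9), 2))
```

Since the distance grows with the square of the sample size and linearly with frequency, large panels at high frequencies quickly demand impractically long ranges.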

5.7e Free space S-parameter method: This is an accurate measurement method in 
which the S parameters are measured both in amplitude and phase using sophisticated 
equipment such as a vector network analyser. Although the material characterisation is 
better, the capital expenses are high. 

5.7f Direct spatial standing wave measurement technique: The direct spatial standing 
wave measurement technique is used for precise characterisation of absorber panels, 
especially of pyramidal shape. An infinitesimal dipole antenna is moved about the 
panel and the field strength is measured at each position. 

This method is suitable for measurements at oblique angles of incidence. Absorbers 
can be quickly characterised over a wide range of frequencies and incident angles with 
this method. The scattering parameters like VSWR and scatter angle have been
measured for pyramidal absorbers using this method (Ono et al 1983). The error in the
measurement is found to be ± 20% for VSWR and ± 3° for scattering angle at 4 GHz,
when measurements are taken 1.5 m away from the sample.

5.7g Double horn-reflector system: A new method of characterising rectangular
panels of homogeneous RAMs has been suggested by Kent (1982). In this wide band
technique, the permittivity and permeability of the materials are obtained from the
measured values for S11 and S21. The accuracy of this frequency domain method is
stated to be 5-10%.

Of the methods discussed above, the NRL arch method is the most famous and the 
most frequently used. Although this method is universal, it has some disadvantages as 
well. The disadvantages of this method are listed and some novel free space methods are 
suggested by Brumely (1987). These are:

(i) inverse synthetic aperture radar imaging for samples with large area;

(ii) wide band coherent radar measurements; and

(iii) modification to the conventional time gathered RCS measurements, which 
are made as a function of the aspect angle of the sample wall of the targets. 

Yet another nondestructive technique for characterising absorber-lined chambers is
to use short dipoles as the measuring probes (Mishra et al 1982). These dipoles are
reported to have a discrimination of the order of -25 dB. The microwave goniometer
could also be used for characterising absorber panels (Jones & Wooding 1964).

Figure 4. The suitability of the measurement methods for given frequency range
and the nature of the material.

Other novel methods also reported are microwave diversity imaging (Li et al 1989)
and the microwave scanning method (Wims & Palmer 1991), which are essentially
nondestructive in nature.

The frequency ranges of application of some of the measurement methods discussed 
above are given in figure 4 (Curran 1993). The optimum method for an application is 
chosen after taking the various limitations and advantages of these methods into 
consideration. Methods usually selected for different applications are listed below. 

6. Applications of RAM

Although initially intended for camouflaging aircraft, and for enhancing radar
performance, RAMs have found their way into numerous other areas. Some of the
important applications of the RAMs are given below.

6.1 RCS reduction of aerospace vehicles 

RAMs are primarily used to achieve broad band RCS reduction for aircraft (Walkington
& Huster 1979; Sweetman 1982, 1987; McCluggage 1987; Schmitman & Warwick
1990; Cobucci 1991; Brown 1992; Martin 1992) and missiles (Hanson & Kiehle 1982).
Ferrite-based paints are one such class of materials, developed early on, that were found to
be useful not only over a wide band but also at the lower frequencies. Lighter RAMs
were fabricated subsequently, but most of them are still on the classified list.

6.2 Improvement of radar performance 

Absorber sheets are used to shield reflecting objects in the vicinity of the radar, such as
the base of the radome and parts of the antenna mechanism, that may otherwise cause
spurious effects on radar signals. In shipboard applications, the mast and the bulkhead are
also covered with absorbers. Similarly, treating the buildings near an air traffic control
radar with RAM improves radar performance (Emerson 1973).

6.3 Reduction of antenna RCS 

In some military applications it becomes necessary to keep the RCS of antennas as low 
as possible. Covering the aperture with an absorber coating can lead to considerable 
RCS reduction. In a typical case with a circular waveguide termination (Lee et al 1985), 
this RCS reduction can be as much as 13 dB. 
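
For readers less used to decibel figures, the quoted reduction can be converted to a linear ratio of RCS values (which scale as power). The helper below is a generic conversion and is not specific to the cited work.

```python
def db_to_power_ratio(db):
    """Convert a decibel value to a linear power (or RCS) ratio."""
    return 10.0 ** (db / 10.0)


# A 13 dB reduction means the RCS falls to about one twentieth of its value.
print(round(db_to_power_ratio(13.0), 2))
```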

6.4 Antenna sidelobe reduction

The sidelobe level of antennas can be significantly reduced by placing the antenna
within a tube made of absorber panels. Similarly, obstructions from feed supports in
reflector antennas are eliminated by covering them with absorber panels (Emerson
1973). The effect of edge diffraction in very large reflector antennas can be significantly
reduced by covering the edges of the reflector with absorber sheets (Yokoi & Fukumaro
1971).

6.5 Anechoic chambers 

Anechoic chambers are required in taking diverse EM measurements such as the 
aircraft RCS patterns, the antenna radiation patterns, the radome boresight error and 
various other simulation studies. An anechoic chamber for this purpose has to be 
isolated from the outside environment so that ideally no signal crosses the wall in either 
direction. The walls, floor and ceiling of such chambers are made absorptive to create 
a free-space-like environment. 

The suitability and design of various RAMs for this application are widely reported
in the open literature (Severin 1956; Cherepanov 1974; Mitsmakher 1980; Brumely 1987;
Joseph 1988). The design of broad band absorbers for electromagnetic susceptibility
(EMS) measurement chambers was suggested by Mishra et al (1982).

Most of the chambers use absorbers cut in the pyramidal, conical or even an
aggregate sine-wave shape for improved performance. Such chambers in the test region
have maximum reflection levels 70 dB below the direct path signal, and shielding
isolation as low as -140 dB (Emerson 1973). State of the art anechoic chambers have
a frequency range of about 100 MHz-100 GHz.

6.6 Improvement of outdoor range performance 

The effect of scattering from isolated objects in an outdoor range can be offset by 
covering them with RAMs. This results in a free-space-like environment so that the 
tests are more reliable. Polyfoam structures are found to be very effective in such 
applications (Blore 1964). 

6.7 EMI applications 

With the increasing use of higher frequencies in electronic circuits, it has become
necessary to provide shielding from the electromagnetic interference (EMI) from/to
nearby systems. Absorber-lined enclosures may be used when an ideal free space 
condition has to be met without disturbing the surroundings for electronic circuits. 
Customised absorbers are used in high power applications (Emerson 1973). 

6.8 As part of the microwave components

A number of waveguide and coaxial microwave components make use of some absorbing
material inside. For example, resistive vanes made of such materials are employed in
making microwave attenuators (Nagornov et al 1978) and phase shifters (Tretyakov
& Oksanen 1991; Marty et al 1992). Absorber materials are also used for harmonic
suppression in transmitters, and for gain stabilisation in amplifiers (Bostick 1985).

6.9 Mode suppresser in circular waveguides 

When the inside surface of a circular waveguide is coated with RAM, the lower order
modes are suppressed significantly. The amount of suppression depends on how lossy
the material is. Such mode suppression can also result in the RCS reduction of
circular structures. Coating the inside surface of a cavity with RAM reduces its RCS
significantly (Lee et al 1985, 1986; Chou et al 1987; Chou 1988).

6.10 In parallel plate waveguides 

When the inner surface of a parallel plate waveguide is covered with a RAM, it can
propagate TM(e) 10 and TM(e) 10 modes over unlimited bandwidth. Such parallel plate
waveguides are much more useful than a corrugated horn as a feed antenna. A detailed 
analysis of such structures is described by Cheng & Ostertag (1986). It is shown that the 
performance is insensitive to the RAM characteristics. 

6.11 Protective shields in microwave oven 

A household application of RAM comes in the form of protective shields in microwave 
ovens. Ferrite paints are coated over the oven casings to reduce any possible leakage of 
the radiation to the atmosphere. A commercially available paint from the Japanese 
company TDK was effectively used for this purpose (Ashley & Gilmore 1988). 

6.12 Removal of ghost images in TV pictures 

A paint from NEC could reduce the reflections significantly in the frequency range
3.5-20 GHz (Ashley & Gilmore 1988). This sandwich ferrite RAM was successfully
used for removing ghosts in TV pictures due to clutter effects from nearby buildings.
Interestingly, in the early stages of the US Department of Defense stealth programme,
this Japanese paint was in high demand.

6.13 Electronic countermeasure applications 

Innovative applications of RAMs have also been proposed in Electronic Countermeasures
(ECM). A recent patent suggests coating the inner surface of the countermeasure
cartridge with plastisol (Tsuji 1992). The heat generated while expending the countermeasure
payload softens the plastisol coating so that the entire container acts as a radar
absorber rather than a reflector.

6.14 RCS enhancement of the missile decoys 

Quite ironically, absorber materials can themselves be used to increase the RCS as well. 
In order to improve the detectability and hence the effectiveness of a missile decoy, its
rear side is coated with absorbers (Harrington 1987). This reduces the creeping waves,
thus resulting in increased RCS in the head-on, i.e. the backscattering direction.

7. Trends in RAM 

As mentioned in §§ 1 and 2 of this paper, contrary to the popular belief, RCS reduction
is not one of the latest “emerging” technologies. The active research endeavour in this
area is at least three, possibly four, decades old. The Soviets by 1965 had already
noticed and were alarmed by the massive and increasing US funding to explore RCS
reduction as an effective means of combat. The US interest and the intensification of the
programme could be judged by the sharp funding increase by the US Air Force from
35.5 million (in 1962) to 119 million (in 1963) to 155 million dollars (in 1964)
(Shneyderman 1985). Hence, the genesis for active research in RCS reduction can be
traced back to the late Fifties rather than, as is popularly believed, to the early Seventies.

When the stealth programme was officially unveiled in the early Eighties, the cost of
the Northrop B-2 was placed at 650 million dollars per piece (as compared to 15 million for the
Lockheed F-16). As was later to be extensively commented upon by experts in the
aerospace industry, the B-2 is far from optimal from aerodynamic considerations; its
novelty lies in its stealth characteristics. Of these, shaping as a means for RCS reduction
was known, and depended solely on enormous computer time for the stealth design. It
is apparent from the external features of the B-2 that it used smooth blended convex
profiles as a means of shaping. It can therefore be presumed that much of the money for
RCS reduction (the B-2 is a 20 billion-dollar programme) went into novel research on
RAM technology.

A parallel exists in the Lockheed Have Blue and its successor F-117A models. From
the shaping point of view, both the Have Blue and F-117A had a rather incongruous
profile consisting of large external flat plates. The second means of achieving RCS
reduction was to incorporate RAM in the aircraft structure itself. The cost of the Lockheed
F-117A is once again prohibitively large.

Although US agencies have closely monitored the development of RAM technology 
in foreign countries, for example Russia and China, it has not been possible to put a cost 
tag to these efforts. In case of China, it is opined that much of the effort is being funded 
indirectly through the science and technology budget. It is conjectured that rather 
large RAM research programmes exist with the other four super powers, sometimes 
without even logical conclusion. For example, the French have displayed an extensive 
research interest in RAMs without apparently having a corresponding military aircraft 
programme. 

It has indisputably emerged, from the figures connected with the stealth programme,
that the price of RAM technology has tended to be prohibitively high. Much of this
research must have been directed towards analysis, design and development, fabrication
and measurement aspects of RAM. We propose to look into the trends in RAM
technology with this perspective.

7.1 Bandwidth 

The EM absorbers made a rather modest beginning. It started with the quarter-wavelength
transmission line concept of impedance matching. A constraint in applying
dielectric RAMs is their bulkiness. This could be offset by applying thin coatings of
ferrites and other magnetic RAMs. However, both the dielectric and magnetic RAMs
were frequency-dependent, and therefore narrow band absorbers.
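
The narrow band nature of a quarter-wavelength design can be seen from a simple transmission line calculation. The sketch below uses one classical realisation of the concept, the Salisbury screen (a resistive sheet placed a quarter wavelength in front of a metal plate, with an air spacer), as an assumed illustration; it is not drawn from the references cited in this section. The metal backing appears as an open circuit at the sheet only at the design frequency, so the match degrades quickly away from it.

```python
import math

C0 = 299_792_458.0      # speed of light in vacuum, m/s
Z0 = 376.730313668      # impedance of free space, ohm


def salisbury_reflection(freq_hz, design_freq_hz, sheet_resistance_ohm=Z0):
    """Normal incidence reflection coefficient of an air-spaced Salisbury screen.

    The spacer is a quarter wavelength at design_freq_hz. Not defined at
    freq_hz = 0, where the shorted spacer has zero length electrically.
    """
    spacer_m = C0 / design_freq_hz / 4.0
    beta_d = 2.0 * math.pi * freq_hz / C0 * spacer_m
    y_short = 1.0 / (1j * Z0 * math.tan(beta_d))   # shorted line seen at the sheet
    y_in = 1.0 / sheet_resistance_ohm + y_short    # sheet in parallel with the line
    return (1.0 / Z0 - y_in) / (1.0 / Z0 + y_in)


def reflection_db(freq_hz, design_freq_hz):
    """Reflection level in dB, floored to avoid log of zero at a perfect match."""
    mag = abs(salisbury_reflection(freq_hz, design_freq_hz))
    return 20.0 * math.log10(max(mag, 1e-30))


for f_ghz in (5.0, 8.0, 10.0, 12.0, 15.0):
    print(f_ghz, round(reflection_db(f_ghz * 1e9, 10e9), 1))
```

Stacking several such resonant sections with different centre frequencies widens the band, which is the multilayer approach and also the source of the bulk and weight penalty.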

The narrow band RAMs can be used in multiple layers to obtain broad band 
absorbers. However, either due to the bulkiness (dielectric RAMs) or higher density 
(magnetic RAMs), the weight penalty is still considerably high. 

A new class of RAM based on the principle of EM chirality is reported to be much
lighter than the ferrites and requires thinner coatings (λ/5 to λ/6) than the conventional
RAMs. Of late, a newer class of RAM based on Schiff-base compounds has been
reported which is said to be as much as ten times lighter than the magnetic RAMs for
the same absorptive performance. Furthermore, these retinyl derivatives
(bacteriorhodopsin) are reported to have ultrawide band absorption characteristics.

7.2 Analysis 

The analysis of RAM for future prediction has followed the developments in computational
electromagnetics. In the high frequency domain, the methods initially used in the
1960s were geometrical optics (GO) and its extensions. The modification of GO to the
GTD-type theories permitted RAM analysis of increasing complexity; these are
discernible in the shooting and bouncing ray (SBR) method of Lee (1985) to analyse the
RAM-coated engine cavity. The approach taken in the low-frequency domain has been
that of the method of moments (MOM).

However, both GTD-type theories, and the MOM and its derivatives suffer from
severe shortcomings. The GTD requires the scatterer to be convex, and the symmetrical
cavities have stretched the SBR method to its limit. As of today, asymptotic ray-theoretic
methods cannot handle either a scatterer having concave and convex surfaces
in the neighbourhood, or a surface of arbitrary concavity. The MOM can solve such
problems with remarkable ease but the size of the scatterers must be electrically small.
Unfortunately, the size (electrical) of the scatterer encountered in reality makes the
application of MOM computationally intractable.

The time domain methods based on finite differences, which are similar to the
MOM, have of late attracted much interest. The finite difference time domain (FDTD)
method was proposed in 1966 (Yee 1966). But it is only with the advent of parallel
computers that its potential seems to have been unlocked. FDTD can as yet handle only
low-frequency scattering problems, although at a speed usually one order of magnitude better
than the MOM. Kashiwa et al (1990) have successfully presented a three-dimensional
time domain analysis of an electrically small aircraft structure coated with thin
resonant absorbers. It is expected that, if the technological breakthroughs in parallel
computers continue at the current pace, FDTD algorithms should be able to solve
electrically large problems in another ten years' time. In this context, it must be
mentioned that the fastest supercomputers to date require 10 days on a stand-alone
basis to analyse aircraft coated with RAM.
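
The leapfrog update at the heart of the Yee scheme can be shown in one dimension. The sketch below is a deliberately minimal free space example in normalised units, with a Gaussian soft source and simple reflecting grid ends; the grid size, Courant number and pulse parameters are arbitrary illustrative choices. A three-dimensional RAM-coated target needs the same update applied to six field components with material coefficients, which is where the computational cost arises.

```python
import math


def fdtd_1d(n_cells=400, n_steps=100, source=200, courant=0.5, t0=30.0, spread=8.0):
    """Propagate a Gaussian pulse on a 1-D Yee grid and return the E field.

    Fields are normalised so that the update coefficient is the Courant
    number (c * dt / dx), which must not exceed 1 for stability.
    """
    e = [0.0] * n_cells   # electric field at integer grid points
    h = [0.0] * n_cells   # magnetic field at half-integer grid points
    for n in range(n_steps):
        # Update E from the spatial difference of H (e[0] stays zero).
        for k in range(1, n_cells):
            e[k] += courant * (h[k - 1] - h[k])
        # Soft source: add a Gaussian in time at one node.
        e[source] += math.exp(-0.5 * ((n - t0) / spread) ** 2)
        # Update H from the spatial difference of E (h[-1] stays zero).
        for k in range(n_cells - 1):
            h[k] += courant * (e[k] - e[k + 1])
    return e


field = fdtd_1d()
right_half = range(201, 400)
peak = max(right_half, key=lambda k: abs(field[k]))
print("right-going pulse peak near cell", peak)
```

With a Courant number of 0.5 the pulse advances half a cell per time step, so after the source peak at step 30 it should be found roughly 35 cells from the source by step 100.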

7.3 Design 

Since most of the military and particularly the electronic warfare applications require
wide bandwidths, it is essential that RAMs have broad band or even ultrawide band
characteristics. The present practice is to obtain broad band RAMs by multilayering of
resonant RAMs. In actual practice, however, determining the thickness of individual
layers and the concentration gradients is highly computation intensive. Bastiere
(1989) has applied fuzzy set theory to evolve a multi-criteria optimisation method for
designing RAMs consisting of as many as 5 different layers.

7.4 Measurements 

As mentioned in § 5, well-known methods exist for characterising RAMs right from the 
audio frequencies up to 60 GHz with varying degrees of accuracy. In contrast, 
Hemmati et al (1985) made transmittance and reflectance measurements on Eccosorb 
in the frequency range 35 GHz-3 THz at both ambient and cryogenic temperatures. 
The development of measurement techniques in the quasi-optical range has acquired 
additional significance, since the extensive use of this section of EM spectrum for 
next-generation military applications is imminent. Such developments tie well with the 
NASA Terahertz Technology Programme which is being funded on a priority basis. 

7.5 Fabrication 

Although applying RAMs to military aircraft structures is decidedly advantageous, it is
somewhat offset by the weight penalty it imposes on the cruise performance of the
aircraft. It is now feasible to use radar absorbing structures (RAS) in the construction of
the aircraft. Since the RAS consists of embedding RAMs within the parts of the aircraft
structure, there is almost no weight penalty. The concept of RAS is also consistent with
the present day trend of increasing use of composites in the fabrication of fighter
aircraft. In the state-of-the-art McDonnell Douglas F-18 aircraft (which interestingly has inherent
stealth characteristics) the composites used are 9.9% by volume. In contrast, for the
YF-23 ATF, a future aircraft programme, this percentage is to go up by as much as
45-50% (Cobucci 1991).

RAS have been obtained by embedding magnetic RAMs in a polymeric matrix
(Stonier 1991). Similarly, the use of bismaleimide resins has also been reported in these
composites. In the vicinity of engines, which are a major contributor to the overall RCS
of the aircraft, polyamides have been used (Stonier 1991). Finally, novel fabrication
techniques for RAS have succeeded in sustaining the RCS reduction characteristics at
temperatures as high as 350°F, which is well in excess of the maximum temperature
encountered by the state-of-the-art supersonic fighter aircraft.

As a further corollary of the RAS concept, the possibility of constructing inherently 
stealth radomes is also being considered (Kumar et al 1994; Vinoy & Jha 1994). 
Polyethylene fibres in composites result in RAS of high strength which can be used for 
EM window (radome) applications (Cain & Corda 1991). 

It is interesting to note that several radome material derivatives e.g., spinel ferrites 
(Deleuze 1992) have been successfully used as RAMs. This is hardly surprising since the 
electromagnetic analysis and design of radome and RAM involve similar physical 
parameters such as dielectric and magnetic loss tangents (Kumar 1994; Vinoy & Jha 
1994). 

Since the exteriors of aerospace vehicles are subject to environmental hazards, the 
current trend is to develop RAMs which are inherently resistant to the environmental 
elements. Silicone rubber, which is a well-known anti-rain erosion material, has been tested 
as RAM (Army Material Development and Readiness Command 1982). Similarly, 
RAMs have been developed with antiablation and nuclear-damage resistance properties. 
The inorganic ceramics $\mathrm{Al_2O_3}$ and $\mathrm{ZrO_2}$ have been tested in this context. Once again, 
it may be worthwhile to recall that both these inorganic ceramics are extensively used in 
constructing hypersonic radomes for missiles (Kumar 1994; Kumar et al 1994). 


8. Summary 

The research in Radar Absorbing Materials (RAMs) as a viable means for RCS 
reduction has been actively pursued for at least four decades. The RAMs fabricated 
initially were based upon transmission line approach and were narrow band absorbers. 
The applicability of these resonant absorbers was enhanced by multilayering to obtain 
broad band absorbers. 
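
As a rough illustration of the transmission line approach mentioned above, the sketch below evaluates the standard normal-incidence reflection loss of a single absorber layer backed by a metal plate, treated as a short-circuited transmission line. The function name, the material constants and the 10 GHz, 2 mm operating point are illustrative assumptions and are not taken from the studies reviewed here.

```python
import cmath
import math

C0 = 299_792_458.0  # speed of light in vacuum, m/s


def reflection_loss_db(freq_hz, thickness_m, eps_r, mu_r):
    """Reflection loss (dB) of one metal-backed layer at normal incidence.

    eps_r and mu_r are relative permittivity and permeability, written as
    x' - 1j*x'' for lossy materials (assumed sign convention).
    """
    # Complex propagation term through the layer thickness
    gamma_d = 1j * 2.0 * math.pi * freq_hz * thickness_m * cmath.sqrt(mu_r * eps_r) / C0
    # Input impedance of the short-circuited line, normalised to free space
    z_in = cmath.sqrt(mu_r / eps_r) * cmath.tanh(gamma_d)
    # Reflection coefficient seen from free space
    gamma = (z_in - 1.0) / (z_in + 1.0)
    return 20.0 * math.log10(abs(gamma))


# Hypothetical lossy ferrite-like layer, 2 mm thick, evaluated at 10 GHz
example_rl = reflection_loss_db(10e9, 2e-3, 12 - 3j, 1.5 - 1j)
print(f"Reflection loss: {example_rl:.2f} dB")
```

Sweeping `freq_hz` with fixed thickness shows the narrow absorption notch typical of such resonant single-layer absorbers, which is what motivates multilayering.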

Analyses have however shown that the ultrawide band radars can be effectively used 
as electronic countermeasures against broad band absorbers. This has motivated the 
current search for ultra wide band RAMs. Compared to the transmission line approach 
and multilayer concept, the initial successes reported in the area of ultrawide band 
RAMs have come from the completely novel approach of chirality and by exploring 
even biochemical products for this purpose. 

In order to overcome the penalty of weight imposed by multilayered dielectric or 
ferrite RAMs coated on the aircraft structures, there is a decided shift towards using 
Radar Absorbing Structures (RAS) where the RAMs are embedded in the polymeric 
composite matrix. The concept of RAS is compatible with the trend of using increasing 
volumes of composites in the fabrication of the latest and of future generation fighter 
aircraft. 

A comparison with our concurrent study in radomes has shown that several RAMs 
are actually radome materials. This observation is significant since the RAMs in 
general are classified, whereas extensive information on the radome materials is 
available in the open literature. 

Besides the obvious military applications, RAMs have many prevailing and potential 
applications in the industrial and even domestic sectors. RAM has thus emerged as 
a typical dual-use technology, where commercially viable research and development 
can be pursued independent of the defense sector. 

An awareness of the underlying principles of RAM analysis, design and fabrication 
methods, and the actual materials used as RAMs, could enable material scientists to 
indigenise and even pioneer the next generation of RAMs. 

Abstract. A bioprocessing approach for the extraction of base, nuclear 
and precious metals from refractory and lean grade ores has been reviewed 
in this paper. Characteristic morphological features of Thiobacillus ferrooxidans, 
the organism which has been extensively used for biooxidation 
of sulphide ores, have been discussed. Mechanisms of chemoautotrophy and 
mineral oxidation have been illustrated. The current engineering applications 
of this microorganism have also been brought out. Various methods 
for accelerating the growth of Thiobacillus ferrooxidans for faster biooxidation 
and genetic manipulation for development of desired strains have been 
outlined. 

Keywords. Thiobacillus ferrooxidans; chemoautotrophy; iron oxidation; 
sulphur oxidation; bioleaching; genetic manipulation. 


1. Introduction 

Thiobacillus ferrooxidans, the most widely implicated bacterium in the bioleaching of 
sulphide minerals, was isolated for the first time from the acid drainage of coal mines in 
1947 (Colmer & Hinkle 1947). Since then, tremendous research activity has followed in order 
to characterize these organisms and investigate their potential in the oxidation of various 
sulphide minerals (Duncan et al 1964; Brierley 1978; Dugan & Apel 1978; Harrison 
1982; Norris & Kelly 1982; Groudev 1985; Huber et al 1986; McCready 1988). 

Its commercial application in the leaching of copper from waste grade ore dumps 
was demonstrated by the Kennecott Copper Corporation, USA in 1963. During the 
last three decades, the commercial utility of this microorganism has been proved in the 
extraction of copper and uranium from waste and lean grade ores as well as tailings. 
Over 30% of the world’s copper is currently estimated to be produced through 
bioleaching operations. For about a decade, the industrial application of bioprocessing 
has been further extended to enhance gold recovery from refractory ores and concentrates 
containing encapsulated gold particles in a sulphide matrix such as pyrite and 
arsenopyrite (Lawrence 1990). 

Commercially, the following techniques have been adopted to bring about biooxidation 
of the desired mineral. 

• Dump/heap leaching, 

• solution mining or in situ leaching, 

• vat leaching and use of bioreactors. 

Although it has been proved technically feasible to bioleach almost any sulphide 
mineral to recover zinc, nickel, cobalt, molybdenum and a variety of other metals, 
widespread commercialisation is currently restricted by unfavourable process 
kinetics. Unlike a number of heterotrophs, Thiobacillus ferrooxidans, a chemoautotrophic 
acidophile, grows very slowly, exhibiting long generation times ranging 
between 16 and 20 h. The slowness of its growth, coupled with the difficulty of 
generating larger amounts of biomass, limits the universal industrial acceptance 
of this biotechnology. Current research efforts, therefore, have been directed towards 
finding ways and means to accelerate the growth of T. ferrooxidans with a view to 
enhancing bioleaching rates. 
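
To give a feel for what a 16 to 20 h generation time means in practice, the following minimal sketch assumes ideal exponential growth with no lag phase. The starting and target cell densities are hypothetical values chosen only for illustration.

```python
import math


def cells_after(n0, hours, generation_time_h):
    """Cell density after `hours` of ideal exponential growth."""
    return n0 * 2.0 ** (hours / generation_time_h)


def hours_to_reach(n0, target, generation_time_h):
    """Time needed to grow from density n0 to density target."""
    return generation_time_h * math.log2(target / n0)


# Hypothetical inoculum of 1e6 cells/ml grown to 1e8 cells/ml
for g in (16.0, 20.0):
    print(f"generation time {g:.0f} h: {hours_to_reach(1e6, 1e8, g):.1f} h")
```

Under these assumptions a hundred-fold increase in cell density takes several days, which is the kinetic limitation the methods discussed later try to relieve.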

In this paper different methods developed to augment biomass generation of T. 
ferrooxidans are analysed. The microbiology of the bacterium is brought out with 
respect to biochemical mechanisms. Engineering aspects of biomass growth and 
bioleaching technology are illustrated. 

2. Thiobacillus ferrooxidans 

Since Thiobacillus ferrooxidans is the most widely implicated organism in bioleaching 
operations, its microbiology is discussed in detail here. It is a motile organism having 
a single polar flagellum. It is non-spore forming, gram-negative, rod-shaped (0.5–1.5 μm), 
aerobic, acidophilic (pH 2–3.5), and essentially mesophilic (optimum temperature 
30°C, but also tolerates moderately thermophilic conditions). It divides by binary 
fission and occurs either singly or in pairs. It derives its carbon from atmospheric $\mathrm{CO_2}$ 
and obtains its energy for metabolism through the oxidation of ferrous and reduced- 
valence inorganic sulphur compounds. An electron micrograph of T. ferrooxidans with 
its characteristic morphological features is shown in figure 1. The medium used for 
growing the organisms under laboratory conditions is illustrated in table 1. 

The surface structures of cells of T. ferrooxidans differ depending on whether they are 
grown in liquid or on solid substrates. When grown in a medium containing sulphur, 
the cells develop peritrichous, rod-shaped, filamentous appendages called pili. The 
diameter of the pili can vary from 0.7 to 3.0 microns. On the other hand, when grown in 
9 K medium, the bacterium does not develop such characteristics. Hence, it can be 
inferred that pili facilitate cellular contact with the mineral surface for oxidising 
elemental sulphur to sulphuric acid (Gromova et al 1978). 

3. Mechanisms of chemoautotrophy 

3.1 Sulphur oxidation 

The details of the complex mechanisms involved in sulphur oxidation are beyond the 
scope of this article and hence are considered only in brief here. Oxidation of $\mathrm{S^{2-}}$, $\mathrm{S^0}$, 
$\mathrm{S_2O_3^{2-}}$ and $\mathrm{SO_3^{2-}}$ by the bacteria is considered. A schematic representation of 
sulphur oxidation by T. ferrooxidans is depicted in figure 2. 

Figure 1. Electron micrograph of T. ferrooxidans (20000 x). 

Sulphide is oxidised directly to sulphite: 

$$\mathrm{S^{2-} + 3H_2O \rightarrow SO_3^{2-} + 6H^+ + 6e^-} \tag{1}$$

The enzyme involved is sulphite reductase. Sulphide can also be oxidised to 
elemental sulphur (S°) with the help of enzyme, sulphide oxidase (Parker & Prisk 1953; 
Kasprzak & Steenkamp 1983). 


Table 1. Silverman and Lundgren* medium (9 K medium). 

Component           Constituent       Amount 
Nitrogen source     (NH4)2SO4         3 g/l 
Growth factors      KCl               0.1 g/l 
                    K2HPO4            0.5 g/l 
                    MgSO4·7H2O        0.5 g/l 
                    Ca(NO3)2          0.01 g/l 
Energy source       FeSO4·7H2O        44.8 g/l 
Distilled water                       1 l 
pH                                    2–2.3 

*Silverman & Lundgren (1959) 
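
The energy-source entry of table 1 can be converted to an iron concentration with a short calculation. The sketch below uses rounded standard atomic masses; the helper names are illustrative.

```python
# Rounded standard atomic masses, g/mol
ATOMIC_MASS = {"H": 1.008, "O": 15.999, "S": 32.06, "Fe": 55.845}


def molar_mass(composition):
    """Molar mass (g/mol) from a mapping of element symbol to atom count."""
    return sum(ATOMIC_MASS[element] * count for element, count in composition.items())


# FeSO4.7H2O: 1 Fe, 1 S, 4 + 7 = 11 O, 14 H
FESO4_7H2O = {"Fe": 1, "S": 1, "O": 11, "H": 14}

grams_per_litre = 44.8  # energy-source amount listed in table 1
mol_per_litre = grams_per_litre / molar_mass(FESO4_7H2O)
fe_g_per_l = mol_per_litre * ATOMIC_MASS["Fe"]

print(f"FeSO4.7H2O: {mol_per_litre:.3f} mol/l, i.e. about {fe_g_per_l:.1f} g Fe(II) per litre")
```

The result is close to 9 g of ferrous iron per litre.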

Figure 2. A schematic representation of sulphur oxidation by T. ferrooxidans. 


Sulphur: 

$$\mathrm{S^0 + O_2 + H_2O \rightarrow SO_3^{2-} + 2H^+} \tag{2}$$

Elemental or colloidal sulphur is insoluble. It is necessary that the cells establish direct 
physical contact with elemental sulphur before oxidation can occur. Therefore, sulphur 
is oxidised at the cell surface, dissolved in the cell membrane, or taken up as elemental 
sulphur by the cells. The enzyme involved has been shown to be an oxygenase 
(Karavaiko & Pivovarova 1973). 

Thiosulphate: All the species of thiobacilli studied oxidise thiosulphate to sulphate. 
Two mechanisms of oxidation are observed depending on the species of thiobacilli; the 
first involves the initial splitting of the S-S bond by the enzyme rhodanese with the 
formation of sulphite and elemental sulphur. 

$$\mathrm{S_2O_3^{2-} \rightarrow SO_3^{2-} + S^0} \tag{3}$$

Sulphur is also found to be associated with the membrane and subsequently 
converted to sulphate. This mechanism involves thiosulphate: cytochrome 
c oxidoreductase (Peck 1960; Charles & Suzuki 1966; Schedel & Truper 1979). 

$$\mathrm{S_2O_3^{2-} + 5H_2O \rightarrow 2SO_4^{2-} + 10H^+ + 8e^-} \tag{4}$$

Sulphite: Two mechanisms operate in the oxidation of sulphite depending on the 
species of thiobacilli. The AMP (adenosine-5-monophosphate)-dependent sulphite 
oxidation is cytoplasmic. 

$$\mathrm{SO_3^{2-} + AMP \xrightarrow{\text{adenophosphosulphate (APS) reductase}} APS + 2e^-} \tag{5}$$

$$\mathrm{APS + PO_4^{3-} \xrightarrow{\text{adenosine diphosphate (ADP) sulphurylase}} ADP + SO_4^{2-}} \tag{6}$$

$$\mathrm{2ADP \xrightarrow{\text{adenylate kinase}} AMP + ATP} \tag{7}$$

In the second mechanism, the enzyme is extracytoplasmic, namely, sulphite oxidase 
(sulphite cytochrome c reductase). The reaction catalysed by this enzyme is (Peck 1960; 
Bowen et al 1966; Yamanaka et al 1981; Toghrol & Southerland 1983; Lu & Kelly 
1984): 

$$\mathrm{SO_3^{2-} + H_2O \rightarrow SO_4^{2-} + 2H^+ + 2e^-} \tag{8}$$
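
The half-reactions of this section can be checked for mass and charge balance mechanically. The sketch below encodes reactions (1), (2), (4) and (8) as written in the comments; the data layout and function names are illustrative choices, not part of any established library.

```python
from collections import Counter

# Each species is (element counts, charge); an electron carries charge only.
H2O = ({"H": 2, "O": 1}, 0)
O2 = ({"O": 2}, 0)
H_PLUS = ({"H": 1}, +1)
ELECTRON = ({}, -1)
S0 = ({"S": 1}, 0)
SULPHIDE = ({"S": 1}, -2)
SULPHITE = ({"S": 1, "O": 3}, -2)
SULPHATE = ({"S": 1, "O": 4}, -2)
THIOSULPHATE = ({"S": 2, "O": 3}, -2)


def totals(side):
    """Sum element counts and net charge over (coefficient, species) terms."""
    elements, charge = Counter(), 0
    for coeff, (composition, q) in side:
        for element, count in composition.items():
            elements[element] += coeff * count
        charge += coeff * q
    return elements, charge


def is_balanced(left, right):
    """True when both sides carry the same atoms and the same net charge."""
    return totals(left) == totals(right)


REACTIONS = {
    # (1) S2- + 3H2O -> SO3(2-) + 6H+ + 6e-
    "(1)": ([(1, SULPHIDE), (3, H2O)], [(1, SULPHITE), (6, H_PLUS), (6, ELECTRON)]),
    # (2) S0 + O2 + H2O -> SO3(2-) + 2H+
    "(2)": ([(1, S0), (1, O2), (1, H2O)], [(1, SULPHITE), (2, H_PLUS)]),
    # (4) S2O3(2-) + 5H2O -> 2SO4(2-) + 10H+ + 8e-
    "(4)": ([(1, THIOSULPHATE), (5, H2O)], [(2, SULPHATE), (10, H_PLUS), (8, ELECTRON)]),
    # (8) SO3(2-) + H2O -> SO4(2-) + 2H+ + 2e-
    "(8)": ([(1, SULPHITE), (1, H2O)], [(1, SULPHATE), (2, H_PLUS), (2, ELECTRON)]),
}

for label, (left, right) in REACTIONS.items():
    print(label, "balanced" if is_balanced(left, right) else "NOT balanced")
```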


3.2 Ferrous oxidation 


A schematic representation of ferrous iron oxidation by T. ferrooxidans is given in 
figure 3 (Pringsheim 1949; Lundgren et al 1974; Black et al 1989). No protons are 
produced in the enzyme-catalysed oxidation of ferrous iron: 

$$\mathrm{Fe^{2+} \rightarrow Fe^{3+} + e^-} \tag{9}$$

The ferric iron formed undergoes abiotic hydrolytic reactions which result in the 
release of protons on the extracytoplasmic side. The most significant reaction at pH 2.0 
is: 

$$\mathrm{Fe^{3+} + 3H_2O \rightarrow Fe(OH)_3 + 3H^+} \tag{10}$$

The cell membrane associated electron transporters involved in iron oxidation 
include iron oxidase, cytochrome c and a terminal oxidase, cytochrome a. The soluble 
electron transport components include cytochrome c and a blue copper protein, 
rusticyanin. An iron cytochrome c reductase or iron oxidation st[...] 
been reported by several authors (Blaylock & Nason 1963; Yates [...] 
et al 1967; Sugio et al 1981). It is indicated by several observations [...] 
oxidation takes place outside the cell membrane, that is, in th[...] 
inferred from the fact that iron would be rapidly autooxidised at cytosolic pH values 
(pH 6.5) and that ferric iron formed at this pH would be insoluble. 

Figure 3. A schematic representation of ferrous oxidation by T. ferrooxidans. 


4. Mechanisms of mineral oxidation 

Mechanisms involved in biooxidation of minerals are very complex. When the 
organisms interact with mineral substrates in an acid medium, a number of reactions 
occur. The microbe-mineral interactions influence the rate of mineral dissolution by 
various chemical, biochemical and electrochemical factors. The main leaching mechanisms 
involved are indirect, direct and electrochemical oxidation. 

4.1 Indirect leaching 

In this process, the bacteria do not attach to the mineral surfaces. Indirect leaching 
involves biological regeneration of ferric sulphate. 

$$\mathrm{2FeSO_4 + \tfrac{1}{2}O_2 + H_2SO_4 \rightarrow Fe_2(SO_4)_3 + H_2O} \tag{11}$$

The ferric sulphate produced is a strong oxidant and is capable of dissolving a wide 
variety of sulphide minerals. For example, for chalcopyrite, 

$$\mathrm{CuFeS_2 + 2Fe_2(SO_4)_3 \rightarrow CuSO_4 + 5FeSO_4 + 2S^0} \tag{12}$$

The elemental sulphur formed in the reaction can be converted to sulphuric acid by 
T. ferrooxidans. 

$$\mathrm{2S^0 + 3O_2 + 2H_2O \rightarrow 2H_2SO_4} \tag{13}$$

The sulphuric acid generated not only maintains the pH at levels favourable to the 
bacteria but also leaches a variety of copper oxide minerals like azurite 
[$\mathrm{Cu_3(OH)_2(CO_3)_2}$], chrysocolla ($\mathrm{CuSiO_3 \cdot 2H_2O}$) and tenorite (CuO) very effectively. 
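
The stoichiometry of reaction (12), in which one mole of chalcopyrite consumes two moles of ferric sulphate and releases one mole of copper, translates into mass ratios as sketched below. Atomic masses are rounded standard values and the function names are illustrative.

```python
# Rounded standard atomic masses, g/mol
ATOMIC_MASS = {"Cu": 63.546, "Fe": 55.845, "S": 32.06, "O": 15.999}


def molar_mass(composition):
    """Molar mass (g/mol) from a mapping of element symbol to atom count."""
    return sum(ATOMIC_MASS[element] * count for element, count in composition.items())


M_CHALCOPYRITE = molar_mass({"Cu": 1, "Fe": 1, "S": 2})        # CuFeS2
M_FERRIC_SULPHATE = molar_mass({"Fe": 2, "S": 3, "O": 12})     # Fe2(SO4)3


def ferric_sulphate_per_kg_chalcopyrite():
    """kg of Fe2(SO4)3 consumed per kg of CuFeS2 (2 mol per mol)."""
    return 2.0 * M_FERRIC_SULPHATE / M_CHALCOPYRITE


def copper_per_kg_chalcopyrite():
    """kg of Cu brought into solution per kg of CuFeS2 fully dissolved."""
    return ATOMIC_MASS["Cu"] / M_CHALCOPYRITE


print(f"{ferric_sulphate_per_kg_chalcopyrite():.2f} kg ferric sulphate per kg chalcopyrite")
print(f"{copper_per_kg_chalcopyrite():.3f} kg copper per kg chalcopyrite")
```

The large oxidant demand per unit of copper is one reason the bacterial regeneration of ferric sulphate by reaction (11) matters for the indirect mechanism.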

4.2 Direct leaching 

T. ferrooxidans also leaches mineral sulphides by attachment to mineral surfaces. This 
process can be described by the following reaction, 

$$\mathrm{MS + 2O_2 \rightarrow MSO_4} \tag{14}$$

where M is a bivalent metal. Incubation of T. ferrooxidans with the mineral substrates 
results in oxygen consumption and metal solubilization. 

Since iron sulphides are invariably present in the natural leaching environments, 
probably both direct and indirect leaching mechanisms occur simultaneously. 
Bacterial attachment was found to be governed by particle size of the mineral, period of 
incubation and by stationary or agitation conditions used during incubation 
(Natarajan 1992). Some of the researchers attribute enhanced mineral dissolution to the 
direct mechanism, while others suggest that bacterial assistance in the enhancement of 
mineral dissolution is due to constant supply of ferric ions. The attachment of T. 
ferrooxidans to a number of sulphide minerals has been studied quantitatively by 
estimating its protein and nitrogen concentrations. 

Attachment of T. ferrooxidans on to pyrite mineral was observed to be influenced by 
reducing particle size and increasing incubation periods. It has also been observed that 
increased bacterial attachment to pyrite mineral surfaces promoted iron dissolution 
(Murthy & Natarajan 1992). 

4.3 Electrochemical leaching 

Principles of electrochemistry are applicable to mineral dissolution in leaching processes 
(Natarajan 1990). Many natural minerals behave as electrodes in the presence of 
a leaching medium. Thus in leaching systems containing more than one mineral, 
galvanic interactions essentially come into play. When two sulphide minerals establish 
contact in a leaching medium, a galvanic cell will be formed; the more active mineral in 
the couple will undergo corrosion while the nobler (less active) one will be cathodically 
protected. Considering the example of a chalcopyrite-pyrite couple, chalcopyrite has 
the lower electrical rest potential and becomes the anode while pyrite behaves as 
a cathode. As a result, the chalcopyrite phase undergoes rapid dissolution while the 
pyrite phase remains essentially unaffected. T. ferrooxidans further catalyses the above 
electrochemical reactions by continuously oxidising the film of elemental sulphur 
formed on mineral surfaces which would otherwise form a physical hindrance to 
diffusion of copper and iron salts away from the reacting phase. Prediction of the 
electrochemical behaviour of sulphide minerals in multiple combinations is more 
difficult. 

Typical reactions illustrating galvanic interaction mechanisms are given below: 


$$\mathrm{ZnS \xrightarrow{\textit{T. ferrooxidans}} Zn^{2+} + S^0 + 2e^-} \tag{15}$$
(anodic dissolution of sphalerite) 

$$\mathrm{CuFeS_2 \xrightarrow{\textit{T. ferrooxidans}} Cu^{2+} + Fe^{2+} + 2S^0 + 4e^-} \tag{16}$$
(anodic dissolution of chalcopyrite) 

$$\mathrm{Fe^{2+} \xrightarrow{\textit{T. ferrooxidans}} Fe^{3+} + e^-} \tag{17}$$
(bacterial oxidation of dissolved ferrous iron) 

$$\mathrm{S^0 \xrightarrow{\textit{T. ferrooxidans}} SO_4^{2-}} \tag{18}$$
(bacterial oxidation of elemental sulphur) 

$$\mathrm{O_2 + 4H^+ + 4e^- \rightarrow 2H_2O} \tag{19}$$
(cathodic reduction of oxygen on the nobler mineral substrate) 


Thus, the galvanic dissolution of an active mineral (chalcopyrite, sphalerite etc.) 
when contacted with a nobler mineral like pyrite, in the presence of T. ferrooxidans, is 
increased several fold (Natarajan 1988, 1992; Jyothi et al 1989). 
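
The rule used above, that the mineral with the lower rest potential becomes the anode and dissolves while the nobler one is cathodically protected, can be sketched as follows. The rest potential values are illustrative placeholders only (not measured data from the cited studies); real values depend on the leaching medium.

```python
# Illustrative rest potentials in volts; placeholder values, not measurements
REST_POTENTIAL_V = {
    "pyrite": 0.63,
    "chalcopyrite": 0.52,
    "sphalerite": -0.24,
}


def galvanic_roles(mineral_a, mineral_b, potentials=REST_POTENTIAL_V):
    """Return (anode, cathode) for two minerals in electrical contact.

    The mineral with the lower rest potential is the more active one:
    it acts as the anode and undergoes dissolution.
    """
    e_a, e_b = potentials[mineral_a], potentials[mineral_b]
    if e_a == e_b:
        raise ValueError("equal rest potentials: no galvanic couple is formed")
    return (mineral_a, mineral_b) if e_a < e_b else (mineral_b, mineral_a)


anode, cathode = galvanic_roles("chalcopyrite", "pyrite")
print(f"anode (dissolves): {anode}; cathode (protected): {cathode}")
```

As the text notes, combinations of more than two minerals are harder to predict, so a pairwise rule of this kind is only a first approximation.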


5. Current engineering applications 

Microbial leaching is rapidly becoming an important process tool for the treatment of 
especially low and lean grade ore reserves. The role of microorganisms in the leaching 
of a number of minerals has long been established. The microorganisms, in most 
cases, are indigenous to the various mining operations. From a metallurgical point 
of view, as the grade of the ore becomes lower and lower, the technology for metal 
extraction becomes more and more difficult and expensive. The mineral industries are 
seeking development of modified extraction processes and lowering of costs. 
Dump/heap leaching, in situ solution mining and bioreactor leaching with the help of 
T. ferrooxidans have been attractive innovations in this direction. 

5.1 Copper 

On a commercial scale, dump/heap leaching operations at the mine site for lean and 
waste mined copper ores are being employed in many parts of the world. Biological 
copper leaching is practised in many countries such as USA, Russia, Chile, Peru, 
Australia, Spain, Canada and Mexico. 

Engineering layouts for leaching operations all over the world are remarkably 
uniform. Essentially, copper ore mined from open pits is segregated. Higher grade 
material is concentrated to produce a feed for smelting, while the lower-grade ore is 
subjected to leaching. The ore is piled up until a dump/heap of suitable dimensions 
accumulates; the top of the dump is levelled and the leach solution is then flooded or 
sprayed onto the dump. Bacterial colonization occurs mainly in the top layers. The 
temperature generally rises to 80°C-90°C in the central regions of the dump due to 
exothermic reactions. In this case indirect leaching by ferric sulphate prevails. Leach 
solutions enriched with copper exit at the base of the dump and are conveyed to 
a central recovery facility. The barren solution is then recycled to the leach dumps. 

In situ leaching is yet another popular technique in this direction. It facilitates 
extraction of deeper low value mineral reserves without the need for mining and 
expensive operations. It involves breaking the ore body with powerful explosives to 
make the deposit penetrable to solution. Leach solution and air are then injected under 
pressure. The resulting copper-enriched solutions are then recovered below the ore 
body. Copper is recovered by conventional methods. In situ mining eliminates environmental 
pollution problems associated with large scale mining (Ehrlich 1963; Duncan 
et al 1964; Brierley & Lockwood 1977). 

5.2 Uranium 

Uranium leaching using bacteria has been implemented since the 1960s in Canada. The 
process involves intermittent spraying or flooding of worked-out stopes and tunnels of 
underground mines with water or dilute sulphuric acid solution at regular intervals. 
The activity of T. ferrooxidans is limited to oxidation of pyrite and ferrous ions because 
it does not directly interact with uranium minerals. Uranium leaching proceeds 
according to the following reactions: 

$$\mathrm{UO_2 + Fe_2(SO_4)_3 + 2H_2SO_4 \rightarrow UO_2(SO_4)_3^{4-} + 2FeSO_4 + 4H^+} \tag{20}$$

$$\mathrm{UO_3 + 3H_2SO_4 \rightarrow UO_2(SO_4)_3^{4-} + H_2O + 4H^+} \tag{21}$$

Uranium bearing solution drains to the lower portions of the mine and accumulates 
in sumps. The solution is then pumped to the surface for uranium recovery (Harrison 
et al 1966; Tuovinen & Kelly 1974; Ferroni et al 1986; McCready 1988). 

5.3 Coal desulphurization 

Although bioprocessing has not been applied to coal on an industrial scale, a 
number of possible applications have been examined in the laboratory and on pilot 
scales. 

A preliminary economic analysis has shown that microbial desulphurization compares 
favourably with alternative chemical coal-cleaning methods. The parameters 
governing rates of pyrite dissolution in coal are particle size, pulp density, solution pH 
and temperature, inoculum size and characteristics of the leaching bacteria. In some 
instances, leaching rates are enhanced by supplemental nutrients and carbon dioxide. 
Fine coal slurries could be subjected to oxidation by T. ferrooxidans at a pH of 2.0-2.5 
and temperatures in the range of 25°C-35°C. 

The use of T. ferrooxidans in coal desulphurization involves surface modification of 
the pyrite. It has been observed that pyrite is more easily separated from coal during 
froth flotation following an initial exposure to these bacteria, because pyrite is rendered 
more hydrophilic by selective attachment of the bacteria; flotability increases with 
increase of hydrophobic character. Coal cleaning by this method would avoid the long 
retention period required for microbial dissolution of pyrite (Dugan & Apel 1978; 
Kargi 1982; Monticella & Finnerty 1985; Huber et al 1986). 

5.4 Gold 

The most important application of biotechnology to gold recovery is the treatment of 
refractory ores and concentrates to increase gold dissolution during cyanide leaching 
(Natarajan 1993). In many such ores, finely divided gold is encapsulated in pyrite or 
arsenopyrite minerals and thus cannot be dissolved by the cyanide solutions unless the 
ore is very finely ground. An alternative to grinding is to use bacteria to selectively 
dissolve enough of the sulphides to expose the gold, and then to leach with cyanide by 
the normal process. The use of bacteria keeps the cost of sulphide mineral dissolution at 
a reasonable level, and since it is only necessary to dissolve a fraction of the pyrite to 
expose the gold, the leaching time can be kept to a few days. 

Large scale application of bioleaching of refractory gold ores and concentrates 
before cyanidation has been demonstrated at the Fairview mine in South Africa in 
1986. This industrial plant treats 18 t/day of high grade gold concentrates. The San 
Bento plant in Brazil was commissioned in 1990 to treat 300 t/day of low grade 
concentrate. A large number of commercial biotreatment plants have since been 
commissioned for processing of refractory gold concentrates in Australia and Ghana 
(Livesey-Goldblatt et al 1977, 1983; Griffin & Luinstra 1989; Hackl et al 1989; Morin 
& Ollivier 1989; Lawrence 1990; Torma & Oolman 1992). 


6. Methods for accelerating growth of T. ferrooxidans for faster biooxidation 

T. ferrooxidans is a very slow-growing organism. Its generation time varies between 16 
and 20 h. Therefore obtaining a large biomass for industrial purposes using the classical 
methods is very time-consuming and complicated. Under normal growth conditions in 
9 K medium, it takes up to 30-40 h to obtain a cell mass of about $10^8$ cells/ml. Hence 
there is a quest for developing methods which would produce large amounts of biomass 
in a shorter time, with a view to enhancing leaching rates. Some methods developed for 
the enhancement of the biomass of T. ferrooxidans are illustrated below. 

6.1 Bacterial film oxidation (BACFOX process) 

This method is proposed to raise the number of bacteria in the medium during heap 
and underground leaching. The method comprises the following procedures. 

Cells of T. ferrooxidans are placed in the form of a film on a corrugated surface of 
bioactive material. The corrugated material coated with the film of bacteria is 
submerged in ferrous sulphate solution saturated with air. Oxidation of $\mathrm{Fe^{2+}}$ takes 
place. Bacteria bind with the precipitated jarosite; the film containing bacteria and 
jarosite may be grown on various materials like glass, plastic etc. Livesey-Goldblatt 
et al (1977) reported that the best results are obtained in the media where bacterial film 
was fixed on corrugated plastic. Maximum specific rate of ferrous to ferric oxidation 
was observed to be 7.5 g/h/m² of bacterial film. A schematic representation of the 
BACFOX process is depicted in figure 4. 

Figure 4. Bacterial film oxidation (BACFOX). 
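
The reported maximum specific rate of 7.5 g of ferrous iron oxidised per hour per square metre of film allows a first estimate of the film area a given duty would need. The sketch below assumes the film operates at that maximum rate throughout; the target throughput is a hypothetical figure.

```python
MAX_SPECIFIC_RATE_G_PER_H_M2 = 7.5  # maximum rate quoted for the BACFOX film


def film_area_m2(target_g_fe_per_h, specific_rate=MAX_SPECIFIC_RATE_G_PER_H_M2):
    """Film area needed to oxidise a target mass of ferrous iron per hour."""
    if specific_rate <= 0:
        raise ValueError("specific rate must be positive")
    return target_g_fe_per_h / specific_rate


# Hypothetical duty: 1 kg of ferrous iron oxidised per hour
print(f"{film_area_m2(1000.0):.0f} m2 of bacterial film")
```

Since real films rarely sustain their maximum rate, an estimate of this kind is a lower bound on the area required.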


6.2 Biomass production by using electrical energy 

Another way to increase the cell concentration in the culture and at the same time to 
overcome all the above mentioned drawbacks of the classical methods is offered by the 
method proposed by Kinsel & Umbreit (1964). Several researchers have proposed 
similar methods (Kovrov et al 1978; Denisov et al 1980; Yunker & Radovich 1985; 
Natarajan 1992b). The main feature of this method is combining two processes, namely, 
the microbiological oxidation of $\mathrm{Fe^{2+}}$ and electrochemical reduction of $\mathrm{Fe^{3+}}$ to $\mathrm{Fe^{2+}}$. 
The method comprises electrochemical reduction of bacterially generated ferric ions 
directly in the culture. Figure 5 shows a scheme of reactions taking place in an 
electrochemical cell that provides the energy source for T. ferrooxidans. The energy for 
cell growth comes from the electrochemical cell's electrodes, while the energy-providing 
substrate (ferrous ion) serves merely as a carrier of energy from the cathode to the 
bacterial cell. The $\mathrm{Fe^{2+}}$ is not supplied from outside but is formed in the culture on the 
cathode surface. Figure 6 shows a typical laboratory experimental set-up of electrochemical 
instrumentation. 

Figure 5. A schematic representation of biomass production using applied DC 
potentials. 

Figure 6. Experimental set-up of electrochemical cell. 

The electrochemical reduction of ferric ion is coupled with continuous cultivation of 
T. ferrooxidans. The concept involves the correlation of two mutually opposing 
reactions, namely, the bacterial oxidation of $\mathrm{Fe^{2+}}$ and the electrochemical reduction of 
$\mathrm{Fe^{3+}}$. Most useful for practical application is the method of maintaining a constant 
cathode potential. The optimal bacterial biomass concentration during continuous 
cultivation ranges from 5 to 10 g/l (dry weight). 
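
Because the reduction of ferric to ferrous iron is a one-electron step, Faraday's law gives the cell current needed to regenerate ferrous iron at the rate the culture oxidises it. The sketch below assumes 100% current efficiency, which a real cell will not reach, and the regeneration rate used in the example is hypothetical.

```python
FARADAY_C_PER_MOL = 96485.33212  # Faraday constant
M_FE_G_PER_MOL = 55.845          # molar mass of iron


def regeneration_current_a(fe_g_per_h, electrons_per_ion=1):
    """Current (A) required to reduce a given mass of Fe(III) to Fe(II) per hour."""
    mol_per_s = fe_g_per_h / M_FE_G_PER_MOL / 3600.0
    return electrons_per_ion * FARADAY_C_PER_MOL * mol_per_s


# Hypothetical culture oxidising 1 g of ferrous iron per hour
print(f"{regeneration_current_a(1.0):.3f} A")
```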

Application of negative DC potentials in the range of -500 mV to -1000 mV to 
a bacterial culture containing ferric ion effectively converts ferric ion to the ferrous 
state, promoting bacterial activity and growth. Under a negative applied potential of 
-500 mV, the cell number of T. ferrooxidans increased from $2.5 \times 10^7$ cells/ml to about 
$1.3 \times 10^8$ cells/ml within a period of about 60 h (Natarajan 1992b). Subsequent 
impression of -500 mV to a fully grown culture containing $1.3 \times 10^8$ cells/ml promoted 
significant growth of the bacteria (Natarajan 1992). It has been reported that the 
generation time of T. ferrooxidans can be reduced from 16 h to 10 h with a ten-fold 
increase in biomass when an applied potential of -500 mV is maintained (figure 7). 
Electrobioleaching also enables faster and selective dissolution of the desired mineral 
phase from complex sulphides, unlike the case with only bioleaching, as depicted in 
figure 8. Zinc dissolution from binary mixtures containing either pyrite or chalcopyrite 
in addition to sphalerite was observed to be the highest at an applied potential of 
-500 mV in the presence of T. ferrooxidans (Natarajan 1992b). 
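
An apparent generation time can be backed out of any pair of cell counts, assuming purely exponential growth between the two measurements. The sketch below applies this to the counts quoted above; the result is an average over the whole 60 h interval, so it would be expected to exceed an exponential-phase generation time if lag or stationary periods are included.

```python
import math


def apparent_generation_time_h(n_start, n_end, elapsed_h):
    """Average doubling time implied by two cell counts elapsed_h apart."""
    if n_end <= n_start:
        raise ValueError("cell count must increase to define a doubling time")
    return elapsed_h * math.log(2.0) / math.log(n_end / n_start)


# Counts quoted in the text: 2.5e7 to 1.3e8 cells/ml in about 60 h
g = apparent_generation_time_h(2.5e7, 1.3e8, 60.0)
print(f"apparent generation time: {g:.1f} h")
```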

Figure 7. Variation in number of cells per ml with time in the 
growth of T. ferrooxidans. 


6.3 Oxidation in packed-bed reactors 


Bacterial biomass can also be enhanced by using T. ferrooxidans in a fixed-film 
bioreactor containing glass beads, activated carbon particles or ion-exchange resin 
(Grishin & Tuovinen 1988). Both batch and continuous flow modes of operation have 
been studied for the biological oxidation of ferrous sulphate to ferric sulphate. Various 
packed-bed reactors have been designed to improve the rate of iron oxidation. Three 
parameters were taken into consideration, namely, providing a larger surface area for 
attachment of T. ferrooxidans to the support matrices, reducing the loss of biomass, and 
using a low pH (between 1.3 and 1.5) to provide more extensive adsorption of the 
bacteria and also to eliminate ferric precipitation. 

Figure 8. Zinc dissolution from binary mixtures containing pyrite or chalcopyrite with 
sphalerite under applied potential. 

The activated-carbon packed-bed reactor had the highest level of biomass, amounting 
to 32 mg of protein per ml of interstitial liquid or 128 μg/cm² of matrix surface. 
The ability of the glass beads to retain biomass was about 50 times less. 
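
The two biomass figures quoted for the activated-carbon reactor refer to the same biomass expressed per unit liquid volume and per unit matrix area, so together they imply a matrix surface area per millilitre of interstitial liquid. The sketch below makes that arithmetic explicit and applies the quoted factor of about 50 to the glass beads; the variable names are illustrative.

```python
protein_ug_per_ml = 32_000   # 32 mg protein per ml of interstitial liquid
protein_ug_per_cm2 = 128     # 128 micrograms protein per cm2 of matrix surface

# Matrix surface area associated with each ml of interstitial liquid
implied_area_cm2_per_ml = protein_ug_per_ml / protein_ug_per_cm2

# Glass beads are reported to retain roughly 50 times less biomass
glass_bead_protein_mg_per_ml = 32 / 50

print(f"implied matrix area: {implied_area_cm2_per_ml:.0f} cm2 per ml")
print(f"glass beads: about {glass_bead_protein_mg_per_ml:.2f} mg protein per ml")
```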

6.4 Growth in solid medium 

Growing T. ferrooxidans on a solid medium so far has been a very time-consuming and 
tedious process; the solidifying agent, agar-agar, neither forms a stable gel when 
sterilised at low pH values nor properly solidifies due to hydrolysis. To overcome these 
problems, a new medium has been designed by Khalid et al (1993). They have used the 
bacterial polysaccharide ‘Gellan Gum’ or ‘Gelrite’ as the solidifying agent for acidophilic 
thiobacilli. This medium has been devised for estimating the growth of T. ferrooxidans 
on the solid substrate. The plating efficiency was determined to be 92.1 ± 5.8%. Dark 
brown, circular, well-differentiated colonies developed within 72-96 h. The cell count 
was approximately $7 \times 10^6$ cells/ml for the ATCC 13661 strain of T. ferrooxidans. Dry 
biomass of 650 mg/l can be obtained by this method, which corresponds to 320 mg total 
protein/l. A linear relationship has been found in which 0.5 g dry 
mass/l was equivalent to $2.8 \times 10^6$ cells/ml of ATCC 13661. This medium can be 
employed to purify cultures of T. ferrooxidans strains which have frequently been 
reported to have been associated with other acidophiles. The high plating efficiency 
also indicates that this method can be used for selection of mutants resistant to toxic 
metals and also for genetic investigation of T. ferrooxidans. 

6.5 Genetic manipulation for development of special strains 

The necessity to develop special strains of T. ferrooxidans which possess both high 
metal and temperature tolerance as well as better leaching capability becomes very 
evident because accumulation of toxic metal ions such as arsenic, copper and zinc 
hinders bacterial activity. Development of metal-tolerant, preadapted bacterial strains 
is thus necessary to obtain efficient bacterial activity during leaching. Adapted strains 
are found to be more efficient in gold liberation compared to the unadapted wild 
strains. One of the principal objectives of genetic manipulation is to develop techniques 
to introduce the desired characteristics into the leaching bacteria. Attempts have 
already been made to detect the plasmid DNA in T. ferrooxidans. Methods of plasmid 
isolation and identification from T. ferrooxidans have been worked out. The possibility 
of cloning them into Escherichia coli has also been evaluated. However, the development 
of genetically engineered T. ferrooxidans is still a distant prospect (Groudeva et al 1980; 
Cox & Boxer 1986; Rawlings et al 1986; Yates & Holmes 1987; Holmes et al 1988; Yates 
et al 1988; Holmes & Yates 1990). 

Much progress has been made in understanding the pathways of ferrous and sulphur 
oxidation, energy coupling, and carbon assimilation. However, physiological descriptions 
do not at present appear to be leading toward a coherent taxonomic analysis of 
the thiobacilli. 





The ribosomal ribonucleic acids (rRNA) of T. ferrooxidans are independent of 
growth conditions and metabolic status, so they are useful for comparing even 
physiologically disparate species. 5S rRNA is sufficiently conserved across species 
that sequence homologies are evident even between organisms that have 
immeasurably low overall deoxyribonucleic acid (DNA) homology. Mixed or contaminated 
cultures are readily revealed by the detection of more than one type of 5S rRNA. 
The rRNA sequence, therefore, provides incisive criteria for the classification of new 
strains. A phylogenetic framework is expected to offer a more directed 
approach toward the comparative biochemistry of these organisms. Since there is 
presently a great deal of interest in ‘engineering’ these economically important bacteria, 
knowledge of their natural evolutionary relationships should be of use in constructing 
fruitful recombinants. 

Thus, DNA and rRNA analyses are being used to classify strains into different 
genomic groups within the species. This will provide useful selection of genotypes and 
phenotypes best suited for a particular application. The twenty-three strains of T. ferrooxidans 
examined by Harrison (1982) were genomically diverse. 

A strategy for obtaining recombinants by genetic engineering is presented here. 

(i) The chromosomal and plasmid DNA from T. ferrooxidans are isolated by standard 
procedures. 

(ii) Both the chromosomal and plasmid DNA are subjected to restriction digestion 
with the help of restriction enzymes to cut the DNA into fragments. 

(iii) With advanced molecular biology techniques the gene of interest is selected and 
isolated. (Explanation of the procedures is beyond the scope of this review.) 

(iv) Once the gene of interest is isolated, it is incorporated into an Escherichia coli 
plasmid. This plasmid is a small circular DNA. 

(v) The circular plasmid DNA is cut with a restriction enzyme to form a linear strand. 

(vi) The gene of interest is then added to the linear strand of DNA. The gene then gets 
attached to the linear plasmid to form a recombinant molecule comprising part of the 
plasmid DNA, hence the term recombinant DNA. 

(vii) The recombinant DNA molecule is then sealed to reform a circle with an enzyme 
called ligase. 

(viii) The recombinant plasmid is then put back into E. coli. This process is called 
transformation. 

(ix) The E. coli cell is then allowed to divide many times in the culture media, producing 
a large number of colonies, each identical to the starting bacterium. Since the 
recombinant plasmid is also present in all the progeny bacteria, the inserted gene of 
interest is said to have been cloned. 

(x) After cloning, the recombinant plasmids are then isolated from the host E. coli, 
giving amplified copies of a single gene of interest. The characteristics of this gene are 
then expressed. 

A schematic representation of this method is depicted in figure 9. 
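
The cut-and-join logic of steps (ii) and (v)-(ix) can be illustrated with a toy string model. The sketch below is only an illustration under stated assumptions: the DNA sequences are invented, the function names are hypothetical, and EcoRI (recognition site GAATTC, cut after the first G) is chosen here simply as a familiar restriction enzyme; the review does not name one. Selection of the gene (step iii) and its expression (step x) are not modelled.

```python
# Toy string model of restriction digestion, ligation and cloning.
# All sequences are made up for illustration.
ECORI_SITE = "GAATTC"  # EcoRI recognition site; the enzyme cuts after the first G


def cut_positions(dna: str, site: str = ECORI_SITE, offset: int = 1) -> list[int]:
    """Return every index at which the enzyme would cut a linear strand."""
    positions = []
    start = dna.find(site)
    while start != -1:
        positions.append(start + offset)
        start = dna.find(site, start + 1)
    return positions


def digest(dna: str) -> list[str]:
    """Step (ii): cut a linear strand into restriction fragments."""
    bounds = [0, *cut_positions(dna), len(dna)]
    return [dna[a:b] for a, b in zip(bounds, bounds[1:])]


def linearize(plasmid: str) -> str:
    """Step (v): open a circular plasmid that carries exactly one site."""
    cuts = cut_positions(plasmid)
    if len(cuts) != 1:
        raise ValueError("this sketch expects exactly one restriction site")
    return plasmid[cuts[0]:] + plasmid[:cuts[0]]


def ligate(linear_plasmid: str, insert: str) -> str:
    """Steps (vi)-(vii): join vector and insert; the result is read as a circle."""
    return linear_plasmid + insert


def clone(recombinant: str, generations: int) -> list[str]:
    """Step (ix): every cell division doubles the number of plasmid copies."""
    return [recombinant] * (2 ** generations)


chromosome = "TTT" + ECORI_SITE + "ATGCCC" + ECORI_SITE + "AAA"  # hypothetical
gene_fragment = digest(chromosome)[1]             # fragment carrying the gene
vector = linearize("CCG" + ECORI_SITE + "TTAA")   # hypothetical E. coli plasmid
recombinant = ligate(vector, gene_fragment)
copies = clone(recombinant, generations=3)
print(gene_fragment, recombinant, len(copies))
```

Because vector and insert are cut by the same enzyme, their ends are compatible, which is why the joined string regenerates the recognition site at each junction.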

7. Summary 

Biological processing of minerals and metals is a rapidly growing, diverse field whose 
importance will become considerable in the near future. The use of genetic engineering 
techniques will allow organisms to be precisely tailored to their applications and thus 
improve their effectiveness. 

Figure 9. A strategy for obtaining recombinants by genetic engineering. 

Further development of bioleaching technology is aimed at extending the range of 
metals and ores that may be processed. New organisms, especially thermophilic 
bacteria, may accelerate the rates of metal extraction. There is a need for more 
fundamental research to determine the factors which limit the rate of microbial ore 
leaching, the characteristics of the ore bioleaching environment, the interactions of 
microorganisms and their effect on ore leaching. 

Abstract. The three-dimensional flow field around a circular cylinder in 
a quasi-equilibrium scour hole under a clear water regime has been 
investigated experimentally. The distribution patterns of tangential, radial 
and vertical components of velocity upstream and longitudinal component 
of velocity downstream of a cylinder have been presented in this paper. The 
flow simulation results show the existence of a helical motion of fluid 
around the cylinder. 

Keywords. Three-dimensional flow; hydrodynamics; vortex flow; scouring; 
flow around cylinder; velocity profiles. 


1. Introduction 

A cylinder embedded vertically in a loose bed obstructs the flow, producing a three-dimensional 
separation of the boundary layer flow, and is conducive to the development of a 
localized scour. As a result of the interaction between the two-dimensional approaching 
boundary layer flow and the adverse pressure gradient set by the cylinder, the problem 
becomes complex. Detailed study of the general flow pattern for such a case has been 
reported by Hjorth (1975), Melville (1975), and Dargahi (1987). The components of the 
flow field are the downflow, horseshoe vortex, wake vortex and bow wave, as shown 
schematically in figure 1. The approaching flow velocity gradually falls to zero along the 
vertical line on the front face of the cylinder. Since the approaching velocity 
decreases downward along the vertical, from a maximum at the free surface 
to zero at the bed, the stagnation pressure also decreases downward. Thus, 
the flow is driven down close to the front of the cylinder. The horseshoe vortex, developed 
due to the separation of flow at the upstream edge of the scour hole, rolls up to form a large 
helical flow which migrates downstream along the sides of the cylinder (Raudkivi 1986). 
A bow wave is formed at the upstream free surface adjacent to the cylinder, rotating in 
the opposite direction of the horseshoe vortex. The stagnation pressure also accelerates 
the flow past the cylinder, resulting in flow separation at the sides of the cylinder and 
creating the wake with the cast-off vortices at the interfaces with the main flow. Enhanced 
velocities in front of and to the sides of a cylinder scour the loose bed and carve out 
a U-shaped scour hole with the open ends of the U directed downstream. 

Depending on the supply of sediment by the approaching flow, localized scour 
occurs in two ways. Clear water scour refers to the situation where no sediment is 
supplied by the approaching flow into the scour zone. Live bed scour, in contrast, 
occurs where there is a general transport of sediment by the approaching flow to the 
zones subjected to scour. Work on local scouring has been 
comprehensively reviewed by Breusers et al (1977) and Dargahi (1982). Although the 
characteristic features of the general flow structure around a circular cylinder in the 
scour zone have been documented, the nature of the distribution patterns of the 
velocity components seems to have received little attention so far. 

The aim of the present experimental investigation is to determine the behavioural 
patterns of the distribution of velocity components of the three-dimensional vortex 
flow field around a circular cylinder in a quasi-equilibrium scour hole under a clear 
water regime. The experimental data are utilized to obtain the flow simulation results. 

2. Experimentation 

2.1 Experimental set-up and programme 

The experiments were conducted in a 1.3 m long, 0.81 m wide and 0.25 m deep test 
section located 6 m away from the inlet of a 10 m long flume. A false floor was 
constructed along the length of the flume 0.25 m above the bottom, and the uniformly 
graded fine sand of mean diameter ($d_{50}$) 0.58 mm used for local scouring was glued 
along its surface. The geometric standard deviation ($\sigma_g$) of the particle size distribution, 
given by $(d_{84}/d_{16})^{1/2}$, was 1.31, which is less than the limit of 1.4 for a uniformly graded sand 
(Dargahi 1987). A perspex circular cylinder of diameter ($b$) 0.076 m was placed at the 
middle of the test section. Experiments were run under clear water scour at an 
approaching flow depth ($h$) of 0.05 m and a mean approaching velocity ($U$) of 0.261 m/s. 
Although a ratio $h/b$ up to 3 was found to be desirable for severe scouring, in this study 
$h/b$ was set to 0.66 to keep the flow in the flume controlled. The flow field to be 
measured would, however, remain similar for higher values of $h/b$. The semi-logarithmic 
plot of the velocity at 2 m upstream from the cylinder centre, where the flow was found to be free, was 
used to determine the shear velocity of the approaching flow ($u_*$). The magnitude of 
$u_*$, found to be 0.0169 m/s, was less than the critical bed shear velocity $u_{*c}$ for the 
initiation of particle motion, i.e. 0.017 m/s. The experiments were started, setting the 
inflow rate by the inlet valve and maintaining the required flow depth by the 
downstream control. Run time for a period of 12 h was taken to ensure that the 
quasi-equilibrium state (discussed in the succeeding section) was reached. After 12 h, 
the inflow rate was slowly reduced to zero and the water was carefully drained out from 
the scour hole. 
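
The determination of the shear velocity from a semi-logarithmic velocity plot can be sketched as follows. This is a minimal illustration, assuming the usual logarithmic law $u = (u_*/\kappa)\ln z + C$ with $\kappa = 0.4$; the profile points and the roughness length are synthetic and hypothetical, not the measured data, and the function name is introduced here only for illustration.

```python
import numpy as np

KAPPA = 0.4  # von Karman constant (assumed value)


def shear_velocity(z, u):
    """Shear velocity from the least-squares slope of u against ln(z)."""
    slope, _intercept = np.polyfit(np.log(z), u, 1)
    return KAPPA * slope


# Synthetic profile generated from the log law with u* = 0.0169 m/s
z = np.linspace(0.005, 0.04, 8)          # heights above the bed, m
z0 = 2.0e-5                              # hypothetical roughness length, m
u = (0.0169 / KAPPA) * np.log(z / z0)    # velocities, m/s

u_star = shear_velocity(z, u)
u_star_c = 0.017                         # critical shear velocity quoted in the text, m/s
print(f"u* = {u_star:.4f} m/s; clear water condition met: {u_star < u_star_c}")
```

With the values quoted above, $u_*/u_{*c}$ is about 0.99, i.e. the bed is just below the threshold of particle motion, which is the condition required for clear water scour.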

2.2 Bed stabilization 

A synthetic resin mixed with water (1:3 by volume) was sprayed on the scour bed when 
it was reasonably dry. The sand was sufficiently impregnated with the resin when it was 
left to set for 12 h. The resin was polymerized by spraying a coat of dilute formic acid. 
After 48 h, the bed profile became rock-hard, facilitating measurements. 

2.3 Measuring techniques 

The three components of velocity vectors were measured using a five-hole pitot sphere 
(Shukry 1950) which had a spherical sensing head of 8 mm diameter with five holes lying 
on two meridians perpendicular to each other. The appropriate calibration curves 
enabled the pitch angle and the velocity magnitude to be determined. The direction in 
the horizontal plane of velocity vector was determined by a protractor attached to the 
probe, equalizing pressures at the two pressure points opposed in plan. In the regions of 
negligible vertical velocity (at 75° upstream and in the downstream), a three-tube yaw 
probe (Rajaratnam & Muralidhar 1967) was used to measure the horizontal velocity 
vectors. This probe was made of three brass tubes of 3 mm OD and 1.8 mm ID, rigidly 
fixed together. The centre tube had a flat nose, while the side tubes were chamfered at an 
angle of 45°. The downflow along the front face of the cylinder was measured using a specially 
made pitot tube because the pitot sphere was unsuitable there. As there was no flow across 
the downflow, i.e. the pressure gradient normal to the flow direction was zero, this probe 
was found to be useful. It had a vertical static tube with radial holes at the end and 
a stagnation tube facing the downflow, each of 2 mm OD and 1 mm ID. All the 
probes were connected to precision manometers (inverted type) which had air at the 
top. The manometer readings were taken to an accuracy of 0.1 mm with a Vernier 
attachment. The minimum velocity registered was 0.04 m/s. The flow 
direction and magnitude within the scour hole changed extensively, making the 
measurement difficult. A light aluminium fin, known as a direction finder, was attached 
to the end of a steel rod of 1 mm diameter. The fin could rotate freely about its axis to 
show the flow direction approximately when it was dipped in the flow. It was estimated, 
however, that the mean velocity directions were within ±10°. The most difficult part in 
the flow measurement was to ascertain the reversed flow which was successfully carried 
out by the fin. To get the exact flow direction, the pitot sphere/yaw probe was placed at 
the appropriate position to face the flow and was rotated along a horizontal plane until 
it registered the same pressure in the side holes. Also, the points of detachment of the 
horseshoe vortex on both sides of the cylinder determined by the fin were at ± 75°. The 
flow field in the wake region was ill-defined and was beyond the realm of the study. 
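
The link between the manometer resolution and the smallest measurable velocity can be checked with the pitot relation $v = \sqrt{2g\,\Delta h}$. The sketch below assumes that the inverted air-water manometer reads the dynamic head directly as a water column and that the probe coefficient is unity; both are simplifying assumptions, and the function name is hypothetical.

```python
import math

G = 9.81  # gravitational acceleration, m/s^2


def pitot_velocity(head_m: float, coeff: float = 1.0) -> float:
    """Velocity (m/s) for a dynamic head in metres of water (coeff assumed 1)."""
    return coeff * math.sqrt(2.0 * G * head_m)


v_min = pitot_velocity(0.1e-3)          # head equal to the 0.1 mm reading accuracy
head_U = 0.261 ** 2 / (2.0 * G)         # head produced by the approach velocity U
print(f"v_min = {v_min:.3f} m/s, head at U = {head_U * 1000:.2f} mm")
```

Under these assumptions a head of 0.1 mm corresponds to about 0.044 m/s, which is consistent with the minimum registered velocity of 0.04 m/s.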

2.4 Measurement errors and corrections 

The errors for a pitot tube (Nwachukwu 1979) which may occur in the 
measurements are: (i) the error due to the presence of the pitot tube near the bed, which results 
in a higher recorded impact pressure than the actual one owing to the shifting of streamlines; 
and (ii) the error due to turbulent fluctuations. The error in the measured velocity 
near the bed, based on the 1/7th power law of velocity distribution, was found to be 7% 
(Tingsanchali & Maheswaran 1990). Since no attempt has been made to quantify this 
type of instrumental error for a five-hole pitot sphere and three-tube yaw probe, it was 
assumed that these were subjected to the same error magnitude as that of the pitot tube. 
The error in the mean velocity due to turbulence was estimated from the relationship 
given by Goldstein (1983). Based on the turbulent intensity for equilibrium scour 
measured by Melville & Raudkivi (1977), the local velocity was found to be overestimated 
by 2% (Dey 1991), so no correction was applied for it. All the 
measurements were repeated until two separate measurements 
at a point agreed. The errors of the measurements carried out in the 
reversed flow regions were usually higher than in other regions of the flow, and these data were 
taken with some reservation. 

The approaching flow crossed the upper portion of the pitot sphere, and the measurement of the reversed 
flow near the bed was affected. A correction factor, determined using a micro-pitot 
tube, was therefore applied to the velocity data. The velocities measured by 
a micro-pitot tube along the sloping bed, providing a free passage of approaching flow, 
were compared with those measured by the pitot sphere. The correction factors 
estimated at 0°, 15° and 30° were 1.18, 1.11 and 1.05 respectively. 


3. Quasi-equilibrium scour 

The process of digging by the downflow at the cylinder base and the erosion of slant bed 
continued until a quasi-equilibrium state (Dey et al 1992b) was reached. This happens 
when the particles can no longer be dislodged by the impact of the downflow. In this 
state, the slant bed eroded continually in the form of a few rolling particles which were 
subsequently carried out by the arms of the horseshoe vortex, without any noticeable 
change in scour depth. Full equilibrium was reached only when the rolling motion 
ceased, after an impractically long experimental time. The average angle of the 
upstream sloping bed, in the quasi-equilibrium state, was about 15% greater than the 
the flat base adjacent to the cylinder. The edge of the hole was found to be a circular arc 
up to the point of detachment of the horseshoe vortex from the cylinder at 
$\theta = \pm\psi$ ($\approx \pm 75^\circ$). The width of zone 2 can be expressed as $\delta = \varepsilon(R - 0.5b)$, where $\varepsilon$ = a 
geometric factor, found to be 0.1 (Dey 1991); $R$ = radius of the scour hole, i.e. 
$[d/(1 - \varepsilon)]\cot\phi + 0.5b$; $d$ = depth of scour; and $\phi$ = dynamic angle of repose. In 
cylindrical polar coordinates, the quasi-steady velocities in $(\theta, r, z)$ are represented by 
$(u_u, v_u, w_u)$. 

In the downstream, the measurement was confined to zone 1, because in zone 2 the 
flow separation took place to form the wake. $L$, the linear distance from O to 
P (figure 2), can be approximated by $(R - 0.5b)(1 - \varepsilon)\tan\psi$. The bed was further 
washed out at $x \geq \lambda L$ to attain another flatter slope due to an increase in longitudinal 
velocity with distance. The factor $\lambda$ was found to be 0.8-0.9. The quasi-steady 
velocities in $(x, y, z)$ are denoted by $(u_d, v_d, w_d)$. 
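
The geometric relations of the scour hole can be evaluated as in the sketch below. It uses the relations as read from the text, $R = [d/(1-\varepsilon)]\cot\phi + 0.5b$, $\delta = \varepsilon(R - 0.5b)$ and $L \approx (R - 0.5b)(1-\varepsilon)\tan\psi$, with $\varepsilon = 0.1$ and $\psi = 75°$. The scour depth and the dynamic angle of repose used in the example are hypothetical values chosen only for illustration, as is the function name.

```python
import math


def scour_hole_geometry(d, phi_deg, b, eps=0.1, psi_deg=75.0):
    """Return (R, delta, L) in metres for scour depth d and cylinder diameter b."""
    phi = math.radians(phi_deg)   # dynamic angle of repose
    psi = math.radians(psi_deg)   # detachment angle of the horseshoe vortex
    R = d / (1.0 - eps) / math.tan(phi) + 0.5 * b      # radius of the scour hole
    delta = eps * (R - 0.5 * b)                        # width of zone 2
    L = (R - 0.5 * b) * (1.0 - eps) * math.tan(psi)    # downstream extension
    return R, delta, L


b = 0.076                 # cylinder diameter from the experiment, m
d, phi_deg = 0.08, 35.0   # hypothetical scour depth (m) and dynamic angle of repose
R, delta, L = scour_hole_geometry(d, phi_deg, b)
print(f"R = {R:.3f} m, delta = {delta:.4f} m, L = {L:.3f} m")
```

A useful consistency check is that the sloping part of the hole, of horizontal extent $(1-\varepsilon)(R - 0.5b)$, rises through the full scour depth $d$ at the angle $\phi$.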


Figure 3. Vertical distributions of $u_u$ in the azimuthal planes at (a) 15°, (b) 30°, (c) 
45°, (d) 60° and (e) 75° upstream; and $u_d$ in the transverse plane at (f) 0.2L 
downstream (axis: $r/R$). 

5. Results 

The velocity measurements were undertaken in the azimuthal planes at 0°, 15°, 30°, 45°, 
60° and 75° upstream, and in the transverse plane at 0.2L downstream. At the upstream 
edge of the scour hole, the vertical profile of the approaching velocity, although reduced 
considerably by the larger flow area ahead, was similar to the profile of 
the free approaching velocity. The mean velocity at the edge of the hole, estimated by 
comparing the velocity profile at $r = R$ with the profile of the free approaching velocity 
at 2 m upstream of the cylinder centre, was 0.8U. 

The vertical distributions of the normalized tangential component $u_u/U$ and longitudinal 
component $u_d/U$ are shown in figure 3. Here, $u_u$ and $u_d$ show the characteristic 
features of the passage of the main flow, as a skewed velocity profile, around the 
cylinder. The tangential component $u_u$ was too small for experimental detection at 15°, 
and full detection of $u_u$ was not possible at 30°. The distribution of $u_u$ is similar to 
that of turbulent flow over a flat surface and can, therefore, be represented by a power law 
with no-slip at the bed. In zone 2, the passage of downflow by the side of the cylinder 
enhances $u_u$ substantially. The radial distribution of $u_u$ is not shown here. However, 
a close examination of the vertical distribution of $u_u$ reveals 
considerable skewness in the distribution pattern. This is believed to be due to the 
pronounced effect of the horseshoe vortex, a helical flow, around 
the cylinder. When velocity profiles (as a function of normal distance from 
the bed) are fitted by a power law with an exponent $1/n$, the reported ranges of $n$ are 
$3 \leq n \leq 6$ for meandering channels (Odgaard 1989) and $7 \leq n \leq 10$ for straight channels. 
In the present case, $n$ is found to be approximately 3.8. Hence, the value of 
$n$ decreases with increase in the circulatory motion of the vortex flow. Consequently, the 
skewness of the velocity distribution is greater in both radial and vertical directions with 
strong circulation in the azimuthal planes. The magnitude of $u_u$ is found to 
increase with $\theta$, starting from zero at 0° and reaching a maximum at 75°. 
In fact, the horseshoe vortex, which originates at 0°, is carried by the tangential 
component $u_u$ along the side of the cylinder. In the downstream, the longitudinal 
component $u_d$ is similar in nature to its preceding upstream part $u_u$, with increased 
magnitude. 
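
The fitting of a power law with exponent $1/n$ can be sketched as a straight-line fit in logarithmic coordinates. The example below is only an illustration: the profile is synthetic (generated with $n = 3.8$ and an arbitrary amplitude), the normalized distance from the bed is a hypothetical variable, and the classification simply restates the ranges quoted above.

```python
import numpy as np


def fit_power_law_exponent(zn, u):
    """Return n for u proportional to zn**(1/n), fitted by log-log least squares."""
    slope, _intercept = np.polyfit(np.log(zn), np.log(u), 1)
    return 1.0 / slope


def describe(n):
    """Compare n with the ranges quoted for meandering and straight channels."""
    if 3.0 <= n <= 6.0:
        return "range reported for meandering channels (strong secondary flow)"
    if 7.0 <= n <= 10.0:
        return "range reported for straight channels"
    return "outside the quoted ranges"


zn = np.linspace(0.1, 1.0, 10)        # normalized normal distance from the bed
u = 0.25 * zn ** (1.0 / 3.8)          # synthetic tangential velocities, m/s

n = fit_power_law_exponent(zn, u)
print(f"n = {n:.2f}: {describe(n)}")
```

A smaller $n$ gives a larger exponent $1/n$ and hence a less uniform, more skewed profile, which is the trend attributed above to the circulatory motion.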

The vertical distributions of the normalized radial component $v_u/U$ are shown in 
figure 4. The flow separation at the edge of the scour hole produces a reversal of 
$v_u$ in the hole. Thus, the radial component $v_u$ changes sign, in the scour hole, on either 
side of the locus of $v_u = 0$. This may be compared with the radial velocity in meandering 
channels (Odgaard 1989). The locus may be assumed to be a median of the triangular 
section in zone 1. The vertical distribution of $v_u$ varies linearly in the hole, starting with 
zero at the mid-depth of the hole and becoming a maximum (positive) locally at the bed 
due to the reversed flow along the slope and a maximum (negative) at $z = 0$, with the same 
magnitude. The variation of $v_u$ along the z-axis is exponential above the hole, due to the 
effect of the approaching flow, starting from the profile at $z = 0$ and becoming an 
overall maximum at the free surface. Although the radial distribution of $v_u$ is not shown, 
a close examination of the vertical distribution reveals a parabolic variation of $v_u$ along 
the r-axis, starting with zero at the cylinder boundary and becoming a maximum (positive) 
at the bed following a negative $v_u$ on the other side of the locus of $v_u = 0$ in the hole and 
a maximum (negative) at $r = R$ above the hole. The decreasing nature of $v_u$ with increase 

Figure 4. Vertical distributions of $v_u$ in the azimuthal planes at (a) 0°, (b) 15°, (c) 
30°, (d) 45°, (e) 60° and (f) 75° upstream. 


The radial distributions of the normalized vertical component $w_u/U$ are shown in figure 
5. At the edge of the scour hole, the flow separation is also evident from the reversal 
of $w_u$ in the hole. The variation of $w_u$ is parabolic in the hole, starting with 
a positive value due to the reversed flow along the sloping bed and becoming 
a maximum (negative) at the cylinder boundary along the r-axis and also increasing 
towards negative values along the z-axis (revealed from the radial distribution), with 
opposite signs and a higher magnitude due to the downflow in zone 2. The locus of $w_u = 0$ is 
a curved line meeting the extreme ends of the sloping bed of the hole. $w_u$ changes sign 
on either side of the locus. At 75°, $w_u$ was too feeble for detection. However, there is 
a large helical motion of flow within the scour hole. 

In the downstream region, due to the attenuating nature of swirl flow, v d and w d were 
too small to be detected experimentally. The swirl flow loses its identity after travelling 
a distance of 5 to 6 cylinder diameters. However, the longitudinal velocity component is 
dominating. 

Figure 5. Radial distributions of $w_u$ in the azimuthal planes at (a) 0°, (b) 15°, (c) 
30°, (d) 45°, (e) 60° and (f) 75° upstream. 

Based on the three-dimensional flow simulation developed by Dey et al (1992a), the 
flow field around the cylinder in the quasi-equilibrium scour hole was visualized on an 
EGA colour monitor using the measured data. Figure 6 exhibits the velocity vectors in 
the azimuthal planes. The characteristics of circulatory flow in the scour hole together 
with the strong downflow along the upstream face of the cylinder are depicted. The 
circulation is strong along the upstream face and decreases with increase in $\theta$. In $z > 0$, 
the flow is horizontal for $1 > r/R \geq 0.8$, and then the lower half of the flow gradually 
curves down and becomes almost vertical adjacent to the cylinder. In contrast, the 
upper half of the flow is directed slightly upward due to the obstruction caused by the 
cylinder. The flow velocity at the base of zone 2 is about 0.2U in the present 
case, whereas Melville (1975) measured 0.3U. The result is quite reasonable, because 
the velocity beneath the horseshoe vortex on a flat bed, estimated from the formula given 
by Qadar (1981), $v_u = 0.092\,b^{-0.5}U^{0.83}$, is 0.41U, which reduces considerably in the 
quasi-equilibrium state due to the expansion of the vortex size. The magnitudes of the total 
(three-dimensional) velocities around the cylinder in the azimuthal planes are exhibited 
in figure 7. In the absence of $u_u$ at 0°, the total velocity is solely circulatory, as already shown 
in figure 6. At 75°, $u_u$ dominates and the circulatory flow becomes weak. In other 
azimuthal planes, the evolution of the circulatory flow to the exponential distribution 
of velocity is displayed. The horizontal velocity vectors in the horizontal planes are 
depicted in figure 8. The horizontal velocity vector at $z = 0.8h$ shows the inward 
motion of the flow. At $z = 0$, the flow resembles the irrotational flow past a circular 
cylinder. The helical motion of flow in the scour hole is evident from the flow field at 
$z = -0.4d$. Figure 9, used for the comparison of a part of the present data, shows the 
observations of Melville & Raudkivi (1977) at 0° under equilibrium scour. Their 
observations agree with the flow characteristics at 0° presented in figure 6, which 
supports the validity of the present measuring technique. 

Figure 6. Velocity vectors in the azimuthal planes. 

Figure 9. Velocity vectors at 0° (after Melville & Raudkivi 1977) 
used for comparison purposes. 
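
The flat-bed estimate attributed to Qadar (1981) in the discussion of the velocity at the base of zone 2 can be checked numerically. The sketch below assumes the formula reads $v = 0.092\,b^{-0.5}U^{0.83}$ with $b$ in metres and $U$ in m/s; the printed formula is partly illegible in the source, so the exponent 0.83 and the units are assumptions, and the function name is hypothetical.

```python
def qadar_vortex_velocity(b: float, U: float) -> float:
    """Velocity beneath the horseshoe vortex on a flat bed (assumed form, SI units)."""
    return 0.092 * b ** -0.5 * U ** 0.83


b, U = 0.076, 0.261                  # cylinder diameter (m) and mean velocity (m/s)
v_flat = qadar_vortex_velocity(b, U)
ratio = v_flat / U
print(f"v = {v_flat:.3f} m/s = {ratio:.2f} U")
```

With the experimental values of $b$ and $U$ this gives roughly 0.4U on a flat bed, about twice the 0.2U found at the base of zone 2 in the quasi-equilibrium scour hole.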

The physics of the flow is analyzed with the help of the above results. The main 
characteristic features of the flow around a cylinder are a relatively large secondary 
vortex flow and skewed velocity distributions. The approaching flow separates at the 
upstream edge of the scour hole, known to be a line of separation, to form a vortex flow 
in the hole which acts as a zone of separation. The limiting streamlines along the 
original bed upstream (due to the approaching flow) and the sloping bed (due to the 
reversed flow) merge at the edge of the scour hole forming separated streamlines. Thus, 
a surface of separation is produced in the form of an envelope. In this process, the 
approaching flow curves down into the hole and rolls to form a vortex flow which 
migrates downstream by the sides of the cylinder. 


6. Concluding remarks 

The experimental results of the distribution patterns of velocity components for 
a three-dimensional vortex flow around a circular cylinder in a quasi-equilibrium scour 
hole under a clear water regime have been presented. In the upstream, the tangential 
component $u_u$ follows a power law, starting with zero at the bed and becoming 
a maximum at the free surface and at the cylinder boundary, with a higher magnitude in 
zone 2. The radial component $v_u$ varies linearly in the hole along the z-axis, starting with 
zero at the mid-depth of the scour hole and becoming a maximum locally at the bed and 
at $z = 0$, with opposite signs and the same magnitude. $v_u$ varies exponentially above the 
hole along the z-axis, starting from the profile at $z = 0$ and becoming an overall maximum 
at the free surface. Along the r-axis, the distribution of $v_u$ is parabolic, starting with zero at 
the cylinder boundary and becoming a maximum at the bed in the hole and at $r = R$ 
above the hole. The variation of the vertical component $w_u$ is parabolic in the hole, starting 
with a positive value at the bed and becoming a maximum at the cylinder boundary 
along the r-axis and at $z = 0$ along the z-axis, with opposite signs and a higher magnitude in 
zone 2. In the downstream, the swirl flow attenuates and becomes very feeble, and 
the longitudinal component $u_d$ becomes gradually stronger than its preceding part $u_u$ 
upstream. The flow simulation results show a helical motion of fluid around the cylinder. 


This investigation is a part of the doctoral dissertation work by the author in the 
Department of Civil Engineering, Indian Institute of Technology, Kharagpur under 
the joint supervision of S K Bose and G L N Sastry. The helpful suggestions from 
S C Jain, Institute of Hydraulic Research, University of Iowa, are gratefully acknowledged. 
The author also appreciates the assistance provided by B Dey during the 
investigation. 

cylinder diameter; 
quasi-equilibrium depth of scour; 

16, 50, 84% finer particle diameters; 
approaching flow depth; 

longitudinal extension of scour hole downstream; 
exponent; 

radius of scour hole; 

mean approaching velocity; 

velocities in cylindrical polar coordinates upstream; 

velocities in Cartesian coordinates downstream; 

shear velocity; 

critical shear velocity; 

Cartesian coordinates; 
factors; 

width of zone 2; 

angle of repose; 

dynamic angle of repose; 

cylindrical polar coordinates; 

angle of detachment of horseshoe vortex; 

geometric standard deviation. 

Abstract. The paper describes the multigrid acceleration technique 
to compute numerical solutions of three equations of common fluid 
mechanical interest: the Laplace equation, the transonic full potential equation 
and the Reynolds averaged Navier-Stokes equations. Starting with the simple 
and illustrative multigrid studies on the Laplace equation, the paper 
discusses its application to the cases of full potential equation and the 
Navier-Stokes equations. The paper also discusses some elements of 
multigrid strategies like V- and W-cycles, their relative efficiencies, the effect 
of number of grid levels on the convergence rate and the large CPU time 
saving obtained from the multigrid acceleration. A few computed cases of 
transonic flows past airfoils using the full potential equations and the 
Navier-Stokes equations are presented. A comparison of these results with 
the experimental data shows good agreement of pressure distribution and 
skin friction. With the greatly accelerated multigrid convergence, the full 
potential code typically takes about 10 seconds and the Navier-Stokes 
code for turbulent flows takes about 5 to 15 min of CPU time on the Convex 
3820 computer on a mesh which resolves the flow quantities to good levels 
of accuracy. This low CPU time demand, made possible due to multigrid 
acceleration, on one hand, and the robustness and accuracy on the other, 
offers these codes as designer’s tools for evaluating the characteristics of 
the airfoils. 

Keywords. Multigrid; full-potential solution; Navier-Stokes solution; 
Laplace equation. 


1 Various parts of this paper have been presented at the following conferences: (i) 5th Asian Cong. 
on Fluid Mech., Taejon, Korea, 1992, (ii) Int. Conf. on Methods of Aerophysical Research, 
Novosibirsk, 1992, (iii) Fluid Dyn. Symp. in honour of Prof. R Narasimha on his 60th birthday, 

1. Introduction 

Starting with the pioneering work of Federenko (1964), the theory and application of 
multigrid has come of age. The large time saving due to the multigrid acceleration has 
offered great relief to the workers in many areas of science and technology, especially in 
the developing countries where multi-megaflop computing machines are not yet 
common. Even today at many of these institutions, a typical 2-D turbulent Navier- 
Stokes solution on a grid size of, say, 385 x 65, requires 2 to 3 days of turn-around time. 
The multigrid technique has also given several codes the potential 
to be used as designer’s tools. The combined effect of high speed computing 
machines and multigrid accelerated codes has led to such realistic design computations 
that a well converged 3-D Navier-Stokes flow computation past a wing-body-nacelle 
configuration can be obtained in as little as 4 min of CPU time (Jameson 1993) 
on some of the high speed computers. Historically, the first publication on multigrid 
method put to practice was brought out by Brandt (1972) followed by another 
publication (Brandt 1977) five years later with convincing results which set the pace for 
rapid progress in multigrid methods and applications (Wesseling 1990). From the same 
historical angle, it may be mentioned that Hackbusch (1976) (as mentioned in Wesseling 
1990) also independently discovered the multigrid method. Today, the multigrid 
method has gone beyond its original domain of application, the elliptic systems for which its 
theory was basically developed. One can find large scale application of multigrid in solving the 
time dependent Navier-Stokes equations and several other hyperbolic problems. 
During the last few years, multigrid has also made inroads in the areas of hydrodynamic 
stability, integral and integro-differential equations, many-body interactions, 
semiconductor device simulation, general relativity, theory of material defects 
and atomic physics, real time optimal control, protein folding, combinatorial optimisation, 
linear programming etc. (Brandt 1988). 

In CFD, one seeks converged solution to viscous flow problems on a grid which is 
fine enough to resolve all the length scales involved. In most of the cases, the actual 
number of node points thus obtained is quite large leading to a large number of 
algebraic equations to be solved resulting from the discretization. Also, most of the
relaxation schemes are efficient only at smoothing out the errors that are of high
frequency on the scale of the grid. The solution on a single fine grid is therefore too
expensive for two reasons: (i) the large number of algebraic equations resulting from the
discretization process, and (ii) the very long time the relaxation schemes take to smooth
out the low frequency errors. Quoting a concept from Brandt (1977), the multigrid method uses
a sequence of grids “taking advantage of the relation between different discretizations 
of the same problem. This method can be viewed in two complementary ways: one is to 
view the coarse grids as the correction grids, accelerating convergence of a relaxation 
scheme on the finest grid by efficiently liquidating smooth error components. Another 
point of view is to regard finer grids as the correction grids by correcting their forcing 
terms”. The first view of Brandt (1977), which is an early and monumental work on the 
multigrid methods, refers to the definition of high frequency components with respect 
to each grid which are smoothed fastest by most relaxation schemes. The second view 
offers the possibility to “manipulate accurate solutions on coarser grids, with only 
infrequent ‘visits’ to pieces of finer levels”. 

The next section gives a brief description of the basic elements of the multigrid technique
followed in the present work to get accelerated convergence of (a) the solution of the Laplace
equation on Cartesian grids, (b) the solution of transonic potential flow equations, and (c) the
solution of Navier-Stokes equations for 2-D transonic flow.


2. A brief description of multigrid method 

Let $G^h, G^{2h}, \ldots, G^{Mh}$ be a hierarchy of grids with $h$ indicating the finest level. Let the
equation to be solved be given as

$$ L\phi = f. \tag{1} $$

The discretized form of (1) on grid level $h$ is written as

$$ L^h \phi^h = f^h. \tag{2} $$

If $\phi^h$ is an approximate solution of $\phi$ and $\psi^h$ is the correction term, then

$$ L^h(\phi^h + \psi^h) = f^h \tag{3} $$

or

$$ L^h(\phi^h + \psi^h) - L^h\phi^h = f^h - L^h\phi^h = \mathcal{R}^h. \tag{4} $$

$\mathcal{R}^h$ is known as the residual on grid $G^h$. A certain number of relaxation sweeps are made
to obtain $\phi^h$. Ideally, when the convergence slows down, the relaxation on the present
level is stopped and the solution and the residual are injected down to the next coarser
level $2h$, where one can write (4) for this grid as follows:

$$ L^{2h}\bigl(\mathcal{I}_h^{2h}(\phi^h + \psi^h)\bigr) - L^{2h}\mathcal{I}_h^{2h}\phi^h = \hat{\mathcal{I}}_h^{2h}\mathcal{R}^h, \tag{5} $$

where $\mathcal{I}_h^{2h}$ and $\hat{\mathcal{I}}_h^{2h}$ are injection operators which need not be the same. Equation (5) is
rewritten as

$$ \begin{aligned}
L^{2h}\phi^{2h} &= \hat{\mathcal{I}}_h^{2h}\mathcal{R}^h + L^{2h}\mathcal{I}_h^{2h}\phi^h \\
&= \hat{\mathcal{I}}_h^{2h}(f^h) + \bigl(L^{2h}\mathcal{I}_h^{2h}\phi^h - \hat{\mathcal{I}}_h^{2h}(L^h\phi^h)\bigr) \\
&= \hat{\mathcal{I}}_h^{2h}(f^h) + \tau^{2h},
\end{aligned} \tag{6} $$

where

$$ \phi^{2h} = \mathcal{I}_h^{2h}(\phi^h + \psi^h). \tag{7} $$

$\tau^{2h}$ is the truncation error of the coarse grid relative to the fine grid. It is seen that
without the addition of the term $\tau^{2h}$ in (6), the discretized coarse grid equation has the
same form as (2) for the fine grid. In fact, when the solution has converged, the coarse grid
solution will be the same as the injected fine grid solution and $\tau^{2h}$ will vanish. In that sense,
the term $\tau^{2h}$, also known as the forcing function, is identified as the “fine to coarse grid
defect correction”. This term can also be seen to represent the interplay between the
discretization and the solution process.

This procedure is carried on till the coarsest grid is reached. The corrected coarse
grid solution thus obtained needs to be transferred to the fine grid to obtain the
updated fine grid solution. This step is written as

$$ \phi^h_{\mathrm{new}} = \phi^h_{\mathrm{old}} + \mathcal{I}_{2h}^{h}\bigl(\phi^{2h} - \mathcal{I}_h^{2h}\phi^h_{\mathrm{old}}\bigr), \tag{8} $$

where $\mathcal{I}_{2h}^{h}$ is a suitable interpolation operator. Note that if the coarse grid solution were
simply prolongated to the fine grid without forming the correction term as above, it would
be devoid of all the high frequency components of the solution and would also
contain interpolation error. In this sense, the process takes the best advantage of
the high frequency smoothing property of the chosen relaxation scheme on each level,
and a lot of CPU time is saved by working on coarser grid levels.
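
The steps (4) to (8) can be illustrated with a minimal two-grid FAS sketch. It is not the
code used in this work: the 1-D model problem $-u'' = f$, the Gauss-Seidel smoother, the
choice of plain injection for the solution and full weighting for the residual, the sweep
counts and all function names are assumptions made only for illustration.

```python
import math

def apply_L(u, h):
    """Discrete operator L^h u = -u'' (3-point central difference), interior points only."""
    n = len(u) - 1
    Lu = [0.0] * (n + 1)
    for i in range(1, n):
        Lu[i] = (-u[i - 1] + 2.0 * u[i] - u[i + 1]) / (h * h)
    return Lu

def relax(u, f, h, sweeps):
    """Gauss-Seidel sweeps (in place); boundary values are left untouched."""
    n = len(u) - 1
    for _ in range(sweeps):
        for i in range(1, n):
            u[i] = 0.5 * (u[i - 1] + u[i + 1] + h * h * f[i])

def residual(u, f, h):
    """R^h = f^h - L^h u^h as in (4); set to zero on the boundary."""
    Lu = apply_L(u, h)
    r = [f[i] - Lu[i] for i in range(len(u))]
    r[0] = r[-1] = 0.0
    return r

def norm(v):
    return math.sqrt(sum(x * x for x in v) / len(v))

def fas_two_grid(u, f, h, pre=2, post=2, coarse_sweeps=400):
    """One FAS cycle on two levels (h and 2h), updating u in place."""
    relax(u, f, h, pre)                            # smooth high-frequency error
    r = residual(u, f, h)
    nc = (len(u) - 1) // 2
    uc = [u[2 * I] for I in range(nc + 1)]         # solution transfer: injection
    rc = [0.0] * (nc + 1)
    for I in range(1, nc):                         # residual transfer: full weighting
        rc[I] = 0.25 * (r[2 * I - 1] + 2.0 * r[2 * I] + r[2 * I + 1])
    Luc = apply_L(uc, 2.0 * h)
    fc = [rc[I] + Luc[I] for I in range(nc + 1)]   # coarse right-hand side, first line of (6)
    vc = uc[:]                                     # full approximation on the coarse grid
    relax(vc, fc, 2.0 * h, coarse_sweeps)          # coarse problem solved by many sweeps
    ec = [vc[I] - uc[I] for I in range(nc + 1)]    # coarse correction
    for I in range(nc):                            # linear interpolation, as in (8)
        u[2 * I] += ec[I]
        u[2 * I + 1] += 0.5 * (ec[I] + ec[I + 1])
    relax(u, f, h, post)                           # remove interpolation noise
    return u

def run_demo(n=32, cycles=10):
    """Solve -u'' = pi^2 sin(pi x), u(0) = u(1) = 0; exact solution sin(pi x)."""
    h = 1.0 / n
    x = [i * h for i in range(n + 1)]
    f = [math.pi ** 2 * math.sin(math.pi * xi) for xi in x]
    u = [0.0] * (n + 1)
    r0 = norm(residual(u, f, h))
    for _ in range(cycles):
        fas_two_grid(u, f, h)
    r_mg = norm(residual(u, f, h))
    err = max(abs(u[i] - math.sin(math.pi * x[i])) for i in range(n + 1))
    return r0, r_mg, err
```

Calling `run_demo()` returns the initial residual norm, the residual norm after ten cycles and
the maximum difference from the exact solution. For this linear model problem the FAS
cycle reduces to the Correction Scheme; the full-approximation form is kept because it
carries over unchanged to nonlinear operators.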


3. Application to Laplace equation 

The Laplace and Poisson equations (differing only by the absence or the presence of the 
nonhomogeneous term) are of fundamental importance to fluid dynamics. The Laplace 
equation occurs as the main governing equation for the incompressible, inviscid and 
irrotational fluid flow, and the Poisson equation occurs as a component equation in 
case of incompressible Navier-Stokes flows. Also, the simplicity of this equation makes 
it a very illustrative platform. 

Since the Laplace equation is linear, the operator $L$ in (3) can be linearly distributed
over $\phi^h$ and $\psi^h$ and one can proceed to solve directly for the correction $\psi^h$ using what is called
the “Correction Scheme”. But in this work, a more general method called the Full
Approximation Storage (FAS) scheme, as applicable to nonlinear equations, is
used.

Here the interest is to solve the Laplace equation using multigrid acceleration. In this 
section, the work of Singh (1989) for multigrid strategy will be followed. The Laplace 
equation is written as; 

$$ \nabla^2 \phi = 0. \tag{9} $$

The boundary conditions are taken to be as follows: (i) at $y = 0$, $\phi = 0$; (ii) at $y = \pi$,
$\phi = 0$; (iii) at $x = 0$, $\phi = \sin(y)$; and (iv) at $x = 1$, $\phi = e\,\sin(y)$. The problem is discretized over
a domain given by $0 \le x \le 1$ and $0 \le y \le \pi$ using the Cartesian grid.
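
The boundary data above are those of the harmonic function $\phi = e^{x}\sin(y)$, which can
therefore serve as a reference solution. The short check below is only a sketch under that
assumption; the function names and the small grid sizes are illustrative and are not taken
from the present work. It evaluates the five-point central-difference Laplacian on the
reference solution and shows the second-order decay of the truncation error.

```python
import math

def phi_exact(x, y):
    """Reference solution matching boundary conditions (i)-(iv)."""
    return math.exp(x) * math.sin(y)

def max_laplacian_residual(nx, ny):
    """Largest |five-point Laplacian of phi_exact| over the interior nodes
    of a uniform nx-by-ny interval grid on 0 <= x <= 1, 0 <= y <= pi."""
    hx = 1.0 / nx
    hy = math.pi / ny
    worst = 0.0
    for i in range(1, nx):
        for j in range(1, ny):
            x, y = i * hx, j * hy
            d2x = (phi_exact(x - hx, y) - 2.0 * phi_exact(x, y) + phi_exact(x + hx, y)) / hx ** 2
            d2y = (phi_exact(x, y - hy) - 2.0 * phi_exact(x, y) + phi_exact(x, y + hy)) / hy ** 2
            worst = max(worst, abs(d2x + d2y))
    return worst

# Halving the mesh spacing should reduce the truncation error roughly fourfold.
ratio = max_laplacian_residual(16, 16) / max_laplacian_residual(32, 32)
```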

The problem being purely elliptic, the equation is discretized by writing the second 
derivatives using the central difference scheme. The resulting difference equations are 
solved by successive line relaxation technique. A V-cycle FAS multigrid strategy with 
independently selected number of sweeps at various grid levels will be used. The results 
are presented for grid levels up to six. 

Figure 1 shows several convergence histories of the solution procedure as a function 
of number of grid levels used. In each case the solution is sought on the finest grid level 
of size 65 x 129. The figure shows the effect of number of grid levels on the multigrid 
convergence. The gain in terms of speed of convergence rapidly increases as the number 
of grid levels is increased from 1 to 4. The gain slows down when 5 grid levels are 
used. Beyond this, one can see that there is just about no gain in the rate of convergence 
at all as the number of grid levels is increased from 5 to 6. The results of this study are 
summarised in table 1 below. 

Table 1 shows the level of error obtained using one-level computation and multilevel
computations using 2 to 6 grid levels after 100 work units. L2 (norm) stands for the
absolute value of the L2-norm of the residual. The drastic gain in the solution
convergence is clear. One can see that after 100 work units, the 5-grid level solution has
converged to a residual about 14 orders of magnitude lower than the corresponding
single-level value. Of course, it is a different matter that, in practical applications,
one never needs to drive convergence to more than, say, 3 to 4 orders of magnitude,
which is obtained here in far fewer work units. The basic nature of this observation
is true in general for all systems, i.e., the law of diminishing returns applies as the number
of coarser meshes is increased beyond the optimum number. Often four to five grid levels
are optimum. Of course, the level to which the solution converges after a certain
number of work units depends on the system being solved (nonlinearity,
constraints, presence of discontinuities etc. tend to degrade the multigrid performance).
The next two examples of full potential equation and Navier-Stokes solution will
elucidate this point.

Figure 1. Multigrid convergence and effect of number of grid levels.

Table 1. Efficiency of multigrid scheme with respect to the number of grid levels used.

Number of grid levels    Abs. value of L2 (norm) after 100 work units
1-level                  $8.28$
2-level                  $4.7 \times 10^{-2}$
3-level                  $2.5 \times 10^{-3}$
4-level                  $6.3 \times 10^{-8}$
5-level                  $2.8 \times 10^{-14}$
6-level                  $2.8 \times 10^{-14}$

3.1 Effect of boundary condition “defect” on multigrid convergence

In the above, the Laplace equation has been solved with the exact boundary condition.
This does not involve any approximation of the boundary values. However, when
a problem is solved which involves approximation (and hence error) in the boundary
values, which we call “boundary condition defect” here, the performance of the same
multigrid accelerated iterative procedure is impaired significantly. To study this, we
consider a computationally similar problem (Singh 1992) that involves extrapolation to
compute the boundary condition and hence introduces error or “boundary condition
defect”. We present the results of the following demonstration computations.

Study-1: The solution of the Laplace equation with rectangular boundary and exact
boundary conditions, for comparison of convergence history.

Study-2: The problem in Study-1 is recomputed using the difference form of the Neumann
boundary condition $\partial\phi/\partial y = \exp(x)\cos(y)$ as the boundary condition on the bottom
edge. The truncation error of the difference scheme introduces some error.

Study-3: The problem in Study-1 is solved with the following modification: the bottom
edge is cut slant as shown in figure 2. The boundary condition implementation at this edge
requires values of $\phi$ at the dummy points, for which the exact values are provided. The
boundary conditions on the left, right and top edges remain the same.

Study-4: All features are identical to Study-3 except that the values at the bottom
boundary point were obtained using Taylor series expansion as described below. This adds
further to the errors.

Figure 2. Computational domain for Studies 3 and 4.

The value of $\phi$ at the slant bottom boundary is written as follows by opening a Taylor
series around $(i, j-1)$:

$$ (\phi)_{y_b} = (\phi)_{y_{i,j-1}} + (y_b - y_{i,j-1})(\phi_y)_{i,j-1} + \tfrac{1}{2}(y_b - y_{i,j-1})^2(\phi_{yy})_{i,j-1}, \tag{10} $$

where

$$ (\phi_y)_{i,j-1} = \bigl(-3(\phi)_{i,j-1} + 4(\phi)_{i,j} - (\phi)_{i,j+1}\bigr)/2\Delta y \tag{11} $$

and

$$ (\phi_{yy})_{i,j-1} = \bigl((\phi)_{i,j-1} - 2(\phi)_{i,j} + (\phi)_{i,j+1}\bigr)/\Delta y^2. \tag{12} $$

The subscript ‘b’ indicates values on the slant boundary. The values of $(\phi)_{y_b}$ are
provided exactly to compute $(\phi)_{i,j-1}$. The higher level of truncation error introduced
artificially is obvious. Figure 3 shows the comparison of the convergence histories of
the above studies. The study illustrates the level of stagnation residues in the various
studies. The residue level ($\approx 10^{-7}$ in 32-bit arithmetic, $\approx 10^{-15}$ in 64-bit arithmetic)
is the lowest in Study-1 where no “defect” is involved in the boundary condition
implementation. Study-2 has a relatively higher level of stagnation residue ($\approx 10^{-6}$ in
32-bit arithmetic) where there is a relatively smaller amount of boundary condition
“defect”. Even though we have considered a slant bottom edge in Study-3, which requires
use of the dummy point, the stagnation residue level is the same as that in Study-1
because the value there is provided exactly. Study-4, which has the highest level of
“defect”, leads to the highest level of stagnation residue ($\approx 10^{-4}$ in 64-bit arithmetic).
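
The way the dummy-point value follows from (10) to (12) can be sketched as below. The sketch
assumes that the expansion in (10) is carried to second order with the one-sided differences
(11) and (12), so that the three equations can be solved in closed form for the value at
$(i, j-1)$; the function name and argument list are hypothetical.

```python
def dummy_point_value(phi_b, phi_j, phi_jp1, y_b, y_jm1, dy):
    """Value at the dummy point (i, j-1) implied by (10)-(12).

    phi_b   : prescribed value on the slant boundary at y = y_b
    phi_j   : value at (i, j)
    phi_jp1 : value at (i, j+1)
    y_jm1   : ordinate of the dummy point; dy is the uniform mesh spacing
    """
    s = (y_b - y_jm1) / dy                 # fractional distance of the boundary
    # Substituting (11) and (12) into (10) gives
    #   phi_b = (1 - 1.5 s + 0.5 s^2) phi_{j-1} + (2 s - s^2) phi_j + 0.5 (s^2 - s) phi_{j+1}
    denom = 1.0 - 1.5 * s + 0.5 * s * s    # vanishes at s = 1, where boundary and node j coincide
    return (phi_b - (2.0 * s - s * s) * phi_j - 0.5 * (s * s - s) * phi_jp1) / denom

# Check on a quadratic profile, for which (10)-(12) are exact.
profile = lambda y: 2.0 * y * y - 3.0 * y + 1.0
value = dummy_point_value(profile(0.04), profile(0.1), profile(0.2), 0.04, 0.0, 0.1)
```

For profiles that are not quadratic the recovered value carries the truncation error of
(10) to (12), which is the artificially introduced “defect” of Study-4.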

The effect of the boundary condition “defect” can also be seen by explicitly adding
a perturbation term to the exact boundary condition in Study-1. Let the perturbation
term to the boundary condition be chosen as $A\sin(n\pi x)$, where $n = 1, 2, 3, \ldots$ with
$0 < A < 1$. The term is explicitly added to the boundary condition on the bottom edge in
Study-1. One can infer from figure 4 that as the amplitude $A$ increases, the magnitude of
the boundary condition “defect” increases, which leads to the convergence process stagnating
at higher levels. The above study illustrates the sensitivity of the convergence history in
the multigrid process to the boundary condition “defect”.

Figure 3. Comparison of convergence rates for studies 1 to 4.


4. Application to full potential transonic flow computation 

Even today, most of the CFD codes that are used in preliminary aircraft aerodynamic
design are based on simpler methods like the panel method and full potential
equations. Thus, it is useful to have these codes accelerated using techniques like
multigrid. In this section, the implementation of multigrid acceleration in a full
potential transonic flow solver due to Garabedian & Korn (1971) is described.

Historically, the first successful application of the multigrid acceleration to a potential
function based formulation for transonic flow was due to South & Brandt (1977). In
their work, they presented the accelerated solution of the small perturbation equation for
non-lifting flow past an airfoil. Singh (1989) used the multigrid acceleration to get
solutions for flow past airfoils using the code of Garabedian & Korn (1971) modified
and adapted by him at NAL. An acceleration of about 5 to 7 times (in terms of work
units) was obtained using the multigrid strategy described. The basic method of
Garabedian & Korn (1971) is described below.

Figure 4. Effect of boundary condition “defect” on multigrid convergence.

The 2-D full potential flow equation in Cartesian coordinates is

$$ (c^2 - u^2)\phi_{xx} - 2uv\,\phi_{xy} + (c^2 - v^2)\phi_{yy} = 0, \tag{13} $$

where $\phi$ is the velocity potential, $u$, $v$ the velocity components in the $x$ and $y$ directions, and
$c$ the local speed of sound given by Bernoulli’s law

$$ (u^2 + v^2)/2 + c^2/(\gamma - 1) = (1/2) + M_\infty^{-2}/(\gamma - 1). \tag{14} $$
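
As a small worked illustration of (14), the sketch below evaluates the local speed of sound
and the local Mach number from the velocity components. It assumes that velocities are
normalised by the free-stream speed, so that the free-stream speed of sound is $1/M_\infty$;
the function names and the default $\gamma = 1.4$ are illustrative choices.

```python
import math

def sound_speed_sq(u, v, mach_inf, gamma=1.4):
    """c^2 from Bernoulli's law (14), with unit free-stream speed assumed."""
    total = 0.5 + 1.0 / ((gamma - 1.0) * mach_inf ** 2)   # right-hand side of (14)
    return (gamma - 1.0) * (total - 0.5 * (u * u + v * v))

def local_mach(u, v, mach_inf, gamma=1.4):
    """Local Mach number q/c; values above 1 mark the supersonic pocket."""
    return math.sqrt((u * u + v * v) / sound_speed_sq(u, v, mach_inf, gamma))

# At free-stream conditions (u, v) = (1, 0) the local Mach number equals M_inf.
m_check = local_mach(1.0, 0.0, 0.7)
```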

Using Sell’s conformal mapping, the entire infinite flow field is mapped on to the
interior of a circle of unit radius such that the perimeter of the circle corresponds to the
airfoil contour and the centre corresponds to infinity. In this plane, using $(r, \theta)$ coordinates,
the full potential equation, (13), is written as

$$ \begin{aligned}
&(c^2 - r^2 f^{-2}\phi_\theta^2)\,\phi_{\theta\theta} - 2r^4 f^{-2}\phi_\theta\phi_r\,\phi_{\theta r} + r^2(c^2 - r^4 f^{-2}\phi_r^2)\,\phi_{rr} \\
&\quad + r(c^2 - r^2 f^{-2}\phi_\theta^2 - 2r^4 f^{-2}\phi_r^2)\,\phi_r + f^{-3}(r^2\phi_\theta^2 + r^4\phi_r^2)(f_\theta\phi_\theta + r^2 f_r\phi_r) = 0,
\end{aligned} \tag{15} $$

where $f = r^2\,|F'(re^{i\theta})|$ and $F$ is the mapping function defined as $x + iy = F(re^{i\theta})$. The
boundary conditions for the problem are (a) the flow tangency condition on the
surface, i.e.,

$$ \partial\phi/\partial n = 0, \tag{16} $$

and (b) the outer boundary condition written as

$$ \Phi = (\Gamma/2\pi)\tan^{-1}\bigl((1 - M_\infty^2)^{1/2}\tan(\theta + \alpha)\bigr), \tag{17} $$

where $\Phi$ is the full potential, $\alpha$ is the angle of attack, $\theta$ is the coordinate in the $(r, \theta)$ plane
and $M_\infty$ is the free stream Mach number. Equation (15) is discretized using the mixed
finite difference scheme due to Murman & Cole (1971).


4.1 Multigrid acceleration of full potential equation 

The nonlinearity of the full potential transonic flow equation with shock sitting on the 
airfoil surface poses special problems in the multigrid process by making the solution 
very sensitive. It turns out that the high frequency errors introduced during the coarse 
grid correction interpolation, especially in the vicinity of the shock, keep offsetting the 
smoothness obtained on the finer levels. This impairs the rate of convergence.

Figure 5. Comparison of pressure distribution with and without multigrid, NACA 0012,
$M_\infty = 0.7$, $\alpha = 1^\circ$.


The multigrid strategy selected here is also the FAS scheme, starting at the finest of
the 3-level grid hierarchy with a V-cycle. The finest grid is 160 x 32. The next two coarser
grids are obtained by omitting the alternate points in both the $r$ and $\theta$ directions. The
coarse grid correction is interpolated using simple algebraic averaging of the values at
the two coarse grid nodes. The relaxation sweep strategy adopted for the results
presented here consists of a constant number of sweeps at all levels in both the restriction
and the prolongation phases (though a choice exists to select any number of sweeps
at any level). The boundary conditions are updated at all the grid levels. The
computations are terminated when the L2-norm of the residuals becomes less than $10^{-4}$.

Using this simple strategy, two typical results are presented for flow computations
made for (i) the NACA 0012 airfoil at $M_\infty = 0.7$ and $\alpha = 1^\circ$ and (ii) the Garabedian-Korn
airfoil at $M_\infty = 0.702$ and $\alpha = 1.1^\circ$. Figure 5 shows the pressure distribution for case (i),
which is supercritical though without a shock. The agreement between the single grid
computation and the multigrid solution is just about total. Figure 6 shows the convergence
history for this case. The single grid computation took 562 work units to converge to
the specified tolerance level while the multigrid solution took only 125 work units,
leading to an acceleration of about 4.5 times in terms of work units. Figure 7 shows the
comparison of pressure distribution for case (ii). This is a supercritical case with shock.
Like the earlier case, the agreement between the single grid computation and the multigrid
solution is good except for a small difference in the vicinity of the shock. The reason for
this, explained earlier, is associated with the interpolation of the coarse grid
correction. Figure 8 shows the convergence history for this case. The single grid solution
took 696 work units to converge to the specified tolerance level, whereas the multigrid
solution took only 100 work units, giving nearly 7 times faster convergence. The
gain in convergence speed for more difficult cases like the ones with strong shock and
high angle of attack is higher. This is because the instability of the shock, which greatly
impairs the single grid convergence, is effectively handled by the smoothing capability of
the basic scheme at multiple levels with the added advantage of the coarse grid correction
of the multigrid process. (The simple grid sequencing technique also derives the
advantage of multiple levels though without the advantage of any interaction between the
solution process and the discretization seen in the form of coarse grid correction.) In fact,
this speed-up can be further improved by adopting strategies like implementing a
convergence rate based switching criterion from fine to coarse grid in the injection phase,
working on additional coarser levels, using W-cycles and full multigrid, trying out
a better smoothing of corrections in the prolongation phase etc. Next, in the following
section, the implementation of multigrid acceleration to obtain the solution of a more complex
system, i.e., the compressible Reynolds averaged Navier-Stokes equations, is described.

Figure 6. Comparison of convergence with and without multigrid, NACA 0012,
$M_\infty = 0.7$, $\alpha = 1^\circ$.

Figure 7. Comparison of pressure distribution with and without multigrid, G-K airfoil,
$M_\infty = 0.702$, $\alpha = 1.1^\circ$.

Figure 8. Comparison of convergence with and without multigrid, G-K airfoil,
$M_\infty = 0.702$, $\alpha = 1.1^\circ$.
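
The acceleration factors quoted for the two airfoil cases follow directly from the work-unit
counts. The few lines below simply reproduce that arithmetic; the helper name is
hypothetical.

```python
def speedup(single_grid_work_units, multigrid_work_units):
    """Acceleration in terms of work units needed to reach the same tolerance."""
    return single_grid_work_units / multigrid_work_units

naca0012_gain = speedup(562, 125)   # case (i): about 4.5
gk_airfoil_gain = speedup(696, 100)  # case (ii): nearly 7
```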


5. Multigrid acceleration of Navier-Stokes solver 

The full Navier-Stokes equations are believed to provide complete physics of the 
various fluid flow phenomena. Thanks to the developments in the area of efficient
algorithms and acceleration techniques, particularly the multigrid technique, on the one
hand, and the faster computing machines, on the other, it has become possible to
compute Navier-Stokes flows around practical configurations in reasonable
turnaround time. Still, when it comes to simulating turbulence, the situation is not very
encouraging, at least as far as the computing resources are concerned. In spite of these
difficulties, the solutions of Navier-Stokes equations are of great interest where one 
solves what is called Reynolds averaged Navier-Stokes equations which involve 
introduction of an empirical model to simulate turbulence. The details on these aspects 
are now available in the text books and only a brief description is presented here for 
ready reference. Some important aspects of discretization of these equations, the 
solution process and the implementation of multigrid will be described in the next 
sections. This will be followed by discussions of a few cases of multigrid solution of the 
transonic flow computations. The multigrid convergence of the
present Navier-Stokes code, based on the described strategy, has been tested over
a range of Mach numbers from 0.1 to about 4.0 (Singh et al 1993).

Most of the progress in the multigrid solution of the Navier-Stokes equations has 
taken place during the last ten years or so. Several algorithms and methods have been 
devised to handle specific flow situations. A good number of these Navier-Stokes 
solvers use the finite volume discretization and Runge-Kutta time stepping scheme 
with addition of an explicit artificial viscosity to stabilise the solution. The solver in the
present work is based on this method (for details see Jameson et al (1981), Jameson
(1985), Martinelli (1987), Kloppmann et al (1993), Singh et al (1993) and additional
references therein).

5.1 The Navier-Stokes equations


The time-dependent Navier-Stokes equations for two-dimensional flow are written as
(Singh et al 1993a)

$$ \frac{d}{dt}\int W\, d(\mathrm{vol}) + \oint H \cdot n\, dS = 0, \tag{18} $$

where the vector $W$ holds the cell averaged mass, momentum and total energy and is written as

$$ W = (\rho, \rho u, \rho w, \rho E)^T, \tag{19} $$

and $n$ is the unit vector normal to the cell boundaries. $u$ and $w$ are the Cartesian
velocities. The energy equation is written as

$$ E = e + (u^2 + w^2)/2, \tag{20} $$

where $e$ is the mass averaged internal energy given as

$$ e = p/(\rho(\gamma - 1)), \tag{21} $$

and $p$ is the pressure. The flux tensor $H$ is given as

$$ H = \begin{pmatrix}
\rho u & \rho w \\
\rho u^2 - \sigma_x & \rho w u - \tau_{zx} \\
\rho u w - \tau_{xz} & \rho w^2 - \sigma_z \\
(\rho E - \sigma_x)u - \tau_{xz} w + q_x & (\rho E - \sigma_z)w - \tau_{zx} u + q_z
\end{pmatrix} \tag{22} $$

with

$$ \sigma_x = -p + \tfrac{2}{3}\mu(2u_x - w_z), \qquad \sigma_z = -p + \tfrac{2}{3}\mu(2w_z - u_x), \qquad \tau_{xz} = \tau_{zx} = \mu(u_z + w_x). $$

$q_x$ and $q_z$ are the components of the heat flux vector,

$$ q_x = -kT_x, \qquad q_z = -kT_z. $$

$\mu$ is the molecular viscosity and is computed using the Sutherland law. $k$ is the
thermal conductivity. For turbulent flows, the Baldwin-Lomax (1978) model for eddy
viscosity has been used.
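
Relations (19) to (21) are what a solver uses to move between the conserved vector $W$ and
the primitive variables. A minimal sketch is given below; the function names and the
default $\gamma = 1.4$ are assumptions made for illustration.

```python
def conserved_from_primitive(rho, u, w, p, gamma=1.4):
    """Build W = (rho, rho*u, rho*w, rho*E) as in (19)."""
    e = p / (rho * (gamma - 1.0))          # internal energy, (21)
    E = e + 0.5 * (u * u + w * w)          # total energy, (20)
    return (rho, rho * u, rho * w, rho * E)

def pressure(W, gamma=1.4):
    """Recover the pressure from the conserved vector by inverting (20) and (21)."""
    rho, rho_u, rho_w, rho_E = W
    u = rho_u / rho
    w = rho_w / rho
    e = rho_E / rho - 0.5 * (u * u + w * w)
    return (gamma - 1.0) * rho * e

# Round trip: the pressure used to build W is recovered from it.
W_example = conserved_from_primitive(1.2, 0.5, -0.2, 0.9)
p_example = pressure(W_example)
```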


5.2 Spatial discretization 


The time dependent Navier-Stokes equations (18) are discretized based on the method of
lines in which the spatial derivatives are discretized using the cell centred finite volume
scheme. Applying these conservation equations to one of these cells, we can write

$$ \frac{d}{dt}W_{i,k} + (Q_c + Q_v - D)_{i,k} = 0. \tag{23} $$

In the above, $Q_c$ and $Q_v$ are the convective and viscous flux vectors. This discretization
amounts to central differencing and requires explicit addition of artificial viscosity to
provide (a) a high order background dissipation to avoid the odd-even decoupling and
(b) a second component active in the vicinity of the shock to prevent shock-induced
oscillations. This term is indicated by $D$ in (23) and is described in sub-section 5.3.

5.3 Artificial dissipation 

In the present work, the Jameson-Schmidt-Turkel (Jameson et al 1981) scheme for the
numerical dissipation has been used, which contains a blend of second and fourth
differences of the flow variables. The second difference term damps oscillations in the
vicinity of the shock sensed by a pressure based sensor. The fourth difference term helps
to avoid the odd-even decoupling and provides the background dissipation in the
smooth regions of the flow. This scheme is described below. The Jameson-Schmidt-Turkel
numerical dissipation scheme (Jameson et al 1981; Martinelli 1987) is given as

$$ D_{i,k} = d_{i+1/2,k} - d_{i-1/2,k} + d_{i,k+1/2} - d_{i,k-1/2}, \tag{24} $$

where

$$ d_{i+1/2,k} = \lambda_{i+1/2,k}\bigl[\varepsilon^{(2)}_{i+1/2,k}\,\delta_x W_{i,k} - \varepsilon^{(4)}_{i+1/2,k}\,\delta_x^3 W_{i-1,k}\bigr]. \tag{25} $$

$\delta_x$ is the forward difference operator. The coefficients $\varepsilon^{(2)}$ and $\varepsilon^{(4)}$ are written as

$$ \varepsilon^{(2)}_{i+1/2,k} = k^{(2)}\max(\nu_{i+1}, \nu_i), \tag{26} $$

$$ \varepsilon^{(4)}_{i+1/2,k} = \max\bigl(0, k^{(4)} - \varepsilon^{(2)}_{i+1/2,k}\bigr). \tag{27} $$

The pressure sensor $\nu$ is given as

$$ \nu_i = |p_{i+1,k} - 2p_{i,k} + p_{i-1,k}| \,/\, \bigl(|p_{i+1,k}| + 2|p_{i,k}| + |p_{i-1,k}|\bigr). \tag{28} $$
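
The switching behaviour of (26) to (28) can be seen in a one-dimensional sketch along a
single grid line. The values $k^{(2)} = 1/2$ and $k^{(4)} = 1/64$ used as defaults, the treatment
of the two end cells (sensor set to zero) and the function names are assumptions of this
illustration and are not taken from the present code.

```python
def pressure_sensor(p):
    """Normalised second difference of pressure, (28), along one grid line."""
    n = len(p)
    nu = [0.0] * n                      # end cells: sensor taken as zero (assumption)
    for i in range(1, n - 1):
        num = abs(p[i + 1] - 2.0 * p[i] + p[i - 1])
        den = abs(p[i + 1]) + 2.0 * abs(p[i]) + abs(p[i - 1])
        nu[i] = num / den
    return nu

def jst_coefficients(p, k2=0.5, k4=1.0 / 64.0):
    """Face coefficients eps2 (26) and eps4 (27); entry i refers to face i+1/2."""
    nu = pressure_sensor(p)
    eps2 = [k2 * max(nu[i + 1], nu[i]) for i in range(len(p) - 1)]
    eps4 = [max(0.0, k4 - e2) for e2 in eps2]
    return eps2, eps4

# A pressure jump switches the second-difference term on and the fourth-difference term off.
eps2, eps4 = jst_coefficients([1.0, 1.0, 1.0, 1.0, 2.0, 2.0, 2.0, 2.0])
```

Away from the jump the sensor vanishes, so only the background fourth-difference
dissipation remains active there.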

The Navier-Stokes grids are highly stretched. The grid spacing near the wall in the
direction normal to it is typically of the order of $10^{-5}c$ ($c$ being the airfoil chord) to
resolve the boundary layer, leading to a streamwise aspect ratio of the cells very much
larger than unity in the vicinity of the airfoil. This leads to a large value for artificial
dissipation leading, in turn, to a large drag. In the far field, the situation is reversed, as
a result of which the cell aspect ratio defined as before is less than and often much less
than unity. This disparity leads to very different high frequency modes in the two
directions, and causes difficulty for both the convergence and accuracy of the
steady-state flow computation as well as adversely affecting the multigrid procedure. To
surmount this difficulty, the variable scaling factor $\Lambda$ (Martinelli 1987; Swanson & Turkel
1987) has been used instead of $\lambda$ in (25). For this, $\lambda$ is written as

$$ \lambda_{i+1/2,k} = 0.5\bigl[(\Lambda_x)_{i,k} + (\Lambda_x)_{i+1,k}\bigr], \tag{29} $$

where

$$ \Lambda_x = \phi_x\lambda_x, \qquad \phi_x = 1 + (\lambda_z/\lambda_x)^{\alpha}. $$

$\lambda_x$ and $\lambda_z$ are the scaled spectral radii of the Jacobian matrices of the convective fluxes and
$\alpha$ is a constant. With this, it has been possible to handle highly stretched grids with
significantly improved damping properties.

5.4 Runge-Kutta time stepping 


The semi-discrete equation (23) may be written as

$$ (dW_{i,k}/dt) + \mathcal{R}(W_{i,k}) = 0. \tag{30} $$

$\mathcal{R}$ is the discretized flux term and is known as the residual. A hybrid 5-stage Runge-Kutta
scheme has been used to advance the solution of (30) in time. Following Jameson (1985),
this is written as

$$ \begin{aligned}
W^{(0)} &= W^{(n)}, \\
W^{(m)} &= W^{(0)} - \alpha_m\Delta t\bigl[Q_c(W^{(m-1)}) + Q_v(W^{(0)}) - D^{(m-1)}\bigr], \\
W^{(n+1)} &= W^{(5)}.
\end{aligned} \tag{31} $$

$m$ is the Runge-Kutta stage number (1 to 5 for 5-stage), $n$ the time index and $W^{(0)}$ the
starting solution or the solution at the previous time step. The convective fluxes are
computed at every stage. For time economy, the viscous fluxes, considered in full, are
computed only at the first stage and the numerical dissipation fluxes are computed at
alternate stages employing a combination of present and previous values as follows:

$$ \begin{aligned}
D^{(0)} &= D^{(1)} = D(W^{(0)}), \\
D^{(2)} &= D^{(3)} = \beta D(W^{(2)}) + (1 - \beta)D^{(0)}, \\
D^{(4)} &= \gamma D(W^{(4)}) + (1 - \gamma)D^{(2)}.
\end{aligned} \tag{32} $$

The weighting factors used above are $\beta = 0.56$ and $\gamma = 0.44$. The coefficients $\alpha_m$ are
taken as $\alpha_1 = 1/4$, $\alpha_2 = 1/6$, $\alpha_3 = 3/8$, $\alpha_4 = 1/2$, $\alpha_5 = 1$. It has been shown by Martinelli
(1987) that this scheme has a large parabolic stability limit. Since only the steady state
solution is of interest here, a local time stepping scheme is used to accelerate the
solution process wherein the time steps for each cell are taken to be the maximum
permitted by the local CFL criterion.
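
The staging of (31) and (32) can be sketched for a scalar model equation as follows. The
sketch drops the viscous flux term, treats the unknown as a single number and uses
hypothetical function names; it is meant only to show when the convective term and the
blended dissipation are re-evaluated, not to reproduce the present solver.

```python
import math

ALPHAS = (1.0 / 4.0, 1.0 / 6.0, 3.0 / 8.0, 1.0 / 2.0, 1.0)

def hybrid_rk5_step(w0, q_conv, dissip, dt, beta=0.56, gamma=0.44):
    """One hybrid 5-stage step for dw/dt + q_conv(w) - dissip(w) = 0."""
    w = w0                     # W^(0)
    d = dissip(w0)             # D^(0) = D^(1)
    for stage, alpha in enumerate(ALPHAS, start=1):
        if stage == 3:         # D^(2) = D^(3): blend D(W^(2)) with D^(0)
            d = beta * dissip(w) + (1.0 - beta) * d
        elif stage == 5:       # D^(4): blend D(W^(4)) with D^(2)
            d = gamma * dissip(w) + (1.0 - gamma) * d
        # stage m uses the convective term at W^(m-1) and the frozen or blended D^(m-1)
        w = w0 - alpha * dt * (q_conv(w) - d)
    return w

# Linear decay dw/dt = -2w with no dissipation: one step of size 0.05 from w = 1.
w_new = hybrid_rk5_step(1.0, lambda w: 2.0 * w, lambda w: 0.0, 0.05)
w_ref = math.exp(-0.1)         # exact value, for comparison
```

With the dissipation switched off and a linear residual, the step reduces to the
amplification polynomial $1 + z + z^2/2 + 3z^3/16 + z^4/32 + z^5/128$ with $z = -a\Delta t$,
which is second-order accurate in time; the coefficients are chosen for stability and
damping rather than time accuracy.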

5.5 Implicit residual smoothing 


A second technique that is used to accelerate the solution procedure is that of implicit
residual smoothing, which enlarges the stability range of the basic time stepping scheme,
enabling larger time steps to be taken. Also, the implicit residual smoothing is known to
significantly enhance the smoothing characteristics of the time marching scheme for
use with multigrids (Wigton & Swanson 1990). In two dimensions, the residual
smoothing operation is defined as follows:

$$ (1 - \varepsilon_x\delta_{xx})(1 - \varepsilon_z\delta_{zz})\mathcal{R}^* = \mathcal{R}, \tag{33} $$

where $\mathcal{R}^*$ is the smoothed residual. The smoothing parameters $\varepsilon$ are different for each
cell because of different aspect ratios. The coefficients $\varepsilon$ developed by Martinelli (1987)
are quite effective for highly stretched meshes and have been used in the present scheme.
These locally varying coefficients are given as

$$ \varepsilon_x = \max\left\{ \frac{1}{4}\left[ \left( \frac{CFL}{CFL_0}\,\frac{\lambda_x}{\lambda_x + \lambda_z}\,\phi_x \right)^2 - 1 \right],\; 0 \right\}. \tag{34} $$

$CFL_0$ is the natural Courant-Friedrichs-Lewy (CFL) number of the basic scheme. The
expression for $\varepsilon_z$ can be written in a similar way. While this variable coefficient implicit
procedure requires additional CPU time to compute the $\varepsilon$ values for each cell, this is more than
compensated for by the fact that it allows CFL numbers two to three times larger than
the original scheme. The highest CFL number tried in the present computations is
about 7.5.
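
Each factor of the smoothing operator in (33) amounts to a tridiagonal solve along one grid
direction. The sketch below shows a single factor in one dimension using the Thomas
algorithm. A constant coefficient is used instead of the locally varying one, residuals
outside the line are taken as zero, and the function names are hypothetical; all three are
simplifying assumptions of this illustration.

```python
def apply_operator(r_star, eps):
    """Evaluate (1 - eps*delta_xx) r_star, with zero values assumed outside the line."""
    n = len(r_star)
    out = []
    for i in range(n):
        left = r_star[i - 1] if i > 0 else 0.0
        right = r_star[i + 1] if i < n - 1 else 0.0
        out.append(-eps * left + (1.0 + 2.0 * eps) * r_star[i] - eps * right)
    return out

def smooth_residual(r, eps):
    """Solve (1 - eps*delta_xx) r_star = r by the Thomas algorithm."""
    n = len(r)
    a = -eps                     # sub- and super-diagonal entries
    b = 1.0 + 2.0 * eps          # diagonal entry
    cp = [0.0] * n
    dp = [0.0] * n
    cp[0] = a / b
    dp[0] = r[0] / b
    for i in range(1, n):        # forward elimination
        denom = b - a * cp[i - 1]
        cp[i] = a / denom
        dp[i] = (r[i] - a * dp[i - 1]) / denom
    r_star = [0.0] * n
    r_star[-1] = dp[-1]
    for i in range(n - 2, -1, -1):   # back substitution
        r_star[i] = dp[i] - cp[i] * r_star[i + 1]
    return r_star

# A sawtooth (highest-frequency) residual is strongly damped by the smoothing.
saw = [(-1.0) ** i for i in range(16)]
smoothed = smooth_residual(saw, 1.0)
```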





5.6 Boundary conditions 

The airfoil surface is treated as an adiabatic wall on which no-slip boundary condition 
is applied and zero normal pressure gradient is assumed. The formulation being cell 
centred, the variables are not specified on the surface. This requires considering 
a dummy cell adjacent to the boundary. To apply the wall boundary condition, simple 
extrapolation of pressure to the dummy cell is done and the density is found by 
symmetry. The velocity components are set antisymmetric. 

The far-field boundary condition is based on the use of Riemann invariants for
one-dimensional flow normal to the boundary. The boundary conditions for normal
velocity component and speed of sound are constructed by adding and subtracting the
invariants (for details see Kroll & Jain 1987). For subsonic free streams, the tangential
velocity components and entropy are extrapolated from the interior at an outflow
boundary and are equated to freestream values at the inflow boundary. With this, the
flow variables in the far field are computed.

5.7 Multigrid strategy 

The basic features of the multigrid scheme used here are along the lines developed by 
Jameson (1985). The specific features of the present method can be seen in Singh et al 
(1993) and Kloppmann et al (1993). The Runge-Kutta time marching procedure 
quickly smoothes out the high frequency errors (short waves) on the fine meshes where 
further smoothing operations would become progressively slower. Note here that in 
the context of multigrid, the schemes like Gauss-Seidel, predictor-corrector, Runge- 
Kutta and so on are seen as smoothers rather than solvers. This is because one intends 
to get a good coarse grid approximation from the fine grid solution by having the fine 
grid starting approximation smoothed, i.e., high frequency errors diminished. The 
solution with the remaining error components are passed on to the next coarser mesh 
until that coarsest mesh is reached where the low frequency errors (long waves) of the 
finer meshes are seen as high frequency errors. The basic Runge-Kutta smoother is 
again used to remove these long wave (with respect to the finest mesh level on which the 
solution is sought) errors. 

The scheme may use any number of levels of mesh refinement which is read as input 
in the code. In all the results presented here, the Full Approximation Storage scheme is 
used in which one starts with an initial solution on the finest mesh. In the present 
computations, three or four mesh levels have been used with V or W-cycles. The scheme 
for the residual transfer and correction interpolation are important steps for a good 
multigrid performance. The scheme should be such that the coarse grid correction 
improves the finer grid solution in smooth regions as well as correcting the discontinu¬ 
ity location. Towards this requirement, the flow variables are transferred from fine to 
coarse grid as follows (in the following, the subscript will indicate the grid level 
identifier). 

WMK,W JV lh , (35) 

where $V$ is the cell volume and $h$ and $2h$ refer to the finer and coarser mesh levels, 
respectively. The summation is performed over the four fine cells constituting one 
coarse cell. This volume weighted averaging preserves mass, momentum and energy 
conservation. The residuals are transferred as simple sums of the residuals in the four 
constituent fine cells. The eddy viscosity is computed only on the finest mesh and is 
carried on to the coarser mesh by simple averaging. To update the solution on the 
coarse mesh, the residual is modified by a forcing function defined as 

$$f_{2h} = -\sum R_h(W_h) + R_{2h}\left(W_{2h}^{(0)}\right). \qquad (36)$$

The summation is performed as before. With this, the multistage scheme on the coarser 
mesh is written as 

$$W_{2h}^{(0)} = W_{2h}^{(n)},$$

$$W_{2h}^{(k)} = W_{2h}^{(0)} - \alpha_k \Delta t \left( R_{2h}^{(k-1)} - f_{2h} \right), \quad k = 1, \ldots, m, \qquad (37)$$

$$W_{2h}^{(n+1)} = W_{2h}^{(m)},$$

where $n$ is the time level and $m$ is the number of stages. Note that in the first stage above, the 
entire corrected residual reduces to $\sum R_h(W_h)$, implying that the evolution on the coarse 
mesh is influenced by the residuals on the finer mesh. This procedure is followed until 
the coarsest mesh is reached. Only a single fine grid boundary condition update offers 
computation time economy without sacrificing the accuracy of the solution. The 
corrections computed on the coarse mesh are prolongated to the finer mesh by simple 
linear interpolation. This step introduces high frequency errors causing degradation of 
the convergence which is more pronounced if the grids are highly stretched. To improve 
the situation, the corrections are smoothed before update. An effective way to take into 
account the effect of high grid stretching is to use the implicit residual smoothing. The 
coarse grid correction is transferred to the finer mesh by 

$$W_h = W_h^{(old)} + I_{2h}^{h} \left( W_{2h} - I_{h}^{2h} W_h^{(old)} \right), \qquad (38)$$

where $I_{2h}^{h}$ and $I_{h}^{2h}$ are suitable interpolation operators. 
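
The fine-to-coarse transfers behind (35) and (36) can be illustrated with a minimal sketch. It assumes a structured grid stored as nested lists in which each coarse cell is made of a 2 x 2 block of fine cells; the function names and the sample numbers are hypothetical and are not taken from the code described in the paper.

```python
def sum_four(a):
    """Sum each 2x2 block of fine-cell values into one coarse-cell value."""
    ni, nj = len(a) // 2, len(a[0]) // 2
    return [[a[2 * i][2 * j] + a[2 * i + 1][2 * j]
             + a[2 * i][2 * j + 1] + a[2 * i + 1][2 * j + 1]
             for j in range(nj)] for i in range(ni)]


def restrict_state(W_h, V_h):
    """Volume-weighted average of the fine-grid state, in the spirit of (35)."""
    VW = [[v * w for v, w in zip(v_row, w_row)] for v_row, w_row in zip(V_h, W_h)]
    V_2h = sum_four(V_h)      # coarse cell volume = sum of its four fine cells
    VW_2h = sum_four(VW)
    W_2h = [[vw / v for vw, v in zip(vw_row, v_row)]
            for vw_row, v_row in zip(VW_2h, V_2h)]
    return W_2h, V_2h


def forcing_function(R_h, R_2h_of_W0):
    """f_2h = -sum R_h(W_h) + R_2h(W_2h^(0)), in the spirit of (36)."""
    summed = sum_four(R_h)    # residuals are transferred as simple sums
    return [[-s + r for s, r in zip(s_row, r_row)]
            for s_row, r_row in zip(summed, R_2h_of_W0)]


# One coarse cell built from four fine cells (illustrative values only).
V_h = [[1.0, 2.0], [3.0, 4.0]]
W_h = [[10.0, 20.0], [30.0, 40.0]]
W_2h, V_2h = restrict_state(W_h, V_h)

R_h = [[1.0, 2.0], [3.0, 4.0]]
R_2h = [[7.0]]                # stands in for the coarse residual R_2h(W_2h^(0))
f_2h = forcing_function(R_h, R_2h)
first_stage_residual = R_2h[0][0] - f_2h[0][0]   # equals the summed fine residuals
```

In this sketch the quantity `V_2h * W_2h` equals the sum of `V_h * W_h` over the four fine cells, which is the conservation property mentioned in the text, and the first-stage corrected residual reduces to the summed fine-grid residuals.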

5.8 Results from Navier-Stokes computations 

The multigrid accelerated Navier-Stokes code developed and based on the above 
strategy has been extensively used to compute several flow conditions on a variety of 
airfoils. Table 2 lists the four typical cases of airfoils and the corresponding flow 
conditions for which the computed results will be presented. 

For the cases presented, a 257 x 41 size C-grid with a total of 64 points in the wake or 
321 x 65 size C-grid with a total of 128 points in the wake has been used. The farthest 
grid points are placed at about 12 to 18 chords away from the airfoil and the grid line 
just above the airfoil is about $10^{-5}$ chords away to enable a good resolution of 
the boundary layer. The computations have been performed on the Convex 3820 
computer at the CSIR Centre for Mathematical Modelling and Computer Simulation, 
Bangalore. Depending on the mesh size and the flow conditions, the solutions, 
converged to the level of acceptable tolerance, take just about 5 to 15 min of CPU time. 
The corresponding single grid solution takes about 2-3 hours of CPU time for similar 

Table 2. Cases selected for Navier-Stokes computations. 

Case  Airfoil         Grid      M∞     α (deg)  Re (10^6)  x_tr
1.    CAST-7          257 x 41  0.70   2.00     4.0        7%
                                0.70   2.64     6.0        8%
                                0.70   3.60     6.0        8%
2.    RAE 2822        321 x 65  0.73   2.79     6.5        3%
3.    NACA 66,2-215   257 x 41  0.601  2.0      1.5        64% up, 73% low
4.    NACA 0012       257 x 41  0.6    8.437    9.0        5%
                                0.6    9.0      9.0        5%

x_tr - transition location as % of chord 

and, in most cases, higher levels of residuals. This feature of the code, along with its 
robustness and accurate prediction capabilities, offers itself as a designer’s tool for 
evaluating the characteristics of the airfoils. 

5.8a CAST-7 airfoil : This is an 11.8% thick supercritical airfoil with moderate rear 
loading, designed by the Dornier Company. This airfoil has been extensively studied by 
this author during the early phase of the work with this code without multigrid (Singh 
1990). Table 3 shows the details of the studies made on this airfoil and presented in this 
report. 

The single level computation and 3-level grid sequencing computations at serial 
numbers 1 and 2 are included from the point of view of illustrating the effect of factors 
other than multigrid as will be explained. The single grid computation (no. 1) refers to 


Table 3. Comparison of performance of Navier-Stokes code with and without multigrid 
(Cast-7 airfoil). 

No.  Computational strategy   c_l     c_d      Work units  Convergence level                              CFL  Remarks
1.   Single grid              0.8197  0.01905  6500        ≈ 1.97 x 10^-1                                 0.6  4-stg RK, no IRS, std JST
2.   3-level grid sequencing  0.8175  0.01899  3437        -                                              0.6  4-stg RK, no IRS, std JST
3.   Single grid              0.8117  0.01898  3000        ≈ 10^-5                                        1.6  5-stg hybrid RK, VC-JST, VC-IRS
4.   3-level MG V-cycle       0.8174  0.01926  147         2.06 x 10^-4 for stabilised c_l                1.6  5-stg hybrid RK, VC-JST, VC-IRS
5.   3-level MG V-cycle       0.8172  0.01921  496         9.51 x 10^-8 for convergence tolerance 10^-7   1.6  5-stg hybrid RK, VC-JST, VC-IRS

4- and 5-stg RK - 4- and 5-stage standard Runge-Kutta scheme 
IRS - implicit residual smoothing 
VC - variable coefficient (eigenvalue weighted) 
JST - Jameson-Schmidt-Turkel artificial dissipation 
CFL - normalised CFL number defined as CFL/CFL_0 (see (34)) 



the original status of the code when the work on the development of the accelerated 
Navier-Stokes code was started under the NAL-DLR (Gottingen) collaboration. 
Computation no. 2 refers to the first stage of the acceleration work using grid 
sequencing (Singh 1990). Both these computations use a standard 4-stage Runge- 
Kutta scheme with the Jameson-Schmidt-Turkel dissipation (Jameson et al 1981). 
Also, to stabilise the solution properly, a normalised CFL number of 0.6 has been used 
which gives slow convergence rates in these two cases. The single grid version at serial 
number 3 and the multigrid version have additional features like eigenvalue scaled 
dissipation coefficients, hybrid 5-stage Runge-Kutta time integration and variable 
coefficient implicit residual smoothing whose effects are described in §§ 5.3 to 5.5. These 
elements give the multigrid version of the code not just the weighted dissipation but 
a higher CFL number too. The normalised CFL used in the presented multigrid example is 1.6. 
These points should be borne in mind while making comparisons based on the data in 
table 3. 

Figure 9 shows the comparison of the pressure distributions obtained using single 
grid (dashed line), 3-level grid sequencing results (shown by *) and 3-level multigrid 
V-cycle results (full line). This study presents the validation test to check the faithful 
reproduction of earlier verified and well-studied single mesh and 3-level grid 
sequencing computations (Singh 1990) by the present multigrid computations. Under 
these flow conditions, the airfoil develops a shock of moderate strength on the upper 
surface at around 35-37% of chord. The agreement of $C_p$-distributions obtained from 
the three computations is almost total. Figure 10 shows a similar comparison of the 
skin friction (normalised with respect to free stream quantities) obtained from the single 
grid version (dashed line), the grid sequencing version (no. 2 with *) and the multigrid 
version (no. 4, full line). As in the case of $C_p$, the agreement of skin friction obtained 
from these versions is also nearly total. The skin friction is a more sensitive quantity 



Figure 9. Comparison of pressure distribution with and without multigrid, N-S 
solution, Cast-7, $M = 0.7$, $\alpha = 2^\circ$. 

than $C_p$ and these agreements confirm the general reliability of the codes as well as that 
of the multigrid implementation. Figure 11 shows the comparison of convergence rates 
of the single grid computation (case 3) and multigrid computations. This figure shows 
that to achieve a drop of five orders in the normalised maximum residual, the multigrid 
requires approximately 280 work units, whereas the single grid computation requires 
approximately 2800 work units. Figure 12 shows the comparison of evolution of the 
lift. It is seen that while the single grid computation takes about 1200-1300 work units 
to reach stabilised lift level (but with higher residue level), the multigrid computation 
shows a rapid and smooth evolution stabilising in about 200 work units with one order 
smaller residue level. This rapid stabilisation of lift coefficient, which is sustained 
without any oscillation, indicates a reliable, robust and stable computation. 



Figure 11. Comparison of convergence history, N-S solution, Cast-7, $M = 0.7$, $\alpha = 2^\circ$. 

Figure 12. Comparison of lift evolution, N-S MG solution, Cast-7, $M = 0.7$, $\alpha = 2^\circ$. 


Before proceeding further to discuss a few computations and their comparisons with 
experimental data, it would be of interest to discuss here the effects of V and 
W multigrid cycles and the number of smoothings at each level on the performance of 
the multigrid technique. Figure 13 shows the effect of V and W-cycles on the convergence 
history of flow past RAE 2822 airfoil. The two computations have all other inputs 
identical. A marginal gain in convergence rate is seen with W-cycle (Kloppmann et al 
1993). Figure 14 shows the effect of number of smoothings performed at each grid level 
and also the effect of V and W-multigrid cycle strategies for flow past NACA 0012 
airfoil. The numbers in the legend indicate the strategy used. For example, “case 1:3-lvl 
V-cycle 8/4/2 & 0/0/0” indicates that the computation in case 1 uses 3 level V-cycle with 
8 smoothings at the coarsest level, 4 at the medium level and 2 at the finest level in the 
injection phase and no smoothing during the prolongation phase. This experiment 
showed that most computations, especially the ones using W-cycle, failed to converge if 
no smoothing was performed during the prolongation phase. The figure also indicates 
faster drop of residual with larger number of smoothings performed on the coarser 



Figure 13. Effect of V- and W-cycles on the multigrid convergence (horizontal axis: work units). 





Multigrid acceleration technique in fluid dynamics 


Figure 14. Effect of number of smoothings on the multigrid convergence (horizontal axis: work units). Legend: 

Case  Multigrid strategy
1     3 lvl, V-cyc, 8/4/2 & 0/0/0
2     3 lvl, V-cyc, 32/16/4 & 0/0/0
3     3 lvl, W-cyc, 8/4/2 & 1/1/1
4     3 lvl, V-cyc, 32/16/4 & 1/1/1
5     4 lvl, W-cyc, 32/16/4/2 & 1/1/1/1
9     4 lvl, W-cyc, 1/1/1/1 & 1/1/1/1
10    3 lvl, W-cyc, 3/2/1 & 1/1/1
11    4 lvl, W-cyc, 4/3/2/1 & 1/1/1/1


levels. The fastest convergence is obtained with the combination used in case 5 whereas 
the residual is driven to the lowest level in case 1. 
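
The difference between the V- and W-cycle strategies compared above lies in the order in which the grid levels are visited. The following is a minimal sketch of the usual textbook recursion, not of the cycling logic of the code described here; the function name `cycle_schedule` and the parameter `gamma` (number of coarse-grid visits per level, 1 for a V-cycle and 2 for a W-cycle) are assumptions made for illustration.

```python
def cycle_schedule(levels, gamma):
    """Return the order in which grid levels are visited in one multigrid cycle.

    Level 0 is the finest mesh and level `levels - 1` the coarsest.
    gamma = 1 gives a V-cycle, gamma = 2 a W-cycle.
    """
    coarsest = levels - 1

    def visit(level):
        if level == coarsest:
            return [level]
        path = [level]
        for _ in range(gamma):
            for lvl in visit(level + 1):
                # Back-to-back visits to the same level are merged into one.
                if path[-1] != lvl:
                    path.append(lvl)
        path.append(level)          # return to this level for the correction
        return path

    return visit(0)


v_cycle_3 = cycle_schedule(3, 1)    # fine -> coarse -> fine
w_cycle_3 = cycle_schedule(3, 2)    # revisits the coarsest level before returning
w_cycle_4 = cycle_schedule(4, 2)
```

With three levels the sketch gives the sequence 0, 1, 2, 1, 0 for the V-cycle and 0, 1, 2, 1, 2, 1, 0 for the W-cycle, so the W-cycle spends proportionally more work on the coarse meshes within each cycle.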

Next, the results of some computations for transonic flow past airfoils will be 
presented and compared with the experimental data to demonstrate the robustness and 
accuracy of the code and the efficiency obtained due to multigrid technique. 

Figures 15 and 16 show the comparisons of the predicted pressure distributions with 
experimental data (Stanewski et al 1979) for flow past Cast-7 airfoil at $M = 0.7$ at angles 
of attack of (i) 2.64° and (ii) 3.6°. The Reynolds number for both cases is about $6 \times 10^6$ 
with transition forced at 8% chord. The agreement for the case (i) is good everywhere 
including the shock location and its intensity. The agreement for the case (ii) is also 
good but the computed shock intensity is somewhat stronger. The grid for this case is 
a 257 x 41 C-grid with 64 points in the wake and 193 points on the airfoil. The lateral 
separation of the grid is such that there are nearly 25 grid lines inside the boundary 
layer and the grid lines in the outer flow are sparsely located. This indicates that 
if the flow inside the boundary layer is well resolved, the outer mesh density plays 
a secondary role. This fact is well borne out not just by the good pressure prediction but 
also by the well predicted drag. Table 4 summarises the quantitative comparison for 
these two cases. 

5.8b RAE 2822 airfoil : For this study, the standard transonic test case number 
9 from the AGARD set of data (Stanewski et al 1979) is selected. This computation has 
been carried out on a fairly fine mesh of size 321 x 65. The experimental incidence for 
this case is 3.19°. The present computations have been made at a corrected angle of 
attack of 2.79° as suggested by the experimenters (Stanewski et al 1979) to accommodate 
the effect of wind tunnel wall interference. Figure 17 shows the comparison of 
computed pressure distribution with the AGARD data (Stanewski et al 1979). The 
agreement is seen to be good almost everywhere including at the peak plateau and 



J P Singh 

Figure 15. Comparison of $C_p$-distribution, Cast-7, $M = 0.7$, $\alpha = 2.64^\circ$, 
$Re = 6 \times 10^6$. 


shock location. It may be pointed out that very often in the literature, a significant 
difference is seen in the peak plateau region. The experimental values of lift and drag 
coefficients ($c_l = 0.803$ and $c_d = 0.0168$ respectively) agree well with the predicted values 
($c_l = 0.799$ and $c_d = 0.019$ respectively). As seen in figure 18, the comparison of skin 



Figure 16. Comparison of $C_p$-distribution, Cast-7, $M = 0.7$, $\alpha = 3.6^\circ$, $Re = 6 \times 10^6$. 

Table 4. Quantitative comparison with experimental data, Cast-7 airfoil. 

Case  M∞    α (deg)  Re (10^6)  x_tr  c_l (present)  c_l*   c_d (present)  c_d*
(i)   0.70  2.64     6.0        8%    0.8809         0.867  0.02548        0.01879
(ii)  0.70  3.60     6.0        8%    1.0289         1.027  0.0393         0.03286

* Reference - Stanewski (1993b) 


friction (normalised with respect to the quantities at the boundary layer edge to be 
in conformity with the experimental data) is also in good agreement with the experimental 
data. 

5.8c NACA66,2-215 low drag airfoil : This is one of the low drag airfoils (now 
known as NLF airfoils) designed at the erstwhile NACA during the 1940s. Its 
characteristics were studied and reported by Graham et al (1945). The airfoil was designed 
to perform at high subsonic Mach numbers. The transition points on the upper and 
lower surfaces, as given in table 2, were fixed at locations approximately predicted using 
the Granville criterion as the related information was not available. Figure 19 shows 
the comparison of the computed pressure distribution with experimental data 
(Graham et al 1945). The agreement is seen to be good. Figure 20 shows the corresponding 
convergence history. The residual drops to about $3 \times 10^{-5}$ in about 400 work 
units. 



Figure 17. Comparison of $C_p$-distribution (plotted against $x/c$). N-S MG solution, RAE 2822, 
$M = 0.73$, $\alpha = 2.79^\circ$, $Re = 6.5 \times 10^6$. 

Figure 18. Comparison of skin-friction. N-S MG solution, RAE 2822 airfoil, $M = 0.73$, 
$\alpha = 2.79^\circ$, $Re = 6.5 \times 10^6$. 

5.8d NACA 0012 airfoil: This airfoil was chosen to demonstrate the application of 
the multigrid Navier-Stokes code to compute flows at high angles of attack with large 
trailing edge separation. The Baldwin-Lomax model for turbulence is not adequate 
when the flow is largely separated. With the present experience, the Granville correction 
to this model appears to give results in much better agreement under separated 
flow conditions (Singh et al 1993). However, under the nonequilibrium conditions of 
shock induced separation, this model does not seem to be as effective as under the 
conditions of the trailing edge flow separation at low speeds. Figure 21 shows the 
comparison of $C_p$ distribution at $M_\infty = 0.6$, $\alpha$(corrected) $= 8.437^\circ$, $Re = 9 \times 10^6$ 
and transition fixed at 5% chord. This case corresponds to the stall condition with 



Figure 19. Comparison of pressure distribution (plotted against $x/c$), NACA 66, 2-215 airfoil, 
$M = 0.601$, $\alpha = 2^\circ$, $Re = 1.5 \times 10^6$. 

Figure 20. Convergence history, NACA 66, 2-215 airfoil, $M = 0.6$, $\alpha = 2^\circ$, $Re = 1.5 \times 10^6$. 


$\alpha_{exp} = 9.86^\circ$. The results are compared with the experimental data of Harris (1981). 
The predicted pressure distribution with the Granville correction is seen to be in 
better agreement with the experimental data than that predicted using the original 
Baldwin-Lomax model. Figure 22 shows the computed skin friction (quantities normalised 
with respect to free stream values). A separation bubble between about 16% and 
23% of chord is seen with a brief reattachment followed by separation on the entire 
upper surface. Figure 23 shows streamline pattern around this airfoil under the same 
conditions but at slightly higher (corrected) angle of attack of 9°. The separation 
bubble, which was too small at $\alpha = 8.437^\circ$ to be represented well graphically, 
is slightly enlarged in the present case and can be seen clearly in 
figure 24. The flow reattaches briefly after the bubble and separates again over the 

Figure 22. Computed skin friction, NACA 0012 airfoil, $M = 0.6$, $\alpha$(corr) $= 8.437^\circ$, 
$Re = 9 \times 10^6$. 


entire upper surface. The skin friction inset in figure 24 clearly shows the location of the 
bubble and the separated flow on the upper surface. The flow evolution for this 
computation is wavy with lift coefficient oscillating over a couple of hundreds of work 
units with the amplitude of oscillation of lift continuously reducing and finally settling 
down. This indicates that the flow has still not become unsteady and the results are 
fairly acceptable for design estimates as indicated by the stabilised lift. The normalised 
residual could not be driven below about $10^{-2}$. 


6. Conclusions 

The multigrid technique is, perhaps, the best known acceleration technique for iterative 
solution procedures. It offers the possibility of a realistic turn-around time for many 
computations which otherwise take hours and days on most of the commonly used 



Figure 23. Streamline pattern, N-S MG solution, NACA 0012 airfoil, $M = 0.6$, 
$\alpha$(corr) $= 9^\circ$ (Granville correction to B-L model, stall condition). 

Figure 24. Magnified view of the separation bubble of figure 23. The inset shows 
the skin friction. The reversed flow inside the bubble and the fully separated flow 
downstream is clear from the negative skin friction. 


computing machines. Its application to three common fluid dynamical problems is 
demonstrated and the quantitative gains are seen to be in the range of 3 to 10 times in 
terms of work units. The accelerated codes, which have been demonstrated to be fairly 
robust and accurate, offer themselves as designer’s tools for evaluating the characteristics 
of the airfoils, especially when used on high speed computing machines. 

Numerical simulation of a 2-D jet-crossflow interaction 
related to film cooling applications: Effects of blowing rate, 
injection angle and free-stream turbulence 

Abstract. The aerodynamics of a coolant jet in a hot crossflow in an 
environment representative of the gas turbine practice, is numerically 
investigated for film cooling applications. The time-dependent, mass- 
averaged Navier-Stokes equations coupled with the compressible form of 
a two-equation low-Reynolds number ($k$-$\varepsilon$) model are solved based on an 
explicit finite volume formulation. The computed flow-field and surface 
temperature distributions along with the turbulence quantities are presented 
to illustrate the variation of flow-physics and heat transfer phenomena 
which occur in a jet-crossflow interaction with changing blowing rates, 
slot angles, and inlet free-stream turbulence levels. 

Keywords. Numerical simulation; slot-film cooling; turbulent flow. 


1. Introduction 

Components of a modern aircraft gas turbine are exposed to extremely high temperature. 
Control of both temperature level and gradient is needed for reliable operation 
and prolonged useful life of these components. Film cooling, one of the most popular 
methods, is widely used for this purpose. A cold secondary fluid is locally injected into 
the hot crossflow through a slot or a series of holes at one or more discrete locations 
along the surface to be protected. The secondary fluid is expected to create an insulating 
film which would reduce the convective heat transfer rate from the hot gas stream to the 
exposed surface. The flow-field near the injected coolant jet is very complex with high 
gradients of velocity, temperature, turbulence intensity and a local flow reversal 
(figure 1). 

Studies on film cooling are widespread in the literature. Unfortunately, most of these 
investigations are only devoted to the analysis of surface heat transfer phenomena. 
Aerodynamic aspects related to the jet-mainstream interactions and the coolant layer 
formation are not yet completely understood. Numerical analysis, which provides 
detailed information of flow-field and heat transfer phenomena, can improve the 

Figure 1. A general description of flow-field. 


understanding of film cooling process when used in conjunction with experimental 
studies. 

The highly complex flow-field created by the secondary fluid injection, depends on 
a wide variety of parameters such as the injection angle ($\alpha$), the blowing ratio ($m$), the 
density ratio (DR), the free stream turbulence intensity and the state of incoming 
boundary layer. An excellent review by Goldstein (1971) describes some of these factors 
and earlier experimental investigations on film cooling. 

There are several experimental studies that measure the adiabatic wall temperature 
to define the film cooling effectiveness ($\eta$) for different slot and hole injection configurations 
and also for different density and blowing ratios (Papell 1960; Petersen et al 
1977; Foster & Lampard 1980; Sinha et al 1990). The flow-field generated by injection 
from a row of holes is more complicated than that of slot injection because of three- 
dimensionality. Aerodynamic investigations related to the film cooling rarely appear in 
the open literature. Among the very few, Andreopoulos & Rodi (1984) and Pietrzyk 
et al (1989) made a detailed turbulence study and aerodynamic description of a jet in 
a crossflow. 

The free-stream turbulence is a strong influencing parameter for the turbine blade 
cooling application. Experimental investigations (Brown & Minty 1975; Ko & Liu 
1980) indicated a loss of cooling effectiveness with an increase of free-stream turbulence 
for both slot and hole injection due to greater mixing of the injected secondary air with 
the mainstream. Kadotani & Goldstein (1979) reported the effects of free-stream 
turbulence, both in intensity and scale, on a turbulent jet. They concluded that the 
free-stream intensity and scale affected a number of parameters such as the boundary 
layer thickness, the height of the jet penetration and the mixing between the mainstream 
and the injected flow. Jumper et al (1991) pointed out that a strong interaction with 
the cooling film would be expected as the free-stream turbulence levels are raised to 
a level comparable to that in the film. They concluded that the high turbulence levels of 
14-17% could result in significant reduction in the film cooling effectiveness compared 
to the data bases with low turbulence level. Thus, the use of the data base with low 
free-stream turbulence would not be appropriate in assessing heat transfer phenomena 
for turbomachinery applications. 

In the present study, the low-Reynolds number compressible form of a two-equation 
model, coupled with Favre-averaged mean flow equations (Favre 1965), is implemented 
into a purely explicit scheme which is applicable for a wide range of Mach numbers. 
There are limited attempts which incorporate the higher order turbulence models into 
an explicit solution, as it might have been thought that the stiffness associated with the 
explicit treatment is mainly due to the large source terms in the transport equations of 
turbulence models. But, the analysis of Kunz & Lakshminarayana (1991) illustrates 
that the slower convergence rates that occur when explicit schemes are used to compute 
turbulent flows, are primarily due to the hyperbolic and parabolic stability constraints 
associated with the high grid clustering, and not due to the stiffness associated with 
large source terms in the transport equations of the turbulence models. Numerical 
results of flow and thermal fields along with turbulence quantities are presented to 
illustrate the effects of free-stream turbulence in a jet-crossflow interaction for film 
cooling applications in an environment representative of gas turbine practice. The 
variation of aerodynamics, shifting positions of flow reversal and change in film cooling 
effectiveness due to the change in blowing rate and injection angles are also discussed. 
The analysis is limited to a two-dimensional simulation. 


2. Governing equations and turbulence models 


In the present analysis, the density-weighted time-averaging (Favre 1965) is used. The 
density-weighted averaging leads to a simpler form of the governing equations with 
terms that are more amenable to physical interpretation. This is used for velocity 
components, temperature, turbulent kinetic energy and dissipation rate, whereas, the 
conventional time averaging is used for pressure, density, stress tensor and heat flux 
vector (Cebeci & Smith 1974). The effects of fine scale turbulence are simulated by 
incorporating a two-equation model of turbulent kinetic energy $k$, and turbulent 
dissipation $\varepsilon$ and by specifying a turbulent Prandtl number $Pr_t = 0.9$. The low-Reynolds 
number terms of Lam-Bremhorst (1981) are used here in order to improve near wall 
modelling. 

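The two-dimensional, time-dependent, mass-averaged compressible Navier-Stokes 
equations, along with two additional equations for $k$ and $\varepsilon$, can be expressed for each 
discrete quadrilateral cell in the integral form as 

$$\frac{\partial}{\partial t}\int_{\Omega} U\,d\Omega + \int_{S} Q\,dS = \int_{\Omega} H\,d\Omega, \qquad (1)$$

where 

$$U = \left[\rho,\ \rho u,\ \rho v,\ \rho e,\ \rho k,\ \rho\varepsilon\right]^{T}$$

and $Q = (F, G)$ is the flux vector. 

The total stress tensor and the heat flux vector are given in Cartesian coordinates as: 

$$\tau_{ij} = \mu\left[\left(\frac{\partial u_i}{\partial x_j} + \frac{\partial u_j}{\partial x_i}\right) - \frac{2}{3}\,\delta_{ij}\,\frac{\partial u_k}{\partial x_k}\right] + \tau_{t,ij} \qquad (3)$$

and 

$$q_i = -\frac{\mu c_p}{Pr}\,\frac{\partial T}{\partial x_i} + q_{t,i}. \qquad (4)$$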

In the present study, the laminar viscosity coefficient $\mu$ is assumed to be a function of 
temperature only, and is evaluated following Sutherland’s law. The perfect gas equation 
of state, $p = \rho R T$, is also considered to be applicable. 
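
As a minimal sketch of these two closures, the functions below evaluate Sutherland's law and the perfect gas relation. The reference values used as defaults (viscosity 1.716e-5 Pa s at 273.15 K, Sutherland constant 110.4 K, gas constant 287.05 J/(kg K)) are the commonly quoted values for air and are assumptions here; the constants actually used in the paper are not given in the text.

```python
def sutherland_viscosity(T, mu_ref=1.716e-5, T_ref=273.15, S=110.4):
    """Laminar viscosity [Pa s] as a function of temperature T [K] only.

    Defaults are commonly quoted values for air (assumed, not from the paper).
    """
    return mu_ref * (T / T_ref) ** 1.5 * (T_ref + S) / (T + S)


def perfect_gas_pressure(rho, T, R=287.05):
    """Perfect gas equation of state p = rho * R * T, with R for air assumed."""
    return rho * R * T


mu_cold = sutherland_viscosity(273.15)   # reproduces the reference viscosity
mu_hot = sutherland_viscosity(600.0)     # viscosity of a gas rises with temperature
p = perfect_gas_pressure(1.0, 300.0)     # density 1 kg/m^3 at 300 K
```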




In the preceding expressions, $\tau_{t,ij}$ and $q_{t,i}$ represent the turbulent stress tensor and heat flux 
vector, which are to be evaluated for the closure of (1). Incorporating an eddy viscosity 
formulation, the turbulence stress tensor and heat flux vector are given by: 

$$\tau_{t,ij} = \mu_t\left[\left(\frac{\partial u_i}{\partial x_j} + \frac{\partial u_j}{\partial x_i}\right) - \frac{2}{3}\,\delta_{ij}\,\frac{\partial u_k}{\partial x_k}\right] - \frac{2}{3}\,\delta_{ij}\,\rho k \qquad (5)$$

and 

$$q_{t,i} = -\frac{\mu_t c_p}{Pr_t}\,\frac{\partial T}{\partial x_i}, \qquad (6)$$


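where the eddy viscosity is 

$$\mu_t = c_\mu f_\mu \left(\rho k^2/\varepsilon\right). \qquad (7)$$

The production term $P_k$ is expressed in Cartesian coordinates as: 

$$P_k = \tau_{t,xx}\,\frac{\partial u}{\partial x} + \tau_{t,xy}\left(\frac{\partial u}{\partial y} + \frac{\partial v}{\partial x}\right) + \tau_{t,yy}\,\frac{\partial v}{\partial y}. \qquad (8)$$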

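Here, $c_\mu$, $c_{\varepsilon 1}$, $c_{\varepsilon 2}$, $\sigma_k$ and $\sigma_\varepsilon$ are the model constants; $f_\mu$, $f_{\varepsilon 1}$ and $f_{\varepsilon 2}$ are the damping 
functions used to simulate the low-Reynolds number effects. For the Lam-Bremhorst 
($k$-$\varepsilon$) model, considered in the present analysis, they are as follows: 

$$f_\mu = \left[1 - \exp(-0.0165\,Re_y)\right]^2 \left(1 + 20.5/Re_t\right),$$

$$f_{\varepsilon 1} = 1 + \left(0.05/f_\mu\right)^3,$$

$$f_{\varepsilon 2} = 1 - \exp(-Re_t^2), \qquad (9)$$

$$c_\mu = 0.09;\quad c_{\varepsilon 1} = 1.44;\quad c_{\varepsilon 2} = 1.92;\quad \sigma_k = 1.0 \quad\text{and}\quad \sigma_\varepsilon = 1.3.$$

On a solid wall, $k = 0$ and $\varepsilon = (\mu/\rho)(\partial^2 k/\partial y^2)$. 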
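
To show how the damping functions behave, the sketch below evaluates the Lam-Bremhorst functions and the eddy viscosity of (7). It assumes the usual definitions of the two turbulence Reynolds numbers for this model, Re_y = rho sqrt(k) y / mu and Re_t = rho k^2 / (mu eps), which are not spelled out in the text above; the function names and sample values are hypothetical.

```python
import math

C_MU = 0.09  # model constant c_mu quoted in the text


def lam_bremhorst_damping(Re_y, Re_t):
    """Return (f_mu, f_eps1, f_eps2) for given turbulence Reynolds numbers.

    Re_y and Re_t must be positive; their definitions are assumed (see lead-in).
    """
    f_mu = (1.0 - math.exp(-0.0165 * Re_y)) ** 2 * (1.0 + 20.5 / Re_t)
    f_eps1 = 1.0 + (0.05 / f_mu) ** 3
    f_eps2 = 1.0 - math.exp(-Re_t ** 2)
    return f_mu, f_eps1, f_eps2


def eddy_viscosity(rho, k, eps, f_mu):
    """Eddy viscosity mu_t = c_mu * f_mu * rho * k^2 / eps, as in (7)."""
    return C_MU * f_mu * rho * k * k / eps


# Far from the wall the functions tend to one; close to the wall f_mu is
# strongly reduced, which damps the eddy viscosity.
far_f_mu, far_f1, far_f2 = lam_bremhorst_damping(1.0e4, 1.0e4)
near_f_mu, near_f1, near_f2 = lam_bremhorst_damping(1.0, 1.0)
mu_t_far = eddy_viscosity(1.2, 1.0, 10.0, far_f_mu)
```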

It should be noted that, although the k-e equations have been cast in compressible 
form, the modelling assumptions invoked here are essentially those for incompressible 
flow. The terms containing density fluctuation and pressure diffusion are neglected. 


3. Numerical procedure 


The unsteady equations (1) are marched in time using the explicit, second order 
accurate time-split, cell centred finite volume scheme based on MacCormack & Paullay 
(1972) until the steady state is achieved. The advantage of a time-split system is that 
the computation proceeds with larger time increments as the stability criteria are less 
stringent. 

In order to maintain stability in the presence of steep gradient of flow variables and 
to suppress any odd-even decoupling in the numerical solution, artificial dissipation 
terms are added to the finite volume formulation. Artificial dissipation used here is 
similar to that derived by Jameson et al (1981), which is a blend of second and fourth 
order differences with coefficients that depend on local pressure gradients. Thus, the 
artificial dissipation is introduced in the present formulation by defining an effective 
numerical flux across the cell face as 

$$F^{*}_{i+1/2,j} = F_{i+1/2,j} - D_{i+1/2,j}, \qquad (10)$$

where $F_{i+1/2,j}$ is the physical flux and $D_{i+1/2,j}$ is the dissipation term, given by 

$$D_{i+1/2,j} = \sigma_{i+1/2,j}\left[\varepsilon^{(2)}_{i+1/2,j}\left(U_{i+1,j} - U_{i,j}\right) - \varepsilon^{(4)}_{i+1/2,j}\left(U_{i+2,j} - 3U_{i+1,j} + 3U_{i,j} - U_{i-1,j}\right)\right].$$

The scaling factor is 

$$\sigma_{i+1/2,j} = \max\left(\Omega_{i+1,j}/\Delta t,\ \Omega_{i,j}/\Delta t\right).$$

The coefficients $\varepsilon^{(2)}$ and $\varepsilon^{(4)}$ use the pressure as a sensor for shocks and stagnation 
points, and they are defined as 

$$\varepsilon^{(2)}_{i+1/2,j} = k^{(2)} \min\left[\max\left(\nu_{i+1,j}, \nu_{i,j}\right), 0.5\right] \cdot \min\left(M_{i+1/2,j}/M_\infty, 1\right),$$

$$\varepsilon^{(4)}_{i+1/2,j} = \max\left[0, \left(k^{(4)} \min\left(M_{i+1/2,j}/M_\infty, 1\right) - \varepsilon^{(2)}_{i+1/2,j}\right)\right],$$

$$\nu_{i,j} = \frac{\left|p_{i+1,j} - 2p_{i,j} + p_{i-1,j}\right|}{\left|p_{i+1,j}\right| + 2\left|p_{i,j}\right| + \left|p_{i-1,j}\right|},$$

where $k^{(2)}$ and $k^{(4)}$ are constants set at 1/4 and 1/256 respectively. To reduce the effect of 
artificial dissipation within the viscous region of flow, a function of local Mach number 
normalised with respect to the free-stream Mach number, the maximum value of which 
is limited to unity, is used here. Similar dissipation terms are defined in the j-direction. 
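
The blend of second and fourth differences can be illustrated in one dimension. The sketch below is an assumption-laden illustration and not the authors' implementation: it works on a single scalar variable stored in a list, takes the scaling factor and the Mach number factor min(M/M_inf, 1) as plain arguments, and uses hypothetical function names.

```python
K2 = 1.0 / 4.0     # constant k(2) quoted in the text
K4 = 1.0 / 256.0   # constant k(4) quoted in the text


def pressure_sensor(p, i):
    """Normalised second difference of pressure at cell i (shock sensor)."""
    num = abs(p[i + 1] - 2.0 * p[i] + p[i - 1])
    den = abs(p[i + 1]) + 2.0 * abs(p[i]) + abs(p[i - 1])
    return num / den


def face_dissipation(U, p, i, sigma=1.0, mach_factor=1.0):
    """Dissipation term at face i+1/2; needs cells i-1 .. i+2 to exist."""
    nu = max(pressure_sensor(p, i + 1), pressure_sensor(p, i))
    eps2 = K2 * min(nu, 0.5) * mach_factor
    eps4 = max(0.0, K4 * mach_factor - eps2)   # switched off near shocks
    second_diff = U[i + 1] - U[i]
    fourth_diff = U[i + 2] - 3.0 * U[i + 1] + 3.0 * U[i] - U[i - 1]
    return sigma * (eps2 * second_diff - eps4 * fourth_diff)


# Smooth (linear) data: the sensor vanishes and so does the dissipation.
smooth = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
d_smooth = face_dissipation(smooth, smooth, 2)

# A jump between cells 2 and 3: the second-difference term takes over.
step = [1.0, 1.0, 1.0, 10.0, 10.0, 10.0]
d_step = face_dissipation(step, step, 2)
```

For the step data the sensor exceeds the cap of 0.5, so the second-difference coefficient becomes 0.125 and the fourth-difference coefficient is clipped to zero, which is the intended behaviour near a discontinuity.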
4. Grid generation 

The geometry consists of a flat plate with a slot of width D = 6-35 mm, represented in 
figure 2. An algebraically generated non-uniform Cartesian computational mesh, 
consisting of 60 x 50 grid points in x and y directions respectively, is used. For 
computational purposes, the geometry is divided into the following three regions: (i) the 
region near the slot where most of the interaction takes place; (ii) the region between the 
inlet and the beginning of the interaction zone; and (iii) the rest of the computational 
domain. The grid in the x-direction is generated judiciously by placing a number of fine 
uniform grid points in the interaction region, within the slot and stretching the grid 
towards the inlet and the exit. A fine exponentially stretched mesh spacing is employed 
in the region near the wall for resolving the viscous layer and a coarse uniform mesh 
spacing is used in the outer region. The first mesh point above the plate surface is 
chosen such that the average y + is of the order of unity. 
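
A wall-normal point distribution of this kind, fine and stretched near the wall and uniform further out, might be generated as in the sketch below. The geometric growth law, the function name and every numerical value are assumptions chosen for illustration; the actual stretching function and spacings of the 60 x 50 mesh are not given in the text.

```python
def wall_normal_nodes(dy_wall, ratio, n_stretched, dy_outer, height):
    """Return node heights from the wall (0.0) to `height`.

    The spacing starts at dy_wall, grows by `ratio` per cell for at most
    n_stretched cells (capped at dy_outer), then stays uniform at dy_outer.
    """
    y = [0.0]
    dy = dy_wall
    for _ in range(n_stretched):
        if y[-1] + dy >= height:
            break
        y.append(y[-1] + dy)
        dy = min(dy * ratio, dy_outer)
    while y[-1] + dy_outer < height:
        y.append(y[-1] + dy_outer)
    y.append(height)            # the last cell may be shorter than dy_outer
    return y


# Hypothetical values: first spacing 1e-5 m, growth ratio 1.2, domain 0.1 m.
nodes = wall_normal_nodes(1.0e-5, 1.2, 30, 5.0e-3, 0.1)
spacings = [b - a for a, b in zip(nodes[:-1], nodes[1:])]
```

In practice the first spacing would be adjusted until the resulting average y+ of the first mesh point is of the order of unity, as stated above.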


5. Boundary conditions 

At the inlet, the total pressure, total temperature and the flow angle are specified; the
static pressure is extrapolated from the interior and the other variables at the inlet
are updated accordingly at each time step. The exit plane is positioned far downstream of
the injection (100 D) so that a zero-gradient condition can be applied for all
variables at this plane. On solid surfaces, no-slip conditions are imposed along with
a vanishing normal temperature gradient to simulate the adiabatic wall condition. The
upper boundary of the computational domain is chosen sufficiently far from the plate
that a 'slip-wall' type boundary condition is appropriate. Thus, a slip
condition is imposed by setting the v-component of velocity and the normal gradients of all
other dependent variables to zero. The two-equation model needs specification of k and
ε values at the inlet. Constant values of k and ε are imposed at the inflow boundary
based on the specified free-stream turbulence intensity and length scale as,

$$k_\infty = 1.5\, Tu_\infty^2 V_\infty^2, \qquad \varepsilon_\infty = c_\mu^{3/4} k_\infty^{3/2} / l_\infty. \qquad (13)$$

The free-stream turbulence intensity $Tu_\infty$ is taken in the range 2 to 12%.
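As a small worked illustration of (13), the sketch below evaluates k and ε from a turbulence intensity, a velocity and a length scale, and inverts the first relation to recover the local turbulence level. The value of $c_\mu$ = 0.09 is the usual k-ε model constant and is assumed here, as are the function names and the sample velocity and length scale.

```python
import math

C_MU = 0.09  # standard k-epsilon constant (assumed; not quoted in this section)


def inlet_turbulence(tu, velocity, length_scale, c_mu=C_MU):
    """Return (k, eps) from turbulence intensity, velocity and length scale."""
    k = 1.5 * (tu * velocity) ** 2
    eps = c_mu ** 0.75 * k ** 1.5 / length_scale
    return k, eps


def turbulence_level(k, velocity):
    """Inverse of the k relation: Tu = sqrt(2k/3) / V."""
    return math.sqrt(2.0 * k / 3.0) / velocity


# Hypothetical free stream: Tu = 2%, V = 100 m/s, l = 10 mm.
k_inf, eps_inf = inlet_turbulence(0.02, 100.0, 0.01)
```

The same function applies at the slot exit if the slot turbulence intensity and a length scale of 0.5 D are passed in, for example `inlet_turbulence(0.08, v_slot, 0.5 * 6.35e-3)` with a user-supplied `v_slot`.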

The experimental study of Pietrzyk et al (1989) indicates that a uniform mean
velocity profile at the jet exit may be a better approximation of the actual film cooling
conditions. Therefore, across the slot exit the following uniform conditions are applied.

$$\rho u = m_s \cos\alpha, \quad \rho v = m_s \sin\alpha, \quad T = T_s, \quad p = p_s,$$

$$k = 1.5\, Tu_s^2 V_s^2, \quad \varepsilon = c_\mu^{3/4} k^{3/2} / (0.5 D), \qquad (14)$$

where $Tu_s$ is considered as 8% here.

6. Results and discussions 

6.1 Model validation 

Comparisons of the computed flow-field with experiment are presented in this section.
All computations have been made on a PC-AT 486 based machine which has a CPU
speed of 1 MFLOPS. The solutions are assumed to be converged when the residuals
have dropped by at least four orders of magnitude. Steady state has
been achieved in 25,000 to 30,000 time steps.
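The convergence test can be expressed as a short check on the residual history. The sketch below assumes the drop is measured relative to the first recorded residual, which the text does not state explicitly; the function names are illustrative.

```python
import math


def orders_dropped(initial_residual, current_residual):
    """Number of decades by which the residual has fallen."""
    return math.log10(initial_residual / current_residual)


def is_converged(residual_history, required_orders=4.0):
    """True once the latest residual is `required_orders` decades below the first."""
    return orders_dropped(residual_history[0], residual_history[-1]) >= required_orders
```

For example, a history running from 1.0 down to 5e-5 counts as converged, while one that has only reached 1e-3 does not.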



Figure 3. Streamwise variation of adiabatic film cooling effectiveness for two different
blowing and density ratios, panels (a) and (b). Experimental values from Papell (1960).

A majority of the experiments on film cooling for gas turbine practice that appear in the
open literature, and the computational studies made to simulate those
experiments, have considered low free-stream velocities. An exception is the work of
Papell (1960), where the free-stream conditions were more representative of the gas
turbine environment, with $M_\infty$ = 0.5 or above, $T_\infty$ = 790 K and DR ≈ 2.0. For this
reason, the data set provided in Papell (1960) is chosen for validation.

Figures 3a and b compare the predicted and measured film cooling effectiveness
downstream of a normally injected coolant jet for different blowing ratios, m = 1.0 and
1.8, with DR ≈ 2.0. Although the overall streamwise trend is correct, the film cooling
effectiveness is under-predicted to some extent at m = 1.0 (figure 3a). At a relatively
higher blowing ratio of m = 1.8 (figure 3b), the predicted distribution of film cooling
effectiveness matches very well with experiment. The under-prediction of film cooling
effectiveness for the lower blowing rate may possibly be attributed to
over-predicted mixing by the turbulence model used in the present study. Another
uncertainty is that the upstream boundary layer thickness and its character are not
clearly specified in Papell (1960). Hence, for the present study, the length upstream of
the slot is fixed as x/D = 25, considering the fact that the influence of the upstream
boundary layer is not very strong for slot injection (Seban 1960), though this may
slightly affect the prediction. The free-stream turbulence considered here is 2%, which
gives a turbulent velocity profile near the leading edge. However, the overall
magnitude and trend of effectiveness are reasonably well predicted by the present
flow-solver. On this basis, the variation of flow-field and heat transfer phenomena with
changing blowing rate, injection angle and free-stream turbulence is numerically
investigated.

Figure 4. Computed velocity field (a) m = 1.0, α = 45°; (b) m = 1.0, α = 90°.

6.2 Numerical results 

6.2a Effects of injection angles and blowing rates: The computed velocity field
downstream of a jet injected at an angle of 45° for m = 1.0 and DR = 2 is shown in
figure 4a. These velocity profiles are very similar to those measured by Pietrzyk et al
(1989) and they clearly demonstrate a near-field wake region and a far-field relaxation
region. In the wake region, the velocity profile exhibits two distinguishable zones: (i) the
momentum deficit zone, indicating the development of a shear layer between the outer
edge of the injected jet and the crossflow, and (ii) right above the momentum deficit
zone, a region with a slight velocity overshoot caused by the crossflow accelerating over
the coolant jet. There is no flow separation downstream of the slot exit. For the same
blowing rate, as α is increased to 90°, the aerodynamics of the jet in crossflow
changes remarkably (figure 4b). The jet penetrates more into the mainstream, bends
over, reattaches to the surface forming a separation bubble and subsequently flows
downstream, approaching a boundary-layer type flow. The injected coolant undergoes
turbulent mixing with the mainstream while crossing over the bubble and, at the same
time, a small fraction of the injected coolant mass drifts into the recirculating bubble,
inducing cooling action near the slot. For 2-D slot injection this leads to high
effectiveness very near the slot exit even with downstream flow separation. In 3-D hole
injection, by contrast, the region beneath the jet is ventilated by the mainstream and a low
effectiveness occurs.

Figure 6. Computed streamwise variation of adiabatic film cooling effectiveness
demonstrating effects of blowing rate m and injection angle.

One important aspect of film cooling processes is the jet trajectory, which depends
mainly on the injection angle and the momentum ratio. A high trajectory implies
deeper penetration of the jet into the crossflow, development of a reversal zone,
pronounced flow mixing and hence less effectiveness. On the other hand, a low trajectory
results in an attached coolant layer, less mixing and high effectiveness. In the present
study, the temperature field is used to determine the trajectory of the coolant jet.
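For orientation, the momentum ratio can be related to the two parameters used in this paper. The expression below uses the conventional definition of the momentum flux ratio I, which is an assumption here since the text does not define it, together with the definitions of m and DR from the list of symbols:

$$I = \frac{\rho_s V_s^2}{\rho_\infty V_\infty^2} = \frac{m^2}{DR}, \qquad \frac{V_s}{V_\infty} = \frac{m}{DR}.$$

With DR = 2, a blowing ratio m = 1.0 gives I = 0.5 and a velocity ratio of 0.5, while m = 1.8 gives I = 1.62 and a velocity ratio of 0.9. On this definition the momentum ratio more than triples between the two blowing ratios considered.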

Figures 5a and b illustrate the temperature contours for jet-crossflow interaction
with DR ≈ 2 and m = 1.0 and 1.8 respectively, for α = 90°. As seen from these
figures, with the increase of blowing ratio, the coolant tends to retain its wall-jet
character with strong penetration into the crossflow and thereby follows a high
trajectory. To show the influence of injection angle on the trajectory, the temperature
contours for α = 45° and m = 1.8 are presented in figure 5c. This clearly demonstrates
that for the low angle of injection, the coolant hugs the surface to be protected, retaining
its identity for a larger length. Thus, under this condition, high cooling effectiveness is
expected even in the far field. As the angle of injection increases, the coolant penetrates more
into the crossflow, resulting in more jet dilution which would cause a rapid decay of
film-cooling effectiveness.



Figure 7. Computed streamwise variation of pressure ratio for different blowing rates.

Figures 6a and b indicate cooling effectiveness for different angles of injection and
blowing rates. They clearly demonstrate that the film cooling effectiveness increases in
proportion to the injected mass flow rate both in the near and far field regions for a low
injection angle (α = 45°) but only in the far field region for a high injection angle
(α = 90°). Increased α leads to a higher jet trajectory and also forms a separation bubble.
The size of the separation bubble and the turbulent mixing change with blowing ratio,
strongly influencing near-field effectiveness. Figure 5 also reveals that a shallow
injection always improves cooling effectiveness because of the attached coolant layer and
low mixing.

The static pressure distribution along the surface is depicted in figure 7 for different
blowing ratios. As seen, for m = 1, the ratio $p/p_\infty$ increases to a maximum of 1.08
upstream of the jet, drops suddenly to a minimum of 0.8 at the slot trailing edge and
subsequently recovers downstream. The upstream pressure rise increases with the increase
of blowing ratio, indicating that the static pressure developed at the upstream corner
of the slot is approximately equal to the injection total pressure. The flow separation
downstream of the slot becomes more pronounced as the blowing ratio increases, which is
reflected by the clearly visible pressure plateau in figure 7, especially for m = 1.8.

6.2b Effects of free-stream turbulence: Figure 8 shows the distribution of η as the inlet
free-stream turbulence ($Tu_\infty$) changes for m = 1.0 and α = 90°. It clearly demonstrates
that as $Tu_\infty$ increases, η decreases due to the pronounced mixing of the injected coolant
jet and the crossflow. Effectiveness decreases by almost 25% as $Tu_\infty$ is increased from
2% to 12%. Temperature fields, as shown in figures 9a and b, also illustrate the change
in mixing characteristics and jet trajectory as $Tu_\infty$ changes. The near-field mixing is
stronger for high $Tu_\infty$, resulting in a thick downstream thermal layer.



Figure 10. Effect of free-stream turbulence on mean velocity profiles (m = 1.0,
α = 90°) at different values of x/D.

Figure 11. Effect of free-stream turbulence on turbulent kinetic energy profiles
(m = 1.0, α = 90°) at different x/D values.


The mean velocity profiles and the distribution of local turbulence levels, $(2k/3)^{1/2}/V_\infty$,
are presented in figures 10 and 11 respectively for different spanwise locations for the
same blowing rate and injection angle, m = 1.0; α = 90°. Figure 10a shows a separation
region and velocity overshoot for $Tu_\infty$ = 2%, indicating the local crossflow acceleration
over the coolant jet and a corresponding drop of turbulence kinetic energy at y/D ≈ 2, as
in figure 11a. With the enhancement of inlet turbulence intensity both the separation
region and the velocity overshoot decrease due to the enhanced mixing. At all other
stations near the wall the gradient of the velocity profiles decreases with increase of $Tu_\infty$.
The highest turbulence occurs in the interaction zone (x/D < 20) due to the shear layer that
develops between the coolant jet and the crossflow, and is seen to increase with increase
of $Tu_\infty$. The trends of the distributions of mean velocity profiles and local turbulence levels
are similar to those observed by Seban (1960).

The computed normalised turbulent shear stress ($\overline{u'v'}/V_\infty^2$) for m = 1.0 and α = 90° is
presented in figure 12 for different free-stream turbulence levels. As seen, the magnitude
of the maximum shear stress increases and its location shifts towards
the slot as $Tu_\infty$ increases because of increased mixing.










Figure 12. Predicted normalised turbulent shear stress contours for different
free-stream turbulence levels: $Tu_\infty$ = 2% (a), 7% (b), 12% (c).

6.2c Turbulence quantities: The computed results presented in this sub-section aim
to highlight the complex turbulent field created by the jet-crossflow interactions for
changing blowing ratios and injection angles.

Figure 13. Predicted turbulent kinetic energy profiles at different spanwise locations
and varying blowing ratios for an injection angle, α = 90°.

Figure 13 shows local turbulent kinetic energy distributions at different spanwise
locations downstream of the slot exit for an angle of injection α = 90° with varying blowing
ratios, m = 1.0 and m = 1.8. The same is represented in figure 14 for a shallow injection,
α = 45°. The corresponding velocity profiles at the same axial locations are presented in
figures 15 and 16 respectively. They clearly show that a shallow injection produces less
generation and a gradual diffusion of turbulent kinetic energy because of an attached
coolant layer and a slower mixing rate. Thus, injection at low angles would result in
a high film cooling effectiveness. Figure 14 also demonstrates that, except very near the
injection, the predicted maximum level of turbulent kinetic energy has not increased,
but rather has dropped at the axial location x/D = 14.7, as the blowing ratio is
increased to 1.8. The corresponding velocity profiles (figure 16) reveal that for shallow
injections, the momentum deficit between the injected jet and the crossflow, which is the
main source of turbulence generation for this kind of problem, has dropped with an
increase of blowing ratio. But for a high injection angle (α = 90°), the maximum level of
turbulence is increased and the peak in the turbulent kinetic energy profiles (figure 13) is
shifted outwards with blowing ratio because of more jet penetration and flow mixing.
Here also, the peak in the turbulent kinetic energy profiles can be directly linked with the
maximum momentum deficit in the velocity profiles (figure 15), which demonstrate a drastic
change in the relaxation behaviour of velocity and the length of flow separation as
blowing ratio increases. As seen in figure 15, the flow has not relaxed to a turbulent
boundary layer type flow even far downstream, x/D = 35, for m = 1.8 with α = 90°.

As expected, because of less penetration and slow mixing of the coolant jet at a low
injection angle, the maximum level of normalised shear stress has dropped to −0.0025
from −0.005 for m = 1.0. For a high blowing rate (m = 1.8), the maximum value of
computed shear stress decreases further to −0.002 (figure 17) because of greater
uniformity in the velocity profiles, as already explained.

Figure 14. Predicted turbulent kinetic energy profiles at different spanwise locations
for a shallow injection angle, α = 45°.

Figure 15. Predicted velocity profiles at different spanwise locations and varying
blowing ratios for an injection angle, α = 90°.

Figure 16. Predicted velocity profiles at different spanwise locations and varying
blowing ratios for a shallow injection angle, α = 45°.


7. Conclusions 

A Navier-Stokes procedure coupled with a compressible low-Reynolds number form of
the k-ε turbulence model in an explicit finite volume treatment is used to simulate
jet-crossflow interactions with changing free-stream turbulence levels, blowing ratios
and injection angles for film cooling applications. The present analysis is limited to
a two-dimensional simulation where free-stream conditions are similar to the gas
turbine practice.

Figure 17. Computed normalised turbulent shear stress for m = 1.8 and α = 45°.

The following important points emerge from the present investigation. 

(i) The injection angle and blowing ratio strongly influence the aerodynamics of 
a coolant jet in a crossflow and thus downstream adiabatic film cooling effectiveness. 
For large injection angles, the coolant penetrates more into the crossflow resulting in 
greater turbulence generation, followed by enhanced mixing and a rapid turbulent 
diffusion which leads to a faster decay of film cooling effectiveness in the near field 
region. A flow reversal also occurs just downstream of the slot exit. For shallow 
injection angles, the coolant hugs the surface with low turbulence generation and slow 
mixing rate, retaining its identity for a larger length. This results in high cooling 
effectiveness even in the far field region. For an increase in blowing rate, the overall 
downstream film cooling effectiveness improves due to the thickened thermal layer and 
more available cold fluid. 

(ii) The free-stream turbulence affects the temperature distributions downstream of 
a coolant jet significantly by changing the mixing characteristics, local turbulence 
levels, mean velocity profiles and the jet trajectory. Film cooling effectiveness drops by 
almost 25% as inlet free-stream turbulence increases from 2 to 12%. 

(iii) Shallow injection changes the computed turbulent field indicating less generation 
and a gradual diffusion of turbulent kinetic energy. At a relatively low injection angle, 
an increase of blowing rate reduces the momentum deficit and in turn the maximum 
turbulence level which confirms that the shear layer between the crossflow and the 
coolant jet is the main source of turbulence generation for this kind of problem. 

List of symbols 

D          coolant injection slot width;
DR         density ratio = $\rho_s/\rho_\infty$;
e          total energy per unit mass;
F, G       flux vectors;
l          turbulence length scale;
M          Mach number;
m          blowing ratio = $(\rho V)_s/(\rho V)_\infty$;
$m_s$      coolant mass flow rate;
Pr, $Pr_t$ Prandtl numbers;
p          pressure;
q          Cartesian components of heat flux vector;
R          gas constant;
Re         Reynolds number;
S          surface normal to a side of the cell;
T          temperature;
$T_{aw}$   adiabatic wall temperature;
$T_r$      mainstream recovery temperature;
$T_{rs}$   recovery temperature of coolant;
Δt         time step;
Tu         turbulence intensity;
U          solution vector;
$u_\tau$   friction speed = $(\tau_w/\rho)^{1/2}$;
u, v       Cartesian velocity components;
V          magnitude of velocity vector;
x, y       Cartesian coordinates (defined in figure 1);
$y^+$      law-of-the-wall coordinate = $y\,[(\rho/\mu)(\partial u/\partial y)]_w^{1/2}$;
α          injection angle of the coolant jet;
$\delta_{ij}$  Kronecker delta;
η          adiabatic film effectiveness = $(T_{aw} - T_r)/(T_{rs} - T_r)$;
μ, $\mu_t$ molecular and turbulent viscosities;
ρ          density;
$\tau_{ij}$  Cartesian components of stress tensor;
Ω          cell volume.

Superscripts 


fluctuating quantity in density-weighted 
time averaging; 

density-weighted time-averaged quantity. 


Subscripts 

i, j       logical cell indices in space;
s          coolant slot exit conditions;
t          turbulent;
w          wall condition;
∞          mainstream conditions at the inlet plane.

X-ray streak cameras

Abstract. We describe the development of an optical and an X-ray streak
camera with picosecond time resolution. The entire peripheral electronics
and testing systems have been developed indigenously. Both streak
cameras provide a streak rate of ~ 15 mm/ns with a sweep voltage of ~ 1 kV
amplitude and rise time of 1 ns. The time and spatial resolution of the
optical streak camera have been found to be ~ 17 ps and 100 µm respectively.
The sweep pulse generator developed for this purpose provides a step
pulse of rise time ≤ 1 ns and amplitude ~ 2 kV. The laser diode used for
testing the optical streak camera exhibits multiple pulsation when the
pump current is increased beyond a critical threshold.

Keywords. Optical and X-ray streak cameras; picosecond time resolution; 
multiple pulsation. 


1. Introduction 

A streak camera is one of the most versatile instruments (Schelev et al 1972; Bradley
et al 1980; Campillo & Shapiro 1983; Kinoshita et al 1983; Stradling et al 1983;
Tsuchiya 1983; Dashevsky et al 1988) used today for high speed photometry in the fields
of physics, chemistry, biology and nonlinear optics. Conventional optical detectors
such as biplanar photodiodes and PIN silicon photodiodes used with the fastest available
oscilloscope provide a time resolution of $10^{-10}$ s. Other techniques generally
used for investigating very fast events are the autocorrelation method, second
harmonic generation, two-photon fluorescence and four-photon parametric mixing.
However, some inaccuracy is always associated with these
measurements. Amongst all the available methods, streak photography is considered
to be the best for recording ultrafast optical phenomena with an excellent time
resolution (< 1 ps). It can provide the temporal profile of any optical event directly, even
in single-shot operation. In addition to the time duration, the rise time as well as the shape of
the optical pulse can also be recorded, along with any intensity modulation (fine structure)
present during the optical event. However, measurements will be limited by the time
resolution of the streak camera. Streak cameras with nanosecond time resolution were
built long ago, but temporal resolutions of the order of picoseconds or sub-picoseconds
have been achieved only in recent years. A new streak camera with femtosecond time
resolution has also been reported (Kinoshita et al 1987). Highly sensitive streak
cameras with picosecond time resolution are now commercially available which are
capable of recording any phenomenon in the spectral range extending from the near
infrared to the ultraviolet and soft X-ray (100 eV to 10 keV) regions with the help of a suitable
photocathode attachment (Schelev et al 1972; Bradley et al 1980; Campillo & Shapiro
1983; Kinoshita et al 1983; Stradling et al 1983; Tsuchiya 1983; Dashevsky et al 1988).
In fact, the streak camera can functionally be referred to as an optical oscilloscope with
a bandwidth > 300 GHz. However, the streak camera possesses many more special
features and scores over an ordinary oscilloscope; for example, it can provide
three-dimensional information such as spatially time-resolved as well as time-resolved
spectroscopic intensity profiles (Campillo & Shapiro 1983; Tsuchiya 1983). Either
a photographic system or a digital image memory is used to store and analyse
three-dimensional streak images. The latter provides almost real-time measurements of
the events.

The present paper reports on the development of a picosecond resolution optical 
(Rai et al 1994b) and X-ray streak cameras (Rai et al 1994c) intended for studying fast 
processes in IR, visible, UV and soft X-ray wavelength regions. The design of the 
instrument, its peripheral electronics, radiation sources for testing the cameras as well 
as test results are presented. Section 2 describes both types of streak tubes used in these 
experiments. Section 3 reports the development of related electronics to drive the streak 
camera. Section 4 provides information about various kinds of optical sources used for 
testing the streak cameras. Test results and discussions are presented in § 5, whereas 
conclusions are given in § 6. 

2. Streak tubes 

2.1 Optical streak tube 

We have used the optical streak and image intensifier tubes manufactured by General
Physics Institute, Moscow for building our optical streak camera (figure 1). It has an S1
type photocathode (Ag-O-Cs) deposited on a flat glass plate of 5 mm diameter and is
sensitive in the spectral range 450-1150 nm with peak response at 800 nm. An
accelerating grid placed at a distance of 1 mm from the photocathode has 60%
transmission. The grid is used for gating as well as for reducing the time resolution limit of
the tube to < 1 ps. The tube has a focussing electrode which focusses the photoelectrons
on to the phosphor screen and also determines the spatial resolution of the tube. The
photocathode, accelerating grid and focussing electrode require biasing voltages of −15,
−13 and −12 to −13 kV respectively. It has an anode which acts as an aperture in the
tube and remains at ground potential. There are two pairs of deflector plates, out of
which one pair is used to sweep the photoelectrons across the screen whereas the other pair
is used to return the photoelectrons to their initial position without streaking back
across the screen. The phosphor screen of the
camera is made of aluminized P11 material which has peak response at 460 nm (blue
region) and is coupled to the output face of an optical fibre plate. The tube is of the
sealed-off type with a glass window on the photocathode side. An image intensifier tube
is coupled to the streak tube as shown in figure 1. The image intensifier tube consists of
a photocathode, a phosphor screen and a multichannel plate (MCP) to multiply the
photoelectrons. This system is used to enhance the intensity of the image obtained at
the phosphor screen of the streak tube. It has a maximum gain of ~ $10^4$ divided into
5 steps. Finally, the image on the screen of the intensifier is recorded by using a contact
photograph or a charge coupled device (CCD) camera.

Figure 1. Schematic diagram of streak camera, showing the streak tube and the image
intensifier tube.

2.2 X-ray streak tube 

The input end of the X-ray streak tube (also obtained from the Institute of General
Physics, Moscow) is attached to a metallic flange, which has a clearance for the
photocathode at the centre and a number of small holes around it to evacuate the tube.
Generally, an X-ray streak tube has a demountable photocathode and is evacuated
directly during experiments. The remaining parts of the tube are similar to those of the
optical streak tube. The X-ray sensitive photocathode consists of a 300 Å thick gold
film deposited on a 1000 Å thick parylene or nitrocellulose film which is fixed on the
cathode disc having a 100 µm × 9 mm slit. This photocathode is sensitive to soft X-rays
ranging from 100 eV to 10 keV as well as to UV radiation. The slit image produced by
the incident soft X-rays is focussed on to the phosphor screen by the electron lens formed
by the combination of the accelerating mesh, the focussing electrode and the anode. The
photoelectrons are streaked on the 40 mm diameter screen by means of the vertical
deflection electrodes. The intensity of the image is then enhanced by the image intensifier
tube and the output image is recorded using a camera.
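To put the quoted photon energy range in terms of wavelength, the short sketch below applies the relation λ = hc/E. The constant hc ≈ 1239.84 eV nm is a standard physical value; the function name is only illustrative.

```python
HC_EV_NM = 1239.841984  # Planck constant times speed of light, in eV nm


def photon_wavelength_nm(energy_ev):
    """Wavelength in nanometres of a photon with the given energy in eV."""
    return HC_EV_NM / energy_ev


soft_limit_nm = photon_wavelength_nm(100.0)   # low-energy end of the range
hard_limit_nm = photon_wavelength_nm(1.0e4)   # high-energy end of the range
```

The 100 eV to 10 keV range therefore corresponds to wavelengths from roughly 12.4 nm down to about 0.124 nm.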

3. Electronics 

The peripheral electronics of the streak camera contain various kinds of fast high-voltage
as well as low-voltage pulser circuits. The main high voltage requirements for the
operation of the streak camera are (i) a high voltage (~ 1 kV) linear ramp of dual polarity
for full deflection of the image across a 25 mm image intensifier screen, (ii) a short
rectangular pulse of variable duration (100 ns to 2 µs) and amplitude (max 2 kV) to
gate the accelerating grid as well as to improve the time resolution of the camera, (iii)
a trigger pulse generator, which triggers the sweep circuit and is itself externally
triggered by a fast photodiode signal, and (iv) high voltage DC power supplies to bias
the various electrodes of the streak and intensifier tubes, such as the photocathode,
accelerating grid and focus electrode, as well as to drive the sweep and gate pulse circuits.

3.1 Sweep pulse circuit 

A picosecond time resolution is obtained in a streak camera by streaking the
photoelectrons with a streak velocity of ~ $10^{10}$ cm/s. This indicates the requirement of a sweep
voltage having a rise time ≤ 1 ns. The rise time obtained using an RLC integrator circuit is
slower than this requirement. In order to prevent image distortion and improve camera
dynamics as well as spatial resolution, two symmetrical well-balanced sweep pulses are
required, which in fact makes the sweep faster in comparison to a single sweep pulse of the
same rise time. The linearity of the sweep pulse and, as a result, the sweep speed are
improved by increasing the amplitude of the sweep pulse beyond the required value
(overpulsing). Many circuits have been reported for generating a sweep pulse for the
streak camera. These utilize various kinds of fast switching elements such as the krytron
(Bradley et al 1972), power-beam microwave triode (Alcock et al 1970), laser triggered
spark gap (Alcock et al 1970; Schelev et al 1972) as well as avalanche transistors (Cunin
et al 1980; Baker 1991). The transistorized circuits available to date provide rise times
of 1.5 to 2 ns depending on whether avalanche transistors are used in a series stack
mode or in a Marx bank configuration. Two pulser circuits (Rai et al 1994a; Rai
& Shukla 1994) were developed in our laboratory. One of the pulser circuits was
developed (Rai & Shukla 1994) by using a stack of 15 avalanche transistors, 2N5551,
connected in series as a single switching element, as shown in figure 2. A transmission
line of 75 (50) Ω impedance and ~ 1.5 m in length was charged to 4.5 kV and
discharged through a 75 (50) Ω load. We obtained a square step pulse of ~ 2 kV across
a 75 (50) Ω resistor instead of ~ 2.25 kV as a result of a small voltage drop across the
transistors due to their finite resistance during breakdown. This circuit provides a
square pulse of rise time 800 ps (10-90%) as shown in figure 3 when recorded on a
Tektronix oscilloscope (model 7104) having a vertical plug-in model 7A29, which had
a combined bandwidth of 1 GHz. To achieve rise times better than 1 ns with a series
combination of transistors, reduction of excess lead length and proper soldering are needed to
reduce the extra inductance in the circuit. A carefully designed printed circuit board was
prepared for this purpose. This circuit was triggered through a wide band pulse
transformer which consists of a small toroidal ferrite core with five turns of 50 Ω coaxial
cable, where the cable shield acts as the primary and the central conductor as the secondary.
Triggering of any transistor in the stack switches the circuit, but better stability was
obtained when a middle transistor of the stack was triggered. The triggering pulse was
sharpened using a 20 pF capacitor and a 50 Ω resistance at the output of the transmission
line pulse transformer in a differentiation mode. The small width of the trigger pulse
effectively protects the base-emitter junction of the transistor being triggered and as a
result enhances the lifetime of the circuit. Many earlier reported circuits (Cunin et al 1980;
Baker 1991) use a ballast resistance between the emitter and collector of each transistor to avoid
self-triggering in the circuits. We, however, did not observe any self-triggering in our
circuits even without the ballast resistances. Nevertheless, a properly designed ballast
circuit may decrease the high jitter (< 500 ps) observed in this case to < 100 ps and may
keep the string from self-triggering over a wide temperature range.

Figure 2. Diagram of sweep pulse generator circuit.

Figure 3. Oscillograms of sweep pulses recorded on a Tektronix model 7104 oscilloscope
(1 GHz) having vertical plug-in model 7A29. (a) Horizontal - 200 ps/div., vertical -
400 V/div.; (b) horizontal - 2 ns/div., vertical - 400 V/div.
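Several of the numbers quoted for the pulser can be cross-checked with elementary relations. The sketch below is a back-of-the-envelope aid, not part of the reported design: it assumes an ideal charged line discharging into a matched load, a cable velocity factor of 0.66 (typical of solid-polyethylene coaxial cable, not stated in the text), the common 0.35/bandwidth rule for instrument rise time, and quadrature addition of rise times.

```python
import math

SPEED_OF_LIGHT = 299_792_458.0  # m/s


def matched_line_pulse(charge_voltage, line_length_m, velocity_factor=0.66):
    """Ideal amplitude (V) and duration (s) of a charged line into a matched load."""
    amplitude = charge_voltage / 2.0  # half the charging voltage
    duration = 2.0 * line_length_m / (velocity_factor * SPEED_OF_LIGHT)
    return amplitude, duration


def scope_rise_time(bandwidth_hz):
    """Approximate 10-90% rise time of an instrument of given bandwidth."""
    return 0.35 / bandwidth_hz


def corrected_rise_time(measured_s, instrument_s):
    """Remove the instrument contribution, assuming rise times add in quadrature."""
    return math.sqrt(measured_s ** 2 - instrument_s ** 2)


def rc_time_constant(resistance_ohm, capacitance_f):
    """Time constant of the RC differentiator used to sharpen the trigger."""
    return resistance_ohm * capacitance_f


amplitude, duration = matched_line_pulse(4500.0, 1.5)
pulse_rise = corrected_rise_time(800e-12, scope_rise_time(1.0e9))
trigger_tau = rc_time_constant(50.0, 20e-12)
```

Under these assumptions the ideal step is 2.25 kV, in line with the value quoted in the text, and lasts about 15 ns. The 1 GHz oscilloscope contributes about 350 ps, so the measured 800 ps would correspond to a pulse rise time nearer 720 ps, and the 20 pF, 50 Ω differentiator has a time constant of 1 ns.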

A simple technique was used for generating two symmetrically opposite polarity 
pulses from a single pulse of either polarity. This arrangement is shown in figure 
4 which had two similar length (2 m) 75(50) Q. impedance cables connected in parallel at 
the output of the pulser circuit and terminated with 75(50) (2 resistances. Towards the 

Figure 4. Arrangement to obtain positive and negative polarity pulses from a pulse of either polarity.

Figure 5. Oscillograms of two opposite polarity pulses recorded on a Tektronix model 7348 oscilloscope (400 MHz) having vertical plug-in model 7A19. (a) 10 ns/small div., vertical: 160 V/small div.; (b) 1 ns/small div., vertical: 160 V/small div.


termination end, the first cable was grounded at the shield and the output was taken
from the central pin, which provided a pulse of the same polarity as the input pulse but
of around one half the amplitude. The second cable had its central pin grounded at the
termination end and the output was taken from the shield of the cable (figure 4), which
provided a pulse of polarity opposite to that of the input pulse and of similar
amplitude to that obtained from the first cable. Care must be taken to isolate the shield
of the latter cable from the normal ground at the termination end, otherwise the rising
edge of the pulse becomes distorted. Figure 5 shows the two symmetrically opposite
polarity pulses of rise time better than 1 ns. For application of these pulses in
the streak camera, only the rise time, amplitude and a small delay are important
parameters of the pulser circuit. These opposite polarity pulses were used to operate the
streak camera. However, it was noticed that the rise time of the pulse was modified to
~ 2.5 ns due to capacitive loading from the deflection plate. The superimposed
fluctuations observed on the pulses in figures 3 and 5 appear to be due to unwanted
signals travelling on the outside of the outer conductor. These fluctuations can be
removed or minimised by placing the circuit in a properly shielded box or by wrapping
the coaxial cable on a large ferrite core. The signal on the outside of the outer conductor
will then travel at a slower velocity due to the high μ value of the ferrite. For the slow streak
velocity, an RLC integrator was used at the output of the circuit.

3.2 Gate pulse circuit 

It has been reported earlier (Schelev et al 1972) that the resolution of the streak camera
strongly depends on the electric field between the photocathode and the grid. Normally
1-2 kV is sufficient, but for better time resolution a higher voltage is required on the grid
beyond the normal value, which is provided in the form of a pulse to avoid undesirable
effects (Schelev et al 1972). Sometimes gating is also needed to avoid the background noise.
For this purpose, a gate pulse of positive polarity is required, having an amplitude of
1-2 kV and a time duration of ~ 100 ns, much longer than the sweep voltage time (~ 1-20 ns)
but synchronized with it. A pulse of this duration (100 ns) can be obtained by
using the earlier circuit (figure 2) with at least ~ 10 m of coaxial cable of 50 Ω
impedance. To simplify this circuit, the coaxial cable was replaced by a discrete element
transmission line. The impedance of this transmission line was increased to 1000 Ω to
decrease the current flowing in the circuit and so avoid the failure of transistors that was
observed in the case of a 50 Ω impedance line providing ~ 100 ns duration pulses.




Picosecond time resolution streak cameras 





Figure 6. Diagram of gate pulse circuit. 


The gate pulse circuit is shown in figure 6, which provides a square pulse of ~ 100 ns
duration with 1-5 kV amplitude. This line also acts as a low pass filter for all the
frequencies between zero and $(2\pi(LC)^{1/2})^{-1}$, whereas the impedance of the line is given
by

$R_L = (L/C)^{1/2}$. (1)

The quantity $(LC)^{1/2}$ represents the delay time per section of the line (low pass filter).
Therefore, an n-section filter will produce a pulse of duration $\tau$ given by

$\tau = 2n(LC)^{1/2}$. (2)

The values L = 10 μH and C = 10 pF were calculated for a discrete component
n = 5 section transmission line considering $R_L$ = 1000 Ω and $\tau$ = 100 ns using (1) and (2).
The output of the gate pulse circuit was connected to the accelerating grid through a capacitor
to isolate the gate circuit from the high voltage present on the grid.
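
The component values quoted above follow directly from (1) and (2). The short sketch below is an illustration only (the function name `design_line` is hypothetical, not from the original work): it solves the two relations for the inductance and capacitance per section, given the line impedance, the pulse duration and the number of sections.

```python
import math

def design_line(r_line_ohm, pulse_s, n_sections):
    """Solve R_L = sqrt(L/C) and tau = 2 n sqrt(L C) for L and C per section."""
    delay_per_section = pulse_s / (2 * n_sections)   # sqrt(L*C), in seconds
    inductance = r_line_ohm * delay_per_section      # L = R_L * sqrt(L*C)
    capacitance = delay_per_section / r_line_ohm     # C = sqrt(L*C) / R_L
    return inductance, capacitance

# Values stated in the text: R_L = 1000 ohm, tau = 100 ns, n = 5 sections.
L_sec, C_sec = design_line(1000.0, 100e-9, 5)
# Upper edge of the pass band as written in the text, 1 / (2 pi sqrt(L C)).
f_upper = 1.0 / (2 * math.pi * math.sqrt(L_sec * C_sec))

print(f"L = {L_sec * 1e6:.1f} uH, C = {C_sec * 1e12:.1f} pF")
print(f"pass band up to about {f_upper / 1e6:.1f} MHz")
```

With the stated inputs this reproduces an inductance of 10 μH and a capacitance of 10 pF per section.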

3.3 Trigger pulse circuit 

A fast (< 1.5 ns rise time) trigger pulse generator was developed to trigger the sweep as well
as the intensifier circuit. The block diagram of the circuit is shown in figure 7. This pulse
generator employs transistors (2N5551) in avalanche mode as switching elements.
A capacitor of 40 pF was charged up to 300 volts and discharged through a resistive
divider to provide pulses of ~ 50 volts amplitude and ≲ 3 ns duration across a 50 Ω
load. It could be operated in manual, external or internal modes. The frequency of
operation in the internal mode was from 10 Hz to 100 kHz. Manual operation of this
generator was achieved by using 4001 CMOS ICs in monostable mode, where IC1 acts
as a buffer and IC2 acts as a dual monoshot IC. Internal mode operation of the generator
was achieved using a 4047 CMOS IC in astable mode as shown in figure 7, where the
frequency of operation from 10 Hz to 100 kHz was achieved by proper selection of
R and C. The transistor T2 (2N5551) had a 50 Ω resistance at its base for triggering





V N Rai et al 


Figure 7. Trigger pulse generator circuit. Labels recoverable from the figure: 15 V, 18 V, external trigger input (1 µF), and the frequency selection table below.

FREQUENCY    C
10 Hz        0.1 µF
100 Hz       0.01 µF
1 kHz        1000 pF
10 kHz       100 pF


purposes. Since CMOS ICs can drive only high impedance loads, another transistor T1
(2N5551) was used in the saturation mode to amplify the current level. External
triggering of the generator was achieved by direct application of an external trigger at
the base of the transistor T2. The mode of operation (internal, manual or external)
is selected with the help of a DPDT switch. The fast electrical pulse obtained from this
circuit is shown in figure 8.

3.4 Biasing power supply 

Several DC high voltage power supplies are needed to drive the picosecond streak
camera. The photocathode, accelerating grid and focussing electrode of the X-ray
streak camera require biasing voltages of -12, -10 and -11 kV respectively, with
respect to the anode and the phosphor screen. A 4-5 kV power supply drives the sweep
voltage and the gate circuit, whereas a variable ± 500 V supply controls the position of
the streak on the screen. Similar power supplies were also used to test the optical streak
camera, in place of the -15, -15 and -13 kV biasing voltages required for the
photocathode, accelerating mesh and focussing electrode respectively.

Figure 8. Oscillogram of the fast trigger pulse recorded on LeCroy model 9360 oscilloscope (5 GHz).

4. Setup for streak camera testing 

4.1 Laser diode: A source for optical streak camera 

A single heterojunction GaAs laser diode LD-62 emitting at 904 nm was used as an
optical source for testing the optical streak camera. The optical pulses were generated
with the help of a driving circuit (Rai et al 1995a) having a series stack of two 2N5551
transistors operating in avalanche mode (figure 9). A capacitor of 40 pF charged to
~ 600 volts was discharged through the laser diode. The peak current in the laser diode
was decided by the charge stored in the capacitor (Q = CV). The values of C and V can be
changed according to the current required in the diode. The rise time and pulse
duration of the electrical pulse were recorded as ~ 1 ns and ~ 2.5 ns respectively. The
current pulse duration depends on the value of C and the resistance in the discharging
path. However, an attempt to reduce the rise time and the duration of the current
pulse by increasing the breakdown current was not successful. The laser diode driver
circuit was triggered by the trigger pulse generator described in § 3.3. The trigger pulse was
applied through a fast transmission line pulse transformer to isolate the driver circuit
from other components. The laser diode (LD-62), obtained from Laser Diode Incorporated,
USA, has a lasing threshold current of ~ 6 A, a maximum peak forward current of
~ 20 A and a total peak radiant flux of 5-6 W, with peak emission wavelength at 904 nm.
There is no critical requirement on the power (radiant flux) emitted from the laser diode to
test the streak camera. Even a diode laser operating below the lasing threshold current
(spontaneous emission mode) can be used to test the camera, owing to the camera's
sensitivity. This diode was mounted on an aluminium block which acted as a heat sink.
The circuit diagram of the laser driver is shown in figure 9. The laser diode was operated
in three modes: (i) manual mode, (ii) external trigger mode and (iii) internal trigger mode.
Optical pulses were available with repetition rates varying from 10 Hz to 10 kHz when
operated in internal mode and as single pulses when operated in external or manual
mode. The shortest light pulse the laser diode could provide was of 120 ps duration.
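
As a rough plausibility check on the relation Q = CV, the sketch below estimates the charge stored in the 40 pF capacitor at 600 V and the mean current it could deliver. It assumes, purely for illustration, that all of the charge flows through the diode as a rectangular pulse lasting 2.5 ns; the real pulse shape and circuit losses are not modelled, and the function names are hypothetical.

```python
def stored_charge(capacitance_f, voltage_v):
    """Charge on the capacitor, Q = C * V, in coulombs."""
    return capacitance_f * voltage_v

def mean_current(charge_c, duration_s):
    """Mean current if the whole charge flows in one rectangular pulse."""
    return charge_c / duration_s

charge = stored_charge(40e-12, 600.0)      # 40 pF charged to ~600 V
current = mean_current(charge, 2.5e-9)     # assumed 2.5 ns pulse duration

print(f"stored charge = {charge * 1e9:.0f} nC")
print(f"mean current  = {current:.1f} A")
```

Under these assumptions the estimate comes to roughly 24 nC and just under 10 A, which lies between the threshold current (~ 6 A) and the maximum peak forward current (~ 20 A) quoted for the diode.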


Figure 9. Driving circuit for the laser diode.

Figure 10. Circuit diagram of the spark gap driver.


4.2 Source for testing X-ray streak camera 

Two types of sources were used to test the X-ray streak camera: (i) UV radiation from
a spark gap kept in a vacuum chamber and (ii) soft X-rays emitted from laser
produced plasmas.

4.2a Spark gap driver: Figure 10 shows the circuit diagram of the spark gap driver.
In this circuit a 1 μF capacitor charged up to ~ 400 V was discharged through the
ignition coil by triggering the SCR. This provided a high voltage pulse at the output of the
ignition coil, which was connected to one of the electrodes (pin-like, sharp) of the
spark gap (kept in $10^{-5}$ torr vacuum, with the electrodes separated by a fraction of a millimetre),
whereas the other electrode (a plate) was grounded. A pulse of about 5 kV applied to the
spark gap produced a spark of duration ~ 1-3 μs. It has been reported (Dashevsky et al
1988) that the radiation spectrum of such a source has a maximum in the UV range and
has sufficient power to produce a bright image of the slit on the screen of the streak
camera. This source was operated manually. This driver was used to test the operation
of the streak camera at nanosecond time resolution. Poor shot-to-shot energy
reproducibility is the main disadvantage of the spark gap driver. Pulsed UV emissions
from the spark gap were studied using a slow sweep voltage (~ 5-20 μs), generated with
the help of transistors, applied to the deflection plates of the streak camera.
Experimental results indicate the presence of a substructure of ~ 100 ns duration
within the flash duration (~ μs).

4.2b X-rays from laser produced plasma: The short duration X-rays generated
from the laser produced plasma were used as a source to test the picosecond X-ray streak
camera. The experimental system (figure 11) consists of a target chamber of 20 cm
diameter and 20 cm height with 6 ports in radial directions, of which 4 ports are at
90° and two ports are at 45° with respect to the port used for passing the laser
beam. The streak camera was attached to the chamber at a port 90° to the laser axis.
A copper target was kept at the centre of the chamber, evacuated to $10^{-5}$ torr.
A 35-picosecond mode-locked Nd:YAG laser delivering 75 mJ energy was focussed
onto the copper target to produce the plasma and, as a result, a subnanosecond duration
soft X-ray pulse was emitted. A portion of the laser pulse energy was used as a reference
for triggering the sweep and gate circuits. The main laser beam generating the plasma
and X-rays was delayed by ~ 25 ns to synchronise the arrival of photoelectrons with the
start of the sweep voltage on the deflection plates. Two parallel plates, one at -12 kV
and the other at ground voltage, were kept in front of the photocathode in such a way as to
deflect and collect the ions and electrons generated in the chamber, avoiding any direct
impact of charged particles on the photocathode. A B-10 polycarbonate foil was used to
cover only half of the vertically located slit, such that the open part of the photocathode
was sensitive to UV and soft X-rays whereas the covered part was sensitive to soft
X-rays only. A 5-ns rise time sweep streaked the image in the horizontal direction.

Figure 11. Experimental set-up for testing X-ray streak camera.

4.3 Recording system 

The streaked image from the screen of streak camera was recorded on a polaroid film 
by the contact photography method as well as by using the CCD camera. The images 
recorded using the CCD camera were analysed on a personal computer using the 
software PROMISE (Profile Measurement of Image Size and Edge location) to obtain 
the temporal intensity profile of the image (Vora et al 1994). 

5. Results and discussion 

5.1 Study of multiple pulsation in diode laser using optical streak camera 

Figure 12. Experimental set-up for testing optical streak camera.

The experimental setup for testing the optical streak camera is shown in figure 12. The
optical pulse from a laser diode was focussed on the photocathode of the optical streak
camera using an f/1 lens of f = 5 cm. The photoelectrons were streaked on the screen
using a 1-ns sweep circuit and the image was recorded on the polaroid film. A second
shot was taken by delaying the arrival of the light pulse by 1 ns. The images of the first
and the second light pulse were separated by ~ 15 mm in space, which corresponds to
1 ns. This provided us with a fastest experimental streak rate of ~ 15 mm/ns (streak
velocity $1.5 \times 10^{9}$ cm/s), which is less than the expected value of ~ 40 mm/ns. This is
due to the capacitive loading of the deflection plates, which slows down the sweep rise
time to ~ 2.5 ns and ultimately provides a lower streak velocity. By shifting the single
optical pulse on the screen using a known delay in steps, we found the sweep to be linear
within 10%. The measurements indicate that the smallest duration light pulse from the
laser diode lasts for ~ 120 ps when the charging capacitor is ~ 40 pF. The pulse
duration increases to ~ 180 ps for a capacitor value of 120 pF (figure 13). This indicates
that the optical pulse duration is dependent on the RC time constant of the circuit and
the intensity is dependent on the product of capacitance and the charging voltage. Figure 14
shows the variation of light intensity emitted from the diode laser with the pumping current,
which was recorded using the photodiode. Here the diode laser was operated by
discharging two capacitors, C = 270 pF and 120 pF, one by one. The figure indicates that the
emitted intensity increases slowly up to the threshold ($I_0/I_{Th}$ ~ 1) and then increases
very quickly. The intensity variation was similar whether the diode laser was operated



Figure 13. Single optical pulse emitted from laser diode and recorded using optical streak camera.

Figure 14. Variation of optical intensity emitted from laser diode with pumping current pulse (injection current/threshold lasing current, $I_0/I_{Th}$).


by discharging C = 270 pF or C = 120 pF, except for a higher lasing threshold in the
latter case. It was noticed that a single optical pulse was obtained just above the
threshold current for lasing. On increasing the current above the threshold, i.e. on raising the
charging voltage on the capacitor, the number as well as the duration of the light pulses
emitted from the diode laser increase (Rai et al 1995b) (figure 15A). A further increase
in the laser diode current decreases the time separation between the emitted light pulses,
which ultimately merge with one another to form a long duration pulse similar to
the current pulse. This type of behaviour has been observed earlier in various cw and
pulsed semiconductor diode lasers (figure 15B, from Elliot et al 1983), and is known as
self-sustained pulsation (Chik et al 1980; Elliot et al 1983; Lau et al 1983; Yariv 1985;
MacFarlane & Tatum 1992; Rai et al 1995a). These pulses occur at certain frequencies
which coincide with the relaxation oscillation frequencies of the diode lasers (Rai et al
1995b). Figure 16 shows the variation of experimentally observed relaxation oscillation



Figure 15. (A) Multiple pulsation observed in laser diode emission at $I_0/I_{Th}$ ~ 1.32, recorded using optical streak camera. (B) Multiple pulsation observed with GaAlAs diode laser; (a) and (b) are two typical examples of individual pulses (Elliot et al 1983).

Figure 16. Variation of measured relaxation oscillation frequency with the root of intensity emitted from diode laser.


frequencies ($\nu_{rel}$) with the square roots of the emitted light intensities, which are
proportional to the photodiode signals. The linear variation of the data points indicates good
agreement with the relation given by Lau et al (1983) for the relaxation oscillation frequency,

$\nu_{rel} = (1/2\pi)(A P_0/\tau_P)^{1/2}$,

where $P_0$ is the steady state photon density in the active region, $\tau_P$ is the photon lifetime
and A is the optical gain coefficient. This could be because (Yariv 1985) optical pulse
generation combining the relaxation oscillation phenomenon of the laser diode with the
large current capability of avalanche transistors can lead to optical pulses of ~ 100 ps
width. According to Baker (1991), if the current impulses of the avalanche transistors
return to zero before the occurrence of the second oscillation of the laser diode, only one
optical impulse will be generated, whereas a current step applied to the laser diode
generates a train of pulses. This type of phenomenon (Chik et al 1980; Elliot et al 1983;
MacFarlane & Tatum 1992) seems to be due to the presence of defects in the laser
cavity. These defects act as saturable absorbers for laser light and, as a result, produce
optical pulses of short duration. Detailed emission characteristics of these diode
lasers are published elsewhere (Rai et al 1995b).

This phenomenon of multiple pulsation observed in the laser diode has been used to
test the streak camera (Rai et al 1995a). Figure 15 shows the intensity profile in time of
a laser diode emission at $I_0/I_{Th}$ ~ 1.32, where $I_0$ and $I_{Th}$ are the pump and threshold
currents of the diode laser. It shows the time period of repetition of pulses as ~ 400 ps.
Figure 17 shows a similar period of repetition (~ 400 ps) when optical pulses were
recorded using a fast photodiode and an oscilloscope (L&T Gould model 7074) in
equivalent time sampling mode. This indicates the successful operation of the streak
camera. The equidistant optical pulses provided similar information about the
linearity of the sweep (within 10%) as well as a similar streak velocity (~ $1.5 \times 10^{9}$ cm/s)
to that observed by shifting the single optical pulse as discussed before. Comparison of
figures 15 and 17 indicates that the streak camera record has two equal intensity peaks
between two small pulses, while the CRO trace has a single central pulse of large
intensity. This discrepancy is due to the amplitude jitter in the diode laser
emission, because figures 15 and 17 were recorded during two different shots. From shot
to shot the amplitudes of the optical pulses changed, while the nature of the pulses and the time
period between them remained the same. However, a time jitter of ~ 500 ps has been
recorded from one shot to another.

Figure 17. Multiple pulsation similar to figure 15 but recorded using L&T Gould model 7074 oscilloscope (100 MHz). Horizontal scale: 2 ns/div.
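
The unit conversions behind the quoted streak velocity are easy to get wrong, so a small sketch may help. It converts the measured 15 mm separation per 1 ns delay into cm/s and, as a derived estimate that is not stated in the text, gives the spacing on the screen expected for pulses repeating every 400 ps if the sweep is assumed to be perfectly linear. The function names are hypothetical.

```python
def streak_velocity_cm_per_s(separation_mm, delay_ns):
    """Streak velocity from the image separation produced by a known delay."""
    return (separation_mm / 10.0) / (delay_ns * 1e-9)

def screen_separation_mm(period_ps, streak_rate_mm_per_ns):
    """Spacing on the screen of pulses with a given period (linear sweep assumed)."""
    return (period_ps * 1e-3) * streak_rate_mm_per_ns

velocity = streak_velocity_cm_per_s(15.0, 1.0)   # 15 mm for a 1 ns delay
spacing = screen_separation_mm(400.0, 15.0)      # 400 ps period at 15 mm/ns
repetition_hz = 1.0 / 400e-12                    # pulsation frequency

print(f"streak velocity = {velocity:.2e} cm/s")
print(f"pulse spacing   = {spacing:.1f} mm")
print(f"repetition rate = {repetition_hz / 1e9:.1f} GHz")
```

The first value matches the streak velocity quoted in the text; the 6 mm spacing and 2.5 GHz repetition rate are simple consequences of the 400 ps period.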

5.2 Temporal resolution of optical streak camera 

The temporal resolution of a streak camera is mainly dependent on three factors
(Tsuchiya 1983).

(i) The transit time spread, which is caused by the initial velocity
distribution of the photoelectrons emitted from the photocathode at the same time.
This occurs in the low velocity domain between the photocathode and the mesh
electrode and is given by

$\Delta t_1 = 2.34 \times 10^{-6} [(\Delta\varepsilon)^{1/2}/E]$, (3)

where $\Delta\varepsilon$ is the FWHM of the energy distribution of the emitted photoelectrons, which is
~ 0.4 eV when a photon of 904 nm falls on the S1 photocathode. E is the accelerating
electric field near the photocathode, which is ~ $10^{6}$ V/m in our case, and gives
a time spread $\Delta t_1$ ~ 1.5 ps. A further time spread arises between the mesh
and the phosphor screen, but it is very small in comparison.

(ii) The technical or streak limited time resolution, which is given by

$\Delta t_2 = 1/(\delta v)$, (4)

where v ≈ $1.5 \times 10^{10}$ mm/s is the measured streak velocity and $\delta$ ~ 10 lp/mm is the
measured spatial resolution of the streak camera. This provides $\Delta t_2$ ~ 6.6 ps.

(iii) The time spread $\Delta t_3$, which occurs due to the effect of the deflection electric field on the
photoelectron beam. It depends on the beam diameter, the length of the deflecting plate, the
deflection electric field and the axial (directional) velocity of the electrons. This time
spread is typically ~ 1 ps. The actual time resolution of the streak camera, $\Delta t$, is given by

$\Delta t = (\Delta t_1^2 + \Delta t_2^2 + \Delta t_3^2)^{1/2}$, (5)

which comes out to be ~ 7 ps. This is the estimated time resolution of the streak camera
obtained using (5).
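
The estimate above can be reproduced numerically. The sketch below evaluates (3), (4) and (5) with the values given in the text, reading the energy spread as 0.4 eV and taking the deflection spread as 1 ps; in (3) the result is in seconds when the energy spread is in eV and the field in V/m. The function names are illustrative only.

```python
import math

def transit_time_spread(energy_spread_ev, field_v_per_m):
    """Equation (3): spread in seconds for an energy spread in eV and a field in V/m."""
    return 2.34e-6 * math.sqrt(energy_spread_ev) / field_v_per_m

def streak_limited_resolution(spatial_res_lp_per_mm, streak_velocity_mm_per_s):
    """Equation (4): technical (streak limited) resolution in seconds."""
    return 1.0 / (spatial_res_lp_per_mm * streak_velocity_mm_per_s)

def total_resolution(*spreads_s):
    """Equation (5): independent spreads added in quadrature."""
    return math.sqrt(sum(s * s for s in spreads_s))

dt1 = transit_time_spread(0.4, 1e6)            # S1 photocathode at 904 nm
dt2 = streak_limited_resolution(10.0, 1.5e10)  # 10 lp/mm, 1.5e10 mm/s
dt3 = 1e-12                                    # typical deflection spread
dt = total_resolution(dt1, dt2, dt3)

for name, value in (("dt1", dt1), ("dt2", dt2), ("dt3", dt3), ("total", dt)):
    print(f"{name:5s} = {value * 1e12:.1f} ps")
```

The streak limited term dominates the sum, which is why a faster sweep or better spatial resolution is the most direct route to a better estimated time resolution.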

The time resolution of this streak camera was measured using a Michelson
interferometric system with a mode-locked Nd:YAG laser (M/s Continuum, USA). The
main beam was split into two beams and was directed towards a pin hole which was
focussed on the photocathode of the streak camera. By moving one of the two mirrors
of the interferometer, one pulse was delayed with respect to the other according to the
requirement. This experiment provided the time duration of the laser and the time
resolution of the streak camera as ~ 39 ps and ~ 40 ps respectively. However, the time
resolution was also calculated using the following relation

$\tau_m = (\tau_R^2 + \Delta t^2)^{1/2}$, (6)

where $\tau_m$ ≈ 39 ps and $\tau_R$ ≈ 35 ps are the measured and reported (M/s Continuum, USA)
time durations of the Nd:YAG laser pulse respectively. Equation (6) provides a time
resolution $\Delta t$ ≈ 17 ps, which is more than the expected value of ~ 7 ps and less than the
measured value of ~ 40 ps. This indicates that the measured value of the time resolution is
being limited by the large time duration of the test pulse (35 ps). An error of 10-20% is
also possible in the measurement of time duration from the recorded images. These factors
can lead to an increased value of the measured time resolution. This indicates the necessity of
a shorter optical pulse as well as a good optical image recording system
for the measurement of the time resolution of the streak camera. However, optimization of
the camera parameters, such as the biasing voltages and the wiring arrangement at the
deflection plates, and improvement in the optical imaging system at the input and output
of the camera should provide an even better time resolution and accurate time
information about the events.

Figure 18. Contact photograph of streak image of X-rays and UV emission from laser produced plasma. A, B and C are images of different shots.
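
Equation (6) can be inverted to recover the instrument resolution from the measured and reported pulse durations. The sketch below does this under the usual assumption, implicit in adding widths in quadrature, that both the pulse and the instrument response are roughly Gaussian; the function name is hypothetical.

```python
import math

def instrument_resolution_ps(measured_ps, source_ps):
    """Invert tau_m^2 = tau_R^2 + dt^2 for dt (quadrature assumption)."""
    if measured_ps < source_ps:
        raise ValueError("measured width cannot be shorter than the source pulse")
    return math.sqrt(measured_ps**2 - source_ps**2)

resolution = instrument_resolution_ps(39.0, 35.0)
print(f"time resolution = {resolution:.1f} ps")
```

Because the two widths are close, the result is very sensitive to measurement error: a few per cent change in either duration shifts the inferred resolution by several picoseconds, which is consistent with the call for a shorter test pulse.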

5.3 Study using X-ray streak camera 

Figure 18 shows the photograph of UV and X-ray radiation emitted from the plasma
produced by irradiating a copper target with a 35 picosecond mode-locked laser.
The time duration of the UV radiation was found to be more than 20 ns, whereas the X-ray emission
lasted for < 200 ps. The streak rates were found to be ~ 8 mm/ns and 15 mm/ns when
sweep voltages of 5 ns and 1 ns rise time were used respectively. These parameters are
similar to those of the optical streak camera because of the similarity in the streak and
intensifier tubes along with their driving electronics.

In spite of all the similarity between the two cameras, the photocathodes used are different.
The transit time spread ($\Delta t_1$) of photoelectrons emitted from the gold photocathode
(energy spread, $\Delta\varepsilon$ = 5 eV) in the case of the X-ray streak camera is calculated to be ~ 5 ps.
This factor leads to a higher value of time resolution than for the optical streak camera.
The expected time resolution for this streak camera is ~ 15 ps. However, we could not
find the experimental value of the time resolution for the X-ray streak camera due to
experimental limitations.

6. Conclusions 

In summary, we have developed optical and X-ray streak cameras which are
operating successfully with a fastest streak speed of ~ $1.5 \times 10^{9}$ cm/s and a temporal
resolution of ~ 17 ps (optical streak camera). Since the sweep circuit developed for this
purpose provides positive and negative polarity pulses of ~ 1 kV amplitude with a rise time
< 1 ns, an improvement in the wiring arrangement for applying the sweep voltage to the
deflection plates and an optimization of the biasing voltages, along with better input and
output imaging and recording systems, will improve the streak speed and time resolution of
the streak camera. However, to measure the time resolution accurately it is necessary to
have an optical pulse of a few ps duration. A combination of the trigger pulse generator and
the laser diode driver circuit, used with a single heterojunction GaAs laser diode, provided
a simple and stable picosecond optical pulse generator which generated single as well
as multiple pulses depending on the amplitude of the input current flowing in it. It was used
successfully to test the optical streak camera.


The helpful discussions with V Chevokin from General Physics Institute, Moscow, 
S Majumder of Delli Delti, UK, B S Narayan, K S Bindra, A G Bhujle are gratefully 
acknowledged. We are thankful to R Arya for providing some of the high voltage power 
supplies, R K Khardekar for providing CCD camera and H S Vora for providing the 
software ‘PROMISE’ to analyse the streak camera images. 
Abstract. A mathematical model is developed for the analysis of spatially
varied steady flow in irrigation canals. The model accounts for canal
seepage and the effect of control structures at the upstream and downstream
ends of the canal. Two computational methods developed to solve the
spatially varied steady flow equations for irrigation canals are presented
here. The governing differential equations are solved iteratively using the fourth
order explicit Runge-Kutta method. The model results are verified with
experimentally observed water surface profiles available in the literature. The
effects of bed seepage, canal condition and backwater curves on the
discharge carrying capacity and variation of flow depth are studied through
model application on a canal reach. It is found that in most situations
the backwater curves spread sufficiently far upstream and significantly affect
the performance of the control structure at the upstream end. In many
situations, it may not even be possible to operate the canal at design
discharges.

Keywords. Steady flow; open channels; irrigation canals. 


1. Introduction

By and large, canals are designed based on uniform flow criterion coupled with regime 
concepts. However, there is an ambiguity about the applicability of the uniform flow 
concept to spatially varied flows. In practice, seepage losses are estimated based on 
experience and are lumped at canal ends. However, the water is continuously lost as 
seepage along the canal reach. Even under idealized conditions of no seepage, 
a uniform flow at all locations in a canal reach is not feasible due to the presence of 
measuring/regulation control structures. The actual depth and discharge in the canals
are significantly different from the design values. In developing countries like India,
almost all the canal systems are of the manually controlled type. The supplies are made under
steady state. The steady flow simulation in irrigation canals can play a very important
role in irrigation scheduling of the existing canal systems and in design and planning of 
proposed canal irrigation projects. 
The analysis of spatially varied steady flow in irrigation canals involves the solution
of the continuity and momentum equations for open channels. In the limiting case of no lateral
flow, these equations reduce to the dynamic equation of gradually varied steady flow.
Many methods, such as graphical methods, direct integration methods and step
methods (Chow 1959; French 1986), are used to numerically integrate the dynamic
equation of gradually varied flow. The standard step method is widely used to compute
varied flow profiles. This procedure involves the iterative solution of nonlinear
algebraic equations at every computational node (Paine 1992). The use of the same
numerical techniques to analyse gradually varied or spatially varied flow problems in
an irrigation canal is not simple.

Hager (1983, 1991) and Schropp & Fontijn (1989) presented a general theory of
spatially varied flows with increasing and decreasing discharges. They discussed in
detail the nature of steady spatially varied flow profiles, the requirement of boundary
conditions, and the properties of saddle and nodal (singular) points. Many investigators (Chow
1959; Hager 1983) analysed the problem of the side channel spillway (increasing discharge),
wherein the rate of lateral inflow is independent of the flow in the channel being
modelled. In the case of a side weir or a channel with bottom racks (decreasing
discharge), on the other hand, channel friction and bed slope are usually not considered
(Subramanya 1986; Uyumaz & Smith 1991). However, the seepage from canals is characterized by
canal size and shape, hydraulic conductivity of the soil, location of the water table, weed
growth and flow depth in the canal (Leliavsky 1965; Wachyan & Rushton 1987). The
orders of magnitude of the bed slope, friction and seepage terms in the governing equations are
comparable (Misra 1993).


1.1 Statement of the problem 

The analysis problem often encountered in irrigation canals is to compute the water
surface profile and evaluate the discharge carrying capacity of a canal reach for given canal
geometry, roughness, seepage coefficients and control structure settings at both the
canal ends (figure 1a). The problem is implicit in the sense that the discharge or depth at
either of the boundaries is not known. The discharge from the upstream control
structure may depend on the flow depth at the upstream end (submerged gate) and is
not known a priori. On the other hand, the depth at the tail end depends upon the
discharge, which cannot be fixed without estimating seepage losses and the discharge from
the upstream gate. Moreover, the rate of seepage at a location in a canal depends upon
the flow depth and is not a constant.

French (1986) suggested an iterative procedure to compute discharge released into 
a channel from a reservoir, for a given channel geometry for specified upstream or 
downstream depth. However, he did not include the effect of seepage, downstream 
control and submergence of upstream control as stated above. 

This paper presents a mathematical model for the analysis of spatially varied steady
flow in irrigation canals. Two iterative computational procedures to analyse spatially
varied steady flow in canals are presented. In both the methods iterations are
performed to satisfy the boundary conditions. The continuity and momentum equations
of spatially varied steady flow are solved using the fourth order explicit Runge-Kutta
method. The model results are compared with the experimental observations of
Schropp & Fontijn (1989). The effect of canal seepage, canal condition, and backwater
curves on varied flow profiles and canal capacity are studied through model application
for different sets of parameters on a canal reach.

Figure 1. Definition sketch of a canal reach (a) elevation and (b) cross-section.


2. Governing equations 


The governing equations of spatially varied steady flow in a canal are written as 
(Chow 1959), 

$$\frac{dQ}{dx} = q_s \tag{1}$$


$$\frac{dy}{dx} = \frac{S_0 - S_f - \dfrac{2\beta Q q_s}{gA^2}}{1 - \dfrac{\beta Q^2 T}{gA^3}} \tag{2}$$


where $x$ = distance along the canal; $Q$ = discharge; $y$ = depth of flow; $g$ = acceleration 
due to gravity; $T$ = top width; $A$ = area of cross-section; $S_0$ = bed slope; $\beta$ = momentum 
correction coefficient, which is taken as 1; $S_f$ = friction slope given by Manning's 
equation written as, 

$$S_f = \frac{n^2 Q^2}{A^2 R^{4/3}} \tag{3}$$

where n = Manning’s coefficient and R = hydraulic radius. 

In (1) and (2), $q_s$ = rate of seepage per unit length, given by 

$$q_s = -P_s\left(\alpha y^{\beta} + \gamma\right), \tag{4}$$


where $\alpha$, $\beta$ and $\gamma$ = constants; $P_s$ = perimeter of seepage face. For $\alpha = K_d/d$, $\beta = 1$, 
$\gamma = K_d$, (4) reduces to Darcy's law, written as, 

$$q_s = -K_d P_s\left[\frac{y + d}{d}\right] \tag{5}$$

where $K_d$ = permeability; $d$ = thickness of the porous bed. However, in field situations it 
is difficult to estimate the thickness $d$. In canal engineering, it is customary to express 
canal losses in units of length/time (Wachyan & Rushton 1987), thereby inferring that 
the canal losses are a function of the wetted perimeter of the canal. By substituting 
$\alpha = K$; $\beta = 0$; $\gamma = 0$ in (4), we get 

$$q_s = -K P_s, \tag{6}$$

where $K$ = seepage coefficient expressed in cumecs/million m$^2$. Depending upon the 
soil texture and type of lining, suitable values of K can be chosen (Leliavsky 1965; 
Wachyan & Rushton 1987). 
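
As a small numerical check of the seepage relations, the following Python sketch evaluates (4), (5) and (6) and confirms that the general form collapses to the two special cases. It assumes that (4) has the form q_s = -P_s(alpha * y^beta + gamma), which is the form implied by the two substitutions quoted above. The function names and the sample values of y, P_s, K_d and d are illustrative assumptions, not data from the paper.

```python
def seepage_general(y, P_s, alpha, beta, gamma):
    """Assumed form of eq. (4): seepage per unit canal length (m^2/s), negative for a loss."""
    return -P_s * (alpha * y**beta + gamma)


def seepage_darcy(y, P_s, K_d, d):
    """Eq. (5): Darcy's law for a porous bed of thickness d."""
    return -K_d * P_s * (y + d) / d


def seepage_perimeter(P_s, K):
    """Eq. (6): loss proportional to the seepage perimeter only."""
    return -K * P_s


# Illustrative values (assumptions for this sketch only)
y, P_s = 1.5, 7.4          # flow depth (m) and seepage perimeter (m)
K_d, d = 1.0e-6, 0.5       # permeability (m/s) and bed thickness (m)
K = 9.7536e-6              # 9.7536 cumecs per million m^2 expressed in m/s

q4_as_darcy = seepage_general(y, P_s, alpha=K_d / d, beta=1.0, gamma=K_d)
q5 = seepage_darcy(y, P_s, K_d, d)
q4_as_eq6 = seepage_general(y, P_s, alpha=K, beta=0.0, gamma=0.0)
q6 = seepage_perimeter(P_s, K)

print(q4_as_darcy, q5)     # the two values agree
print(q4_as_eq6, q6)       # the two values agree
```

Since 1 cumec per million m^2 equals 10^-6 m/s, a coefficient quoted in those units can be used directly in SI form as shown for K.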

By and large, the irrigation canals are trapezoidal in shape and for this shape the 
governing equations (1) and (2) can be written as, 

$$\frac{dQ}{dx} = \mathcal{G}(x, y) = -K\left[B + 2y(z^2 + 1)^{1/2}\right], \tag{7}$$

$$\frac{dy}{dx} = \mathcal{F}(x, y, Q) = \frac{S_0 - \dfrac{n^2 Q^2 \left[B + 2y(z^2 + 1)^{1/2}\right]^{4/3}}{\left[(B + zy)y\right]^{10/3}} + \dfrac{2\beta K Q \left[B + 2y(z^2 + 1)^{1/2}\right]}{g\left[(B + zy)y\right]^{2}}}{1 - \dfrac{\beta Q^2 (B + 2zy)}{g\left[(B + zy)y\right]^{3}}}, \tag{8}$$

where $B$ = bed width of the canal; $z$ = side slope (figure 1b). For all the results presented 
in § 6 of this paper, (7) and (8) are used. 
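
The right-hand sides of (7) and (8) for a trapezoidal section can be written as two short Python functions. This is a minimal sketch: it assumes SI units, takes the seepage perimeter equal to the wetted perimeter, and reads (7) with a negative sign so that it agrees with (1) and (6). The function names and the sample depth and discharge are illustrative assumptions.

```python
import math

GRAV = 9.81  # acceleration due to gravity (m/s^2)


def area(y, B, z):
    """Flow area of a trapezoidal section."""
    return (B + z * y) * y


def wetted_perimeter(y, B, z):
    """Wetted perimeter, used here as the seepage perimeter P_s."""
    return B + 2.0 * y * math.sqrt(z * z + 1.0)


def G(x, y, K, B, z):
    """Right-hand side of (7): dQ/dx = -K * P_s."""
    return -K * wetted_perimeter(y, B, z)


def F(x, y, Q, K, B, z, S0, n, beta=1.0):
    """Right-hand side of (8): dy/dx for spatially varied flow with seepage."""
    A = area(y, B, z)
    P = wetted_perimeter(y, B, z)
    T = B + 2.0 * z * y                                   # top width
    Sf = n**2 * Q**2 * P ** (4.0 / 3.0) / A ** (10.0 / 3.0)  # Manning friction slope
    seep = 2.0 * beta * K * Q * P / (GRAV * A**2)
    return (S0 - Sf + seep) / (1.0 - beta * Q**2 * T / (GRAV * A**3))


# Canal data of section 6; the depth and discharge are assumed sample values
canal = dict(K=9.7536e-6, B=2.0, z=1.5)
dQdx = G(0.0, 1.8, **canal)
dydx_deep = F(0.0, 1.8, 4.4, S0=1.0 / 2500.0, n=0.030, **canal)
dydx_shallow = F(0.0, 1.2, 4.4, S0=1.0 / 2500.0, n=0.030, **canal)
print(dQdx, dydx_deep, dydx_shallow)
```

For these sample values the depth gradient is positive at the larger depth and negative at the smaller one, which is the sign change that separates backwater-type from drawdown-type profiles.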


2.1 Boundary conditions 

To solve (1) and (2) for the two unknowns $Q(x)$ and $y(x)$, two boundary conditions are 
required. A specified depth and discharge, or a rating curve, can be used as a boundary 
condition. The discharge equations of control structures such as weirs, gates, 
siphons, falls etc. can also be used as boundary conditions. Under subcritical flow 
conditions, which by and large prevail in irrigation canals, at least one boundary 
condition is to be specified at the downstream end and flow profile computations are 
carried out opposite to the direction of flow. However, both depth and discharge can 
also be specified at the downstream end, though it is not common. In case of an 
irrigation canal, the discharge is regulated by the control structure situated at the 
upstream end, whereas supply level (flow depth) is maintained using the downstream 
control. 

For the results presented in this paper, a submerged gate at the upstream end and 
a rectangular weir at the downstream end are used as boundary conditions (figure 1). 
The submerged gate discharge equation is written as (Bos 1976), 

$$Q = (2g)^{1/2} C_d C_v b_c g_t (h_u - h_d)^{1/2} \tag{9}$$

and the discharge equation for the weir is written as, 



$$Q = \frac{2}{3}\left(\frac{2}{3}g\right)^{1/2} C_d C_v b_c h_u^{3/2}, \tag{10}$$


where $Q$ = discharge from the structure; $C_d$ = coefficient of discharge, which for a weir 
depends upon many factors such as shape, $(h/l)$, $(h/p)$ etc.; $b_c$ = width of the control 
section; $g_t$ = gate opening; $h_u$ = head upstream of the control structure; $h_d$ = head 
downstream of the control structure; $C_v$ = approach velocity coefficient, obtained from 


$$C_v^{2/3} - \,\ldots\, = 0 \tag{11}$$


where $A_u$ = area of flow upstream of the control structure; $A_0$ = area of flow at the control 
structure; for a gate $A_0 = b_c g_t$ and for a weir $A_0 = b_c h_u$. 
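
The two boundary relations can be sketched in Python as follows. Several points are assumptions of this sketch rather than statements of the paper: the weir equation (10) is taken in the broad-crested form Q = (2/3)(2g/3)^(1/2) C_d C_v b_c h_u^(3/2), the approach velocity coefficient C_v is set to 1 instead of being solved from (11), and C_d is held constant. The function names are illustrative.

```python
import math

GRAV = 9.81  # acceleration due to gravity (m/s^2)


def gate_discharge(h_u, h_d, C_d, b_c, g_t, C_v=1.0):
    """Eq. (9): submerged gate; returns 0 when there is no head difference."""
    head = max(h_u - h_d, 0.0)
    return math.sqrt(2.0 * GRAV) * C_d * C_v * b_c * g_t * math.sqrt(head)


def weir_discharge(h_u, C_d, b_c, C_v=1.0):
    """Eq. (10) in the assumed broad-crested form."""
    return (2.0 / 3.0) * math.sqrt(2.0 * GRAV / 3.0) * C_d * C_v * b_c * h_u**1.5


def weir_head_newton(Q, C_d, b_c, C_v=1.0, h0=1.0, tol=1e-12, max_iter=50):
    """Newton's method for the head h that satisfies weir_discharge(h) = Q."""
    c = (2.0 / 3.0) * math.sqrt(2.0 * GRAV / 3.0) * C_d * C_v * b_c
    h = h0
    for _ in range(max_iter):
        f = c * h**1.5 - Q               # residual
        df = 1.5 * c * math.sqrt(h)      # derivative of the residual
        h_new = h - f / df
        if h_new <= 0.0:                 # keep the iterate physically meaningful
            h_new = 0.5 * h
        if abs(h_new - h) < tol:
            return h_new
        h = h_new
    raise RuntimeError("Newton iteration did not converge")


# Gate data of section 6 with an assumed canal depth of 1.65 m behind the gate
Q_gate = gate_discharge(h_u=2.0, h_d=1.65, C_d=0.6, b_c=2.0, g_t=1.4)
# Head over a 1.75 m wide weir passing 4.0 m^3/s with an assumed C_d of 0.9
h_weir = weir_head_newton(Q=4.0, C_d=0.9, b_c=1.75)
print(Q_gate, h_weir)
```

Adding the weir height p to the computed head gives the flow depth at the tail end, which is the starting depth for the profile computations.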


3. Computational procedure 

The initial guess values of unknowns viz., y and Q are to be specified to start the 
iterative procedure. For the guessed/corrected discharges, the depth at the downstream 
end is computed using the downstream boundary condition by Newton’s method. For 
this depth, spatially varied flow computations are performed in the canal. Knowing the 
depth at the upstream end of the canal, the discharge in the canal is corrected. The 
discharge at the downstream end is now updated, and the procedure of computing the 
depth at the downstream end and varied flow profile computations, is repeated until 
the depth and discharge convergence is met. 

Based on the above procedure, two different computational methods are developed. 
These methods differ mainly in the procedure used to solve the varied flow equations. The 
concept of the standard step method can also be extended to solve spatially varied flow 
equations. But this method requires an iterative procedure to compute depth and 
discharge at every computational node and may not be computationally economical. 
Hence, in the present analysis the fourth-order explicit Runge-Kutta method is used for 
both the methods as discussed below. 

3.1 Method 1 

In this method, the spatially varied flow equations (1) and (2) are solved as a coupled 
system of differential equations from the tail end to the upstream end using the fourth-order 
explicit Runge-Kutta method on a variable grid. The computational procedure is as 
given below. 

(I) For the known/guessed discharge, obtain depth at downstream end using 
downstream boundary condition such as (10). Newton’s method is used. 

(II) Solve (1) and (2) or (7) and (8) for the entire canal reach using the depth and 
discharge at downstream end as starting values. The fourth order explicit 
Runge-Kutta method is used. The depth and discharge computations are 
performed from downstream end to upstream end of the canal. 

(III) Correct the discharge at the upstream end for the upstream depth obtained from 
step II, using upstream boundary condition such as (9). 
(IV) Check for convergence in discharge at upstream end. If convergence is not 
reached, correct the discharge at the downstream end and go to step I. The new 
discharge (corrected) at the downstream end is obtained as, 

$$Q_d = Q'_d + \alpha\,\Delta Q_u \tag{12}$$

where $\alpha$ = relaxation coefficient; $Q_d$ = new discharge at the downstream end; 
$Q'_d$ = previous iteration value of the discharge; and $\Delta Q_u$ = discharge correction made 
at the upstream end. 

The solution procedure using the Runge-Kutta method for solving the coupled system of 
differential equations (1) and (2) is as follows: 

Depth is computed from, 

$$y_i = y_{i+1} + \tfrac{1}{6}\left[a_1 + 2a_2 + 2a_3 + a_4\right]; \quad i = (N-1), \ldots, 1 \tag{13}$$

and discharge is computed from, 

$$Q_i = Q_{i+1} + \tfrac{1}{6}\left[b_1 + 2b_2 + 2b_3 + b_4\right]; \quad i = (N-1), \ldots, 1 \tag{14}$$

where 

$$\begin{aligned}
a_1 &= \Delta x\,\mathcal{F}(x_{i+1},\, y_{i+1},\, Q_{i+1}), \\
b_1 &= \Delta x\,\mathcal{G}(x_{i+1},\, y_{i+1}), \\
a_2 &= \Delta x\,\mathcal{F}(x_{i+1} + \Delta x/2,\; y_{i+1} + a_1/2,\; Q_{i+1} + b_1/2), \\
b_2 &= \Delta x\,\mathcal{G}(x_{i+1} + \Delta x/2,\; y_{i+1} + a_1/2), \\
a_3 &= \Delta x\,\mathcal{F}(x_{i+1} + \Delta x/2,\; y_{i+1} + a_2/2,\; Q_{i+1} + b_2/2), \\
b_3 &= \Delta x\,\mathcal{G}(x_{i+1} + \Delta x/2,\; y_{i+1} + a_2/2), \\
a_4 &= \Delta x\,\mathcal{F}(x_{i+1} + \Delta x,\; y_{i+1} + a_3,\; Q_{i+1} + b_3), \\
b_4 &= \Delta x\,\mathcal{G}(x_{i+1} + \Delta x,\; y_{i+1} + a_3),
\end{aligned} \tag{15}$$

where $N$ = number of computational nodes and $\Delta x$ = space step, taken as 
$(x_i - x_{i+1})$. Here $i$ indicates the node numbering from upstream end to downstream 
end. In (15), $\mathcal{F}(x, y, Q)$ and $\mathcal{G}(x, y)$ are given by (8) and (7) respectively. 
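
Steps I to IV, together with the Runge-Kutta march of (13)-(15), can be sketched in Python as below. This is an illustrative sketch under stated assumptions, not the authors' program: the approach velocity coefficient C_v is taken as 1 at both structures, the weir equation (10) is used in the broad-crested form with C_d from (20), the relaxation coefficient is set to 0.5, a uniform grid of 41 nodes is used, and the initial downstream discharge guess is 4.0 m^3/s. Canal, gate and weir data are those of § 6, with Manning's n = 0.030 and K = 9.7536e-6 m/s taken from the reference case. Because of these assumptions the printed values need not match the tabulated results exactly.

```python
import math

GRAV = 9.81
# Canal, gate and weir data of section 6; n and K are those of the reference case.
B, Z, S0, LENGTH = 2.0, 1.5, 1.0 / 2500.0, 2000.0
N_MAN, K_SEEP = 0.030, 9.7536e-6
GATE_B, GATE_OPEN, GATE_CD, GATE_HU = 2.0, 1.4, 0.6, 2.0
WEIR_B, WEIR_P, WEIR_L = 1.75, 0.6, 2.0


def perimeter(y):
    return B + 2.0 * y * math.sqrt(Z * Z + 1.0)


def G(x, y):
    """Eq. (7): dQ/dx."""
    return -K_SEEP * perimeter(y)


def F(x, y, Q):
    """Eq. (8) with beta = 1: dy/dx."""
    A, P, T = (B + Z * y) * y, perimeter(y), B + 2.0 * Z * y
    Sf = N_MAN**2 * Q**2 * P ** (4.0 / 3.0) / A ** (10.0 / 3.0)
    seep = 2.0 * K_SEEP * Q * P / (GRAV * A**2)
    return (S0 - Sf + seep) / (1.0 - Q**2 * T / (GRAV * A**3))


def rk4_step(x, y, Q, dx):
    """One step of (13)-(15); dx < 0 marches from node i+1 to node i."""
    a1, b1 = dx * F(x, y, Q), dx * G(x, y)
    a2 = dx * F(x + dx / 2, y + a1 / 2, Q + b1 / 2)
    b2 = dx * G(x + dx / 2, y + a1 / 2)
    a3 = dx * F(x + dx / 2, y + a2 / 2, Q + b2 / 2)
    b3 = dx * G(x + dx / 2, y + a2 / 2)
    a4 = dx * F(x + dx, y + a3, Q + b3)
    b4 = dx * G(x + dx, y + a3)
    return (y + (a1 + 2 * a2 + 2 * a3 + a4) / 6.0,
            Q + (b1 + 2 * b2 + 2 * b3 + b4) / 6.0)


def weir_q(h):
    """Eq. (10) with C_d from (20); C_v = 1 and the broad-crested form are assumptions."""
    r = h / WEIR_L
    cd = 0.867 if r <= 0.4 else (1.09 if r >= 1.6 else 0.191 * r + 0.792)
    return (2.0 / 3.0) * math.sqrt(2.0 * GRAV / 3.0) * cd * WEIR_B * h**1.5


def tail_depth(Q, h=1.0):
    """Step I: Newton's method on weir_q(h) - Q = 0 with a finite-difference slope."""
    for _ in range(100):
        slope = (weir_q(h + 1e-6) - weir_q(h)) / 1e-6
        step = (weir_q(h) - Q) / slope
        h = max(h - step, 1e-3)
        if abs(step) < 1e-10:
            break
    return WEIR_P + h


def gate_q(y_up):
    """Eq. (9) with C_v = 1 (assumption) and zero sill height, so h_d = y_up."""
    head = max(GATE_HU - y_up, 0.0)
    return math.sqrt(2.0 * GRAV) * GATE_CD * GATE_B * GATE_OPEN * math.sqrt(head)


def method1(Q_d=4.0, nodes=41, alpha=0.5, tol=1e-8, max_iter=200):
    dx = -LENGTH / (nodes - 1)
    for iteration in range(1, max_iter + 1):
        x, y, Q = LENGTH, tail_depth(Q_d), Q_d       # step I
        for _ in range(nodes - 1):                    # step II
            y, Q = rk4_step(x, y, Q, dx)
            x += dx
        dQ_u = gate_q(y) - Q                          # step III
        if abs(dQ_u) < tol:                           # step IV
            return Q, Q_d, y, iteration
        Q_d += alpha * dQ_u                           # eq. (12)
    raise RuntimeError("Method 1 did not converge")


Q_u, Q_d, y_u, iterations = method1()
print(f"Q_u = {Q_u:.3f} m^3/s, Q_d = {Q_d:.3f} m^3/s, y_u = {y_u:.3f} m, {iterations} iterations")
```

The relaxation coefficient matters here: the gate discharge is sensitive to the upstream depth, so an unrelaxed correction can overshoot, whereas a value below 1 damps the oscillation.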

3.2 Method 2 

In this method, (1) and (2) or (7) and (8) are solved separately, and the Runge-Kutta method is 
modified as discussed below. The method consists of two steps: (i) solving for 
discharges in the forward sweep and (ii) solving for depth in the backward sweep. In the 
forward sweep, for the discharge obtained from the upstream boundary condition and 
for the guessed/known depth in the canal, the discharge computations are made from 
the upstream end to the downstream end of the canal using the continuity equation (7). For the 
computed discharge at the downstream end, the depth is obtained from the boundary 
condition using Newton's method. For this depth and the known discharge in the canal, 
the momentum equation (8) is solved for the depth in the canal from the downstream end 
to the upstream end. This procedure is repeated until the depth and discharge convergence 
is met at the boundaries. The discharge computations (forward sweep) using the Runge-Kutta 
method on a variable grid are as follows: 

$$Q_{i+1} = Q_i + \tfrac{1}{6}\left[b_1 + 2b_2 + 2b_3 + b_4\right]; \quad i = 1, \ldots, (N-1) \tag{16}$$

where 

$$\begin{aligned}
b_1 &= \Delta x\,\mathcal{G}(x_i,\, y_i), \\
b_2 &= \Delta x\,\mathcal{G}(x_i + \Delta x/2,\; y_i + a_1/2), \\
b_3 &= \Delta x\,\mathcal{G}(x_i + \Delta x/2,\; y_i + a_2/2), \\
b_4 &= \Delta x\,\mathcal{G}(x_i + \Delta x,\; y_i + a_3),
\end{aligned} \tag{17}$$

where $\Delta x = x_{i+1} - x_i$; $N$ = number of computational nodes; and $\mathcal{G}(x, y)$ is given by (7). 
In (17), the $a$ values are those obtained in the backward sweep for depth computations 
using (19). The depth computations in the backward sweep using the Runge-Kutta 
procedure are as follows: 

$$y_i = y_{i+1} + \tfrac{1}{6}\left[a_1 + 2a_2 + 2a_3 + a_4\right]; \quad i = (N-1), \ldots, 1 \tag{18}$$

where 

$$\begin{aligned}
a_1 &= \Delta x\,\mathcal{F}(x_{i+1},\, y_{i+1},\, Q_{i+1}), \\
a_2 &= \Delta x\,\mathcal{F}(x_{i+1} + \Delta x/2,\; y_{i+1} + a_1/2,\; Q_{i+1} + b_1/2), \\
a_3 &= \Delta x\,\mathcal{F}(x_{i+1} + \Delta x/2,\; y_{i+1} + a_2/2,\; Q_{i+1} + b_2/2), \\
a_4 &= \Delta x\,\mathcal{F}(x_{i+1} + \Delta x,\; y_{i+1} + a_3,\; Q_{i+1} + b_3),
\end{aligned} \tag{19}$$

where $\Delta x = x_i - x_{i+1}$ and $\mathcal{F}(x, y, Q)$ is given by (8). In (19), the $b$ values are those obtained 
in the forward sweep for discharge computations using (17). 
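
The alternation of a forward discharge sweep and a backward depth sweep can be illustrated with the simplified Python sketch below. It deliberately departs from (17) and (19) in one respect, which should be read as a substitution made for brevity: instead of reusing stored Runge-Kutta coefficients from the other sweep, the intermediate values of depth (in the forward sweep) and discharge (in the backward sweep) are interpolated linearly between nodes of the latest profile. Further assumptions: the upstream discharge is held fixed at 4.4 m^3/s rather than updated from the gate equation, the tail depth comes from a broad-crested weir relation with a constant C_d of 0.9 and C_v = 1, and the grid is uniform. All names are illustrative.

```python
import math

GRAV = 9.81
B, Z, S0, N_MAN, K_SEEP = 2.0, 1.5, 1.0 / 2500.0, 0.030, 9.7536e-6


def perimeter(y):
    return B + 2.0 * y * math.sqrt(Z * Z + 1.0)


def G(x, y):
    """Eq. (7): dQ/dx."""
    return -K_SEEP * perimeter(y)


def F(x, y, Q):
    """Eq. (8) with beta = 1: dy/dx."""
    A, P, T = (B + Z * y) * y, perimeter(y), B + 2.0 * Z * y
    Sf = N_MAN**2 * Q**2 * P ** (4.0 / 3.0) / A ** (10.0 / 3.0)
    seep = 2.0 * K_SEEP * Q * P / (GRAV * A**2)
    return (S0 - Sf + seep) / (1.0 - Q**2 * T / (GRAV * A**3))


def forward_sweep(x, y, Q_first):
    """Discharge from upstream to downstream, as in (16); depths are the latest iterate."""
    Q = [Q_first]
    for i in range(len(x) - 1):
        dx = x[i + 1] - x[i]
        y_mid = 0.5 * (y[i] + y[i + 1])          # interpolated depth (simplification)
        b1 = dx * G(x[i], y[i])
        b2 = b3 = dx * G(x[i] + dx / 2, y_mid)
        b4 = dx * G(x[i + 1], y[i + 1])
        Q.append(Q[i] + (b1 + 2 * b2 + 2 * b3 + b4) / 6.0)
    return Q


def backward_sweep(x, Q, y_last):
    """Depth from downstream to upstream, as in (18); discharges are from the forward sweep."""
    y = [0.0] * len(x)
    y[-1] = y_last
    for i in range(len(x) - 2, -1, -1):
        dx = x[i] - x[i + 1]
        Q_mid = 0.5 * (Q[i] + Q[i + 1])          # interpolated discharge (simplification)
        a1 = dx * F(x[i + 1], y[i + 1], Q[i + 1])
        a2 = dx * F(x[i + 1] + dx / 2, y[i + 1] + a1 / 2, Q_mid)
        a3 = dx * F(x[i + 1] + dx / 2, y[i + 1] + a2 / 2, Q_mid)
        a4 = dx * F(x[i], y[i + 1] + a3, Q[i])
        y[i] = y[i + 1] + (a1 + 2 * a2 + 2 * a3 + a4) / 6.0
    return y


def tail_depth(Q, cd=0.9, b=1.75, p=0.6):
    """Depth at the weir from an assumed broad-crested relation with constant C_d."""
    c = (2.0 / 3.0) * math.sqrt(2.0 * GRAV / 3.0) * cd * b
    return p + (Q / c) ** (2.0 / 3.0)


def sweeps(Q_first=4.4, length=2000.0, nodes=41, y_guess=1.6, tol=1e-10, max_iter=50):
    x = [length * i / (nodes - 1) for i in range(nodes)]
    y = [y_guess] * nodes                         # initial guess of depth at all nodes
    for iteration in range(1, max_iter + 1):
        Q = forward_sweep(x, y, Q_first)
        y_new = backward_sweep(x, Q, tail_depth(Q[-1]))
        change = max(abs(u - v) for u, v in zip(y_new, y))
        y = y_new
        if change < tol:
            return x, y, Q, iteration
    raise RuntimeError("sweeps did not converge")


x, y, Q, iterations = sweeps()
print(f"Q: {Q[0]:.3f} -> {Q[-1]:.3f} m^3/s, y: {y[0]:.3f} -> {y[-1]:.3f} m, {iterations} sweeps")
```

Each sweep integrates a single equation, so the depth profile and the discharge profile must both be stored between sweeps, in line with the initial guess needed at every node.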
4. Model verification 

The model results are verified for many steady flow problems (no seepage) presented 
by Chow (1959) and French (1986). The results of varied flow analysis with seepage are 
verified with experimentally observed water surface profiles obtained by Schropp 
& Fontijn (1989) on a laboratory channel. The seepage rate q s is obtained from (5). 
Both the numerical methods presented above gave identical results. Figure 2 presents 
the comparison for profile I (higher discharge) whereas figure 3 presents the comparison 
for profile II (lower discharge) of Schropp & Fontijn (1989). In figures 2 and 3, 
curve 1 presents the comparison of the water surface profile obtained using the proposed 
numerical methods with the one computed by Schropp & Fontijn (1989) using the same 
roughness and seepage coefficients. The two results agree very well with each other. 
A better fit to the experimental data could be obtained (curve 2 in figures 2 and 3) by 
tuning the roughness coefficient as given in table 1. From the table it is found that 
Manning's n value is significantly different for the two discharges. This may be 
associated with the significant differences in seepage rates for the two discharges (Rao 
et al 1994). 


Table 1. Data for model verification. 

Parameters            Computed                   Experimental 
                      Profile I    Profile II    Profile I    Profile II 
Manning's n           0.010        0.010         0.0265       0.0156 
K_d (10^-? m/s)       1.690        0.980         1.690        0.980 
d (m)                 0.100        0.100         0.100        0.100 
Q_u (10^-3 m^3/s)     0.091        0.035         0.091        0.035 
Width (m)             0.80         0.80          0.80         0.80 


5. Comparison of methods 

The relative merits and demerits of the two methods are studied through their 
application to a number of steady flow situations on different canals. These merits and 
demerits are discussed in terms of their generality to simulate complex steady flow in 
a canal, their simplicity in implementation, storage requirement and computational 
efficiency. The requirement of initial guess for the two methods, to start the iterative 
procedure, is also discussed. 

Method 2 requires initial guess values of $Q$ and $y$ at all computational nodes, 
whereas method 1 requires only the initial guess of discharge at the downstream end to 
start the computations. The uniform flow depth and discharge are used as the initial guess 
of the unknowns in the present study. Apart from the initial guess, method 2 requires initial 
values of $a_1$, $a_2$, $a_3$ and $a_4$ to start the iterative procedure. The starting values of these 
Runge-Kutta coefficients are computed using the initial guess values of $Q$ and $y$ for all the 
computational nodes. It should be noted here that method 2 requires extra storage for the 
Runge-Kutta coefficients $a$ and $b$, which is not required for method 1. Method 
2 is found to be computationally more efficient. Both the methods produced identical 
results for all the problems studied. Based on this experience, method 2 is recommended. 

6. Application 

To demonstrate the applicability of the numerical methods and to study the effect of 
different parameters on water surface profiles in irrigation canals, the results obtained 
for a canal reach are presented. The canal is bounded by a submerged gate at the 
upstream end and a weir at the downstream end. The following data are used in the 
computations. 

Canal geometry: Bed width = 2.0 m; side slope = 1:1.5; bed slope = 1:2500; 
length = 2000.0 m; shape = trapezoidal; 

Upstream gate: Width = 2.0 m; gate opening = 1.4 m; coefficient of discharge = 0.6; 
sill height = 0.0; upstream head = 2.0 m; 

Downstream weir: Width = 1.75 m; weir height ($p$) = 0.6 m; weir length ($l$) = 2.0 m; 
coefficient of discharge is given as (a fit to the experimental data of Ramakrishnan 
1979), 

$$C_d = \begin{cases} 0.867, & h/l \le 0.4, \\ 0.191\,(h/l) + 0.792, & 0.4 < h/l < 1.6, \\ 1.09, & h/l \ge 1.6. \end{cases} \tag{20}$$
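
The piecewise fit (20) translates directly into a small Python function. The sketch assumes the three branches read C_d = 0.867 for h/l up to 0.4, C_d = 0.191(h/l) + 0.792 between 0.4 and 1.6, and C_d = 1.09 from 1.6 upward; the function name and the default weir length are illustrative.

```python
def weir_cd(h, l=2.0):
    """Coefficient of discharge of the weir as a function of head h and weir length l, eq. (20)."""
    ratio = h / l
    if ratio <= 0.4:
        return 0.867
    if ratio < 1.6:
        return 0.191 * ratio + 0.792
    return 1.09


for head in (0.5, 1.3, 2.0, 4.0):
    print(head, weir_cd(head))
```

The linear branch gives 0.8684 at h/l = 0.4 and 1.0976 at h/l = 1.6, so the fit has small jumps at the two break points. A root finder applied to the weir equation should therefore not be started exactly at those heads.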

6.1 Effect of seepage 

To study the effect of seepage on varied flow, the results obtained for different seepage 
coefficients are presented. Equation (6) is used to estimate the rate of seepage from the 
canal. Table 2 presents the values of seepage parameter $K$ and the corresponding 
discharges at the upstream and downstream ends of the canal. Case 1 presents the 
limiting case of spatially varied flow analysis, wherein the governing equations (1) and 
(2) or (7) and (8) reduce to gradually varied flow equations (Chow 1959). It should be 
noted here that no computational procedure is suggested for the analysis of gradually 
varied flow in irrigation canals as discussed in § 1. Cases 2-5 present the field range of 
$K$ values for irrigation canals (Leliavsky 1965). Cases 6 and 7 simulate the flow 
situations due to excessive seepage. Figure 4 presents the variation in the flow depth 
along the canal reach. 

Table 2. Discharge distribution for different seepage coefficients. 

Case    Seepage coefficient K    Discharge (m^3/s) 
        (10^-6 m/s)              Upstream end    Downstream end 
1         0.0000                 4.424           4.424 
2         0.3048                 4.426           4.421 
3         2.4384                 4.438           4.396 
4         9.7536                 4.477           4.313 
5        30.4800                 4.583           4.077 
6       100.0000                 4.883           3.311 
7       304.8000                 5.415           1.383 

It is observed that for a very low seepage (case 2 in table 2) the flow profile in the canal 
is very close to the gradually varied flow profile (case 1 in table 2) in the canal (figure 4) 
as may be expected. For the practical range of seepage coefficients (cases 2-5 in table 2) the 
change in flow depth at the upstream end is not very significant, as can be seen from 
curves 2-5 in figure 4. The change in depth between cases 2 and 5 is 1.38%, whereas the 
seepage coefficient is increased 100 times. Due to the smaller change in depth with 
seepage coefficient, the change in discharge at the gate is not significant. The difference 
in discharges between cases 2 and 5 is 3.42% (table 2). However, at the downstream end, 
the flow depth depends on the tail end discharge and the characteristic of the weir. Due 
to increased seepage losses from cases 2-5, the discharge at the downstream end is 
reduced (table 2). Due to this reduced discharge the flow depth also gets reduced. The 
changes in depth at the tail end between cases 2 and 5 are more significant (3.13%) as 
compared to the upstream end. It should be noted here that for cases 1-5 (table 2) the 
effects of canal slope are dominant over the cumulative effects of canal friction and 
seepage, i.e. all along the canal reach, 

$$S_0 > \left(S_f - \frac{2\beta K Q P_s}{gA^2}\right), \tag{21}$$

and hence, the water surface profile exhibits characteristics similar to the backwater 
profile discussed in Chow (1959). 

Figure 4. Effect of seepage on flow depth. 

In situations of excessive seepage (cases 6 and 7), the depth and discharge changes at 
the upstream canal end are significant as seen in table 2 and figure 4. For these cases, the 
nature of the water surface profile changes within the canal reach. The variation in 
flow depth along the canal exhibits the characteristics of both backwater and 
drawdown profiles. For the distances ($0 < x < L_c$) up to which, 

$$S_0 < \left(S_f - \frac{2\beta K Q P_s}{gA^2}\right), \tag{22}$$

the water surface profile exhibits characteristics similar to the drawdown profile, 
and for the distances ($L_c < x < L$), for which 

$$S_0 > \left(S_f - \frac{2\beta K Q P_s}{gA^2}\right), \tag{23}$$

the water surface exhibits characteristics similar to the backwater profile. At $x = L_c$ 
the water surface is parallel to the bed. However, finding such a location by analytical 
means is not possible. For case 6 such a situation occurs at x = 650 m whereas for case 
7 such a situation occurs at x = 1550 m. In case 6, the backwater effects due to the weir 
are felt up to 1350 m from the downstream end whereas in case 7 the backwater effects 
are felt only up to 450 m from the downstream end. The drawdown in the upstream 
canal reach (0 m < x < 650 m for case 6 and 0 m < x < 1550 m for case 7) is mainly due 
to excessive seepage from the canal. It should be noted here that the nature of these 
drawdown profiles due to seepage is not the same as the drawdown profiles discussed in 
Chow (1959) and French (1986) for gradually varied flow (no seepage). 
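
Conditions (21)-(23) amount to checking the sign of S_0 - (S_f - 2 beta K Q P_s / (g A^2)) at a section. The Python sketch below does this for the trapezoidal canal of § 6. The discharges and the seepage coefficient are those listed for case 7 in table 2; the flow depths of 1.5 m and 1.4 m and Manning's n = 0.030 are assumed for illustration only, and the function names are hypothetical.

```python
import math

GRAV = 9.81


def slope_balance(y, Q, K, B=2.0, z=1.5, S0=1.0 / 2500.0, n=0.030, beta=1.0):
    """S_0 - (S_f - 2*beta*K*Q*P_s/(g*A^2)); positive in (21)/(23), negative in (22)."""
    A = (B + z * y) * y
    P = B + 2.0 * y * math.sqrt(z * z + 1.0)
    Sf = n**2 * Q**2 * P ** (4.0 / 3.0) / A ** (10.0 / 3.0)
    return S0 - (Sf - 2.0 * beta * K * Q * P / (GRAV * A**2))


def profile_type(y, Q, K, **canal):
    """Classify the local water surface slope relative to the bed."""
    return "backwater" if slope_balance(y, Q, K, **canal) > 0.0 else "drawdown"


K7 = 304.8e-6                                   # case 7 seepage coefficient (m/s)
upstream = profile_type(y=1.5, Q=5.415, K=K7)   # large discharge near the gate
downstream = profile_type(y=1.4, Q=1.383, K=K7) # reduced discharge near the weir
print(upstream, downstream)                     # prints: drawdown backwater
```

Marching such a check along a computed profile and noting where the sign changes gives a numerical estimate of L_c.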

6.2 Effect of canal condition 

With the passage of time and due to inadequate maintenance, the condition of canals 
deteriorates. The canals become irregular in shape due to silting, erosion and weed 
growth changing the roughness characteristics and seepage losses. All these effects in 
general considerably reduce the discharge carrying capacity of the canals. In the 
present study, it is assumed that the mean canal shape is maintained. The effect of 
backwater profiles, changed roughness and seepage loss on discharge carrying capacity 
of the canals are studied through simulations made for different sets of parameters 
given in table 3. The data regarding canal geometry, upstream gate and downstream 
weir are the same as given earlier. Table 4 presents the discharges at both the ends of 
the canal for different sets of parameters listed in table 3. For comparison purposes, 
the last column in the table presents the discharge assuming uniform flow in the 
canal as used in design. This discharge is obtained by solving the gate equation and 
uniform flow equation simultaneously. Figure 5 presents the variation in depth along 
the canal, whereas figure 6 presents the water surface profiles, for all the cases listed 
in table 3. 

Table 3. Roughness and seepage coefficients for different canal conditions. 

Case    Manning's n    Seepage coefficient K    Canal condition 
                       (10^-6 m/s) 
1       0.012           0.3048                  Very good (improved) 
2       0.020           2.4384                  Good (improved) 
3       0.030           9.7536                  Reference (design) 
4       0.040           9.7536                  Poor (deteriorated) 
5       0.050          30.4800                  Bad (deteriorated) 
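
The design (uniform flow) discharge is described as the simultaneous solution of the gate equation and the uniform flow equation. A minimal Python sketch of that calculation is given below, using bisection on the depth at which the two discharges are equal. It assumes C_v = 1 in the gate equation (9), zero sill height so that the downstream head of the gate equals the canal depth, and Manning's equation for uniform flow. Because C_v is not evaluated from (11), the results need not reproduce the tabulated design discharges exactly. Function names are illustrative.

```python
import math

GRAV = 9.81


def uniform_q(y, n, B=2.0, z=1.5, S0=1.0 / 2500.0):
    """Manning's equation for uniform flow in the trapezoidal canal of section 6."""
    A = (B + z * y) * y
    P = B + 2.0 * y * math.sqrt(z * z + 1.0)
    return A * (A / P) ** (2.0 / 3.0) * math.sqrt(S0) / n


def gate_q(y, h_u=2.0, C_d=0.6, b=2.0, g_t=1.4, C_v=1.0):
    """Eq. (9) with the canal depth y as the downstream head; C_v = 1 is an assumption."""
    return math.sqrt(2.0 * GRAV) * C_d * C_v * b * g_t * math.sqrt(max(h_u - y, 0.0))


def design_discharge(n, lo=0.05, hi=2.0, tol=1e-10):
    """Bisection on f(y) = gate_q(y) - uniform_q(y, n), which decreases with depth."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if gate_q(mid) - uniform_q(mid, n) > 0.0:
            lo = mid
        else:
            hi = mid
    y = 0.5 * (lo + hi)
    return y, uniform_q(y, n)


for n in (0.012, 0.020, 0.030, 0.040, 0.050):   # Manning's n of table 3
    depth, discharge = design_discharge(n)
    print(f"n = {n:.3f}: y = {depth:.3f} m, Q = {discharge:.3f} m^3/s")
```

Bisection is used because the gate discharge falls and the uniform flow discharge rises as the canal depth increases, so their difference changes sign exactly once between a very small depth and the upstream head.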

As the canal condition deteriorates, the roughness and seepage losses are increased. 
The increased roughness increases the flow depth in the canal which in turn reduces the 
discharge from the upstream gate. On the other hand the increase in seepage loss 
should decrease the flow depth which will lead to higher discharge from the upstream 
gate. However, for the range of parameters studied (cases 1-5 in table 3), it is observed 
that the effects of increased canal roughness on flow profiles are more dominant as 
compared to increase in canal seepage. In general, the increased roughness and seepage 
losses affect the performance of the upstream control significantly. The discharge from 
upstream gate is reduced by 35.8% as the canal condition deteriorates from case 1 to 
case 5, whereas the associated increase in upstream depth is 20% (table 4 and figure 5). 

The flow depth at the tail end depends upon the discharge and the characteristics of 
the downstream control (weir). The cumulative effect of reduced upstream discharge 
and increased seepage losses significantly reduces the discharge at the canal end. For the 
range of parameters studied (table 3) the discharge at the tail end of the canal is reduced 
by 44.8%, whereas the reduction in depth is 22.0% (table 4 and figure 5). 


Table 4. Discharges for different canal conditions. 

Canal condition    Actual discharge (m^3/s)           Design discharge (m^3/s) 
(case)             Upstream end    Downstream end     (uniform flow) 
1                  5.565           5.560              6.728 
2                  5.069           5.028              5.631 
3                  4.477           4.313              4.668 
4                  3.929           3.765              3.954 
5                  3.575           3.071              3.397 

Figure 5. Effect of canal condition on flow depth. 


6.3 Backwater effect 

By and large, canals are designed based on uniform flow criterion and control 
structures are also designed based on the uniform flow depth. But uniform flow at all 
locations in a canal is not feasible due to efflux created by the control at the downstream 
end. Furthermore, in the presence of seepage, the concept of uniform flow is itself 
ambiguous. In several situations, the backwater effects due to downstream control may 
extend sufficiently upstream so as to affect the performance of the control structure at 
the upstream end. The present comparisons show that for cases 1-4 (table 3) the actual 
depth is more than the uniform flow depth, confirming the effects of backwater. Hence, 
the design discharges for these cases are considerably higher than the actual discharge 
in the canal. In some situations, it may not be feasible to operate the canal at design 
discharges as flow may overtop the embankment. Also, there may not be adequate 
flexibility available in the gate opening to compensate for these effects. In case 5 of table 
3, it is found that uniform flow depth is more than the actual flow depth. This is due to 
the drawdown caused by higher seepage. For this case the actual discharge is higher 
than the design discharge as seen in table 4. 

Figure 6. Effect of canal condition on water surface profiles. 

In general, the cumulative effects of seepage, canal condition and backwater, reduce 
the discharge carrying capacity of the canal significantly. A comparison of the 
discharges obtained from uniform flow concept for very good canal condition (case 1, 
column 4 in table 4) with the actual discharge for very bad canal condition (case 5, 
column 2 in table 4) shows a reduction of 46.86% in the canal carrying capacity. 
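
The percentage reductions quoted in § 6.2 and § 6.3 follow directly from the table 4 discharges, as the short Python check below shows. The helper name is illustrative; the discharges are the case 1 and case 5 entries of table 4.

```python
def percent_reduction(reference, value):
    """Reduction of value relative to reference, in percent."""
    return 100.0 * (reference - value) / reference


# Table 4 entries (m^3/s): upstream, downstream and design discharges
case1_up, case1_down, case1_design = 5.565, 5.560, 6.728
case5_up, case5_down, case5_design = 3.575, 3.071, 3.397

gate_drop = percent_reduction(case1_up, case5_up)          # upstream discharge, case 1 to 5
tail_drop = percent_reduction(case1_down, case5_down)      # tail end discharge, case 1 to 5
capacity_drop = percent_reduction(case1_design, case5_up)  # design (case 1) vs actual (case 5)

print(round(gate_drop, 1), round(tail_drop, 1), round(capacity_drop, 2))
# prints: 35.8 44.8 46.86
```

The same entries also show that in case 5 the actual upstream discharge (3.575 m^3/s) exceeds the design value (3.397 m^3/s), as noted in the text.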


7. Conclusion 

A mathematical model is developed for the simulation of steady flow in a canal reach. 
The model accounts for canal seepage and control structures at upstream and 
downstream ends of the canal. Two iterative methods are also presented to compute 
spatially varied steady flow profiles in a canal reach. Method 2 is found to be 
computationally more efficient. The results are compared with both observed and 
computed water surface profiles available in the literature. The effects of seepage 
coefficient, canal condition and backwater on spatially varied flow in the canal are 
studied. In general, it is observed that for the range of field parameters, the effect of 
roughness on varied flow profiles and the performance of upstream control are more 
significant than seepage. However, excessive seepage may considerably change the 
nature of the water surface profile. It is observed that in general, the cumulative effects 
of seepage, canal condition and backwater significantly reduce the discharge carrying 
capacity of the canal. In some situations, it may not be possible to achieve design 
discharge in the canal. 